Gene GM21_1065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1065 
Symbol 
ID8136387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1247505 
End bp1249505 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content66% 
IMG OID644868676 
ProductNADH/Ubiquinone/plastoquinone (complex I) 
Protein accessionYP_003020884 
Protein GI253699695 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value9.82932e-25 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCATTT CCGGCGAGCC TGCGACCTGT ATCTACCTGT TGTTGCTCGC CGTCGCCTTC 
CAGGCGCTCT CCGGACTCCC ACTTCTTTTC CGGCGCGGTT CCGCCGCCGC CCAGCGCCTT
TCCGCGGCGC TTCTCATAGC TGCTTCGCTC GCCGGCGTCT GCGGCGCGCT CATCGCCCTT
TTCTTCCCCT CCAGCGCCGC AGTCACCATC GCCTCCGGCC TCCCCTTCGG CCCCTTCGAG
GCGGGTGTGG ACCCCCTTTC CGGATTCTTC CTGCTCCCTA TGTTCATCGT GACCGGTTGC
GCGGCGCTCT ACGGCGTCAG CTACTGGCCC GCCGCGCTCC ATCCCCGCAA CTGCGGCAAG
CTCACGTTTT TCCTGGGGCT TTTGGCGGCG TCCCTCACCA CGCTCTTGAT GGCTAAGAGC
ACGATTCTTT TCCTGCTCGC CTGGGAGGTT ATGGCCTTCG CGGGCTACTT CGCCCTCACC
ACCGAGGATG AAAAGCCCCA GGTGCGGGAG GCTGGGACCC TCTACCTCAT CACGGCGCAC
CTGGGCGCAC TGGCGCTCTT CGCCATGTTC TCGCTGCTCA AGGGGGAAAC CGGGGAATGG
CTCTTCCCCG CGGCCGGCGC GCTTTCGGCG CAGACAGGGC TAGCCGCCGC CATCTTTCTC
ACCGCGATCC TGGGGTTCGG GCTGAAGGCC GGCTGCATGC CGCTGCACAT CTGGCTCCCT
TCGGCGCACG CGAACGCCCC AAGCCACATC TCCGCCATCC TCTCCGGCAT CGTCTTGAAG
ACCGGCATCT ACGGCATGAT GCGAGTCTTT TCCGCCTTTT CCGATCCTCC TCTTTGGTGG
GGGGGGACCG TGCTGGTCCT GGGGCTTGTC TCCGGCGTCC TCGGGGTGGC GTTCGCCATA
GCTCAGCACG ACATCAAACG GCTCCTGGCC TATCACAGCA TCGAGAACAT CGGCATCATC
ATGATGGGGA TCGGCATAGC ACTCATAGGG GAGAGTCAGG GGAACCCGGC GCTGACTGCG
CTGGGGGTGG CGGGGGCGCT GCTGCACGTG GTGAACCACG CCCTTTTCAA GGCGCTGCTC
TTTCTTTCCG CCGGTTCGAT CATCCACGCC ACGGGGACCA GGGAGATCGA TCTCATGGGG
GGGGTGGCGC GCCGCCTCCC GTACACCGCG TTCTTCTTCC TGACGGGGGC CGTCGCCATC
TGCGGGCTCC CTCCCTTGAA CGGCTTCGTG AGCGAGCTGA TGATCTACCT TGGCTCCTTT
ACCGCCATCA GTTCCGCGGG GGGGCTGTCG GGGATGTTCC CCGCGCTCAC CGCCCCCGTG
CTGGCGCTGG TGGGGGGGCT GGCTGTGGCC TGCTTCGTCA AGGTGTTCGG GATCGCTTTC
CTGGGGGCGC CCCGCTCGGA AGAACACGCG GTGGGACACG AGGCGCCTGG GGGGATGCTG
GCGCCGATGG GGGTGTTGGC TCTTTGCTGC GCCGTGATTG GGGTCGCGCC GGGACTTTTC
GCGATCCCCC TGAACAACGC GCTCTCGTCC TACCGCTCCT CCCTCTCGGG GGAATGGATC
GAGAACCTGG TCCCGTTTAC CTGGGTCTCC ATACTGGCCG CGGGGCTTAT CGCGGCCGCC
CTGCTGGCGG CGTATCTGCT GCAGCGGCGC GCCGCGAGGC TCCCGGTCGG CTCCGGCCCG
ACCTGGGGGT GCGGCTACCT TAGACCTTCA AGCTCCATGC AATACACGGC GAGCTCCTTC
GGCGCCACCC TGGTGAGCTG GTTCAAGATC GTGCTGCGTC CCGAGCTGCA TCGCGAGGAG
GTGGCGGGGC CCTTTCCCGA GCGGGCCCGG TTTGCAAGCC ACGTCCCTGA GACCGTGCTG
GAGCGGATCT ATCTCCCCTT CCTGGAATAC CTCTTCGAAA AGGCGCAGCC GGTCCGCAGG
CTGCAGCACG GCAAGCTGAA CATCTATATC TTCTACACCT TCATCACCCT CGTGGTCCTC
CTGGCGCTGA CCAGCCGGTA G
 
Protein sequence
MFISGEPATC IYLLLLAVAF QALSGLPLLF RRGSAAAQRL SAALLIAASL AGVCGALIAL 
FFPSSAAVTI ASGLPFGPFE AGVDPLSGFF LLPMFIVTGC AALYGVSYWP AALHPRNCGK
LTFFLGLLAA SLTTLLMAKS TILFLLAWEV MAFAGYFALT TEDEKPQVRE AGTLYLITAH
LGALALFAMF SLLKGETGEW LFPAAGALSA QTGLAAAIFL TAILGFGLKA GCMPLHIWLP
SAHANAPSHI SAILSGIVLK TGIYGMMRVF SAFSDPPLWW GGTVLVLGLV SGVLGVAFAI
AQHDIKRLLA YHSIENIGII MMGIGIALIG ESQGNPALTA LGVAGALLHV VNHALFKALL
FLSAGSIIHA TGTREIDLMG GVARRLPYTA FFFLTGAVAI CGLPPLNGFV SELMIYLGSF
TAISSAGGLS GMFPALTAPV LALVGGLAVA CFVKVFGIAF LGAPRSEEHA VGHEAPGGML
APMGVLALCC AVIGVAPGLF AIPLNNALSS YRSSLSGEWI ENLVPFTWVS ILAAGLIAAA
LLAAYLLQRR AARLPVGSGP TWGCGYLRPS SSMQYTASSF GATLVSWFKI VLRPELHREE
VAGPFPERAR FASHVPETVL ERIYLPFLEY LFEKAQPVRR LQHGKLNIYI FYTFITLVVL
LALTSR