Gene GM21_1073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1073 
Symbol 
ID8136395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1257798 
End bp1259309 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content67% 
IMG OID644868684 
ProductNADH dehydrogenase (ubiquinone) 30 kDa subunit 
Protein accessionYP_003020892 
Protein GI253699703 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.1537600000000002e-34 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACGCCTG GGCTCCTCTT CACTCAAAAC GGAAGCGCCG TCAACCGCGA GGAAATACCG 
GAGCACTCCG GCGACCGGTT CTGCGAGACG CTCATCTCCG CCGTCGACGG GGGATGGCGG
GCGGTTTCCT ACTTCGGCGC CGCCGAGGCT GACGCGGTGC GGCTTTACTG CATCCTCTCC
TTCAAAAGCC ACGCGGCGCT CGGCATCATG AGCACCGCCG TCACCGGCAA GAGCTTTCCC
TCGCTCGTCC TGGCCGCCCC GCAACTGCAC CTGTTCGAGC GAGAGATCGC CGAGCAGTTC
GGCATCAGCT TCGAGGGGCA TCCGTGGCTG AAGCCGGTCC GGTTCGAGGC CCCGTTCGCA
GCGCCCGTTG GGGAGACGCC CAAGCCTGCC GGGAAAATCG GGGTCATGGA CTTTTACCGG
GTGGCGGGAG ACGAGGTGCA CGAGGTCGCG GTGGGGCCGG TGCATGCGGG GATCATCGAG
CCGGGGCATT TCCGCTTCCA GTGCTTCGGC GAGGAGGTGA TGCACCTGGA GATCTCGCTC
GGCTACCAGC ACCGCGCCGT CGAGCGCATG GTGCTCGGGC GCCCCGGGCT GCGCACCCTT
AAATGCATGG AGACCGTCGC CGGGGACACC ACCATCGGCC ACGGCACCGC CTACGCCATG
GTGGTCGAGG CCCTTTCCAA GGCGCGGGTA CCGGCCAAGG CCGAGGCGAT ACGAGGCATA
GCCCTTGAGC TGGAGCGCCT GGCGAACCAT ACCGGCGACC TCGGGGCCAT AGCTGGCGAC
GTCGGCTACC TTCCCACCGC TTCCTTCTGC GGCAGGATCC GCGGCGACTT CCTGAACATG
AGCGCCGAGC TTTGCGGCAG CCGCTTCGGG CGCGGCCTAT TGACCCCGGG CGGGGTCCAG
TTCGACGTCG GGAGGGAACT GGCGGAGAAA CTCAGGAAGA GGATCGACGT GGCGCGCCGC
GAGGTGACGA ACGCGGTCGA GCTCCTCTGG GACAGCCCCT CTGTGATGGG GAGGCTGGAA
GGGACCGGGG TGGTGAGCGA AAAGGACGCG CTCGATCTCG GACTCGTGGG GCCGGCTGCG
CGGGCGAGCG GGCTCAACCG CGACATCCGC CGGGACCACC CCTTCGGCAT CTACAACGTG
ACGCAGCTCC CGGTTGAGAC GGCCAAGGGG GGGGACGTCT ACGCCCGCAC CCTGGTGCGC
TGGCTGGAGA TAGAGAAGTC GCTCCATTTC ATAGAGGAGC AGTTGGCGCA GCTCCCGGGA
GGGAGCATCG TCGCGCCGGC GACACAGGTC GGTGAAAACC GGATGGCGCT GGCCCTGGTG
GAGGGGTGGC GCGGCGAGCT CTGCCATGTC GCCCTGACCG ACGGGAACGG CGAATTCCGG
CGCTACAAGA TCACCGACCC CTCCTTCCAC AACTGGAGCG GTCTCGCCAT GGCGCTGCGC
GGGGGGCAGA TCTCCGACTT CCCGCTTTGC AACAAGAGCT TCAACCTCTC TTACTGCGGG
TTCGACCTTT AG
 
Protein sequence
MTPGLLFTQN GSAVNREEIP EHSGDRFCET LISAVDGGWR AVSYFGAAEA DAVRLYCILS 
FKSHAALGIM STAVTGKSFP SLVLAAPQLH LFEREIAEQF GISFEGHPWL KPVRFEAPFA
APVGETPKPA GKIGVMDFYR VAGDEVHEVA VGPVHAGIIE PGHFRFQCFG EEVMHLEISL
GYQHRAVERM VLGRPGLRTL KCMETVAGDT TIGHGTAYAM VVEALSKARV PAKAEAIRGI
ALELERLANH TGDLGAIAGD VGYLPTASFC GRIRGDFLNM SAELCGSRFG RGLLTPGGVQ
FDVGRELAEK LRKRIDVARR EVTNAVELLW DSPSVMGRLE GTGVVSEKDA LDLGLVGPAA
RASGLNRDIR RDHPFGIYNV TQLPVETAKG GDVYARTLVR WLEIEKSLHF IEEQLAQLPG
GSIVAPATQV GENRMALALV EGWRGELCHV ALTDGNGEFR RYKITDPSFH NWSGLAMALR
GGQISDFPLC NKSFNLSYCG FDL