Gene GM21_4007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4007 
Symbol 
ID8139381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4593151 
End bp4594320 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content61% 
IMG OID644871623 
ProductNADH dehydrogenase I, D subunit 
Protein accessionYP_003023781 
Protein GI253702592 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.2044e-27 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTAGCG AAATAATGAC ACTGAACATG GGGCCCCAGC ACCCCAGTAC CCACGGCGTT 
CTCAGGCTCG TAGTCGAGCT GGACGGCGAG GTGATACAGA AGATTACTCC CCACATCGGC
TACCTGCACC GGGGCATCGA GAAGCTCTCC GAGCACCGCA CTTACCACCA GGCGCTGCCG
CTTACCGACC GCATGGACTA CCTGGCCCCT ATGCACAACA ACCTGGGGTA CGTGCTGGCC
GTCGAGAAGC TCCTCGGCAT CGAGGTTCCC GAGCGGGCCG AGACCATACG CGTTATCCTG
GCGGAGCTTA CCCGTCTGAA GAGCCACCTG GTATGGATCG CCTGCCACGC CCTCGACATC
GGCGCCATGA CCGTCTTCAT CTACGCCTTC CGCGAGCGCG AGAAGATCAT GGAACTGTAC
GAGATGGTCT CCGGCGCCAG GATGACCTCG AACTACTTCC GCGTGGGTGG TCTCTCCCGA
GACCTCCCTG CAGGGTTCGA GACGGCGGTT CAGGAGATTA TCGACACCTT CCCGGGTCAC
TTTGACACTT ACGAGGGTCT TCTCACCAAG AACACCATCT GGCTGCAGAG GACCATCGGC
AACGGGGTCA TCTCCGCGGA CGACGCCATC GACTTCGGCA TCTCCGGGCC GGCCCTCAGG
GGCTCCGGCG TCGACTTCGA CCTTAGGCGC GACCTCCCCT ACTCCGGCTA CGAAAAGTAC
GACTTCAAGG TGCCGGTCGG CGAGAACTGC GATACCTTCG ACCGCTACAA GGTCCGCCTG
GTGGAGATGC GCGAGGCGGT GAAGATCATC GACCAGGCAA TGAAGCGCCT GAAGCCGGGA
CCGATCCTGG CCGACGCGCC GCAGGTCTGC TACCCGCCGA AGGAGAGCGT CTACAACTCC
ATCGAGGGGC TGATCCACCA CTTCAAGATC GCTTCCGAAG GCTTCCCGGT TCCTGAAGGG
GAGGTTTACC AGGGGGTCGA GAACCCGAAA GGGGAGCTCG GCTACTACAT GGTTTCCGAC
GGCGGCTCGA AGCCCTACCG CATGAGGGTG CGTCCCCCCT CATTCGTCAA CCTGGGCGCC
ATCGAGAAGA TGGCCAAAGG TTCGATGATC GCGGACCTCG TGGCGGTAAT CGGAACGCTC
GACATCGTTC TTGGCGAAAT AGACCGGTAA
 
Protein sequence
MASEIMTLNM GPQHPSTHGV LRLVVELDGE VIQKITPHIG YLHRGIEKLS EHRTYHQALP 
LTDRMDYLAP MHNNLGYVLA VEKLLGIEVP ERAETIRVIL AELTRLKSHL VWIACHALDI
GAMTVFIYAF REREKIMELY EMVSGARMTS NYFRVGGLSR DLPAGFETAV QEIIDTFPGH
FDTYEGLLTK NTIWLQRTIG NGVISADDAI DFGISGPALR GSGVDFDLRR DLPYSGYEKY
DFKVPVGENC DTFDRYKVRL VEMREAVKII DQAMKRLKPG PILADAPQVC YPPKESVYNS
IEGLIHHFKI ASEGFPVPEG EVYQGVENPK GELGYYMVSD GGSKPYRMRV RPPSFVNLGA
IEKMAKGSMI ADLVAVIGTL DIVLGEIDR