Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4007 |
Symbol | |
ID | 8139381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4593151 |
End bp | 4594320 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644871623 |
Product | NADH dehydrogenase I, D subunit |
Protein accession | YP_003023781 |
Protein GI | 253702592 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 2.2044e-27 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTAGCG AAATAATGAC ACTGAACATG GGGCCCCAGC ACCCCAGTAC CCACGGCGTT CTCAGGCTCG TAGTCGAGCT GGACGGCGAG GTGATACAGA AGATTACTCC CCACATCGGC TACCTGCACC GGGGCATCGA GAAGCTCTCC GAGCACCGCA CTTACCACCA GGCGCTGCCG CTTACCGACC GCATGGACTA CCTGGCCCCT ATGCACAACA ACCTGGGGTA CGTGCTGGCC GTCGAGAAGC TCCTCGGCAT CGAGGTTCCC GAGCGGGCCG AGACCATACG CGTTATCCTG GCGGAGCTTA CCCGTCTGAA GAGCCACCTG GTATGGATCG CCTGCCACGC CCTCGACATC GGCGCCATGA CCGTCTTCAT CTACGCCTTC CGCGAGCGCG AGAAGATCAT GGAACTGTAC GAGATGGTCT CCGGCGCCAG GATGACCTCG AACTACTTCC GCGTGGGTGG TCTCTCCCGA GACCTCCCTG CAGGGTTCGA GACGGCGGTT CAGGAGATTA TCGACACCTT CCCGGGTCAC TTTGACACTT ACGAGGGTCT TCTCACCAAG AACACCATCT GGCTGCAGAG GACCATCGGC AACGGGGTCA TCTCCGCGGA CGACGCCATC GACTTCGGCA TCTCCGGGCC GGCCCTCAGG GGCTCCGGCG TCGACTTCGA CCTTAGGCGC GACCTCCCCT ACTCCGGCTA CGAAAAGTAC GACTTCAAGG TGCCGGTCGG CGAGAACTGC GATACCTTCG ACCGCTACAA GGTCCGCCTG GTGGAGATGC GCGAGGCGGT GAAGATCATC GACCAGGCAA TGAAGCGCCT GAAGCCGGGA CCGATCCTGG CCGACGCGCC GCAGGTCTGC TACCCGCCGA AGGAGAGCGT CTACAACTCC ATCGAGGGGC TGATCCACCA CTTCAAGATC GCTTCCGAAG GCTTCCCGGT TCCTGAAGGG GAGGTTTACC AGGGGGTCGA GAACCCGAAA GGGGAGCTCG GCTACTACAT GGTTTCCGAC GGCGGCTCGA AGCCCTACCG CATGAGGGTG CGTCCCCCCT CATTCGTCAA CCTGGGCGCC ATCGAGAAGA TGGCCAAAGG TTCGATGATC GCGGACCTCG TGGCGGTAAT CGGAACGCTC GACATCGTTC TTGGCGAAAT AGACCGGTAA
|
Protein sequence | MASEIMTLNM GPQHPSTHGV LRLVVELDGE VIQKITPHIG YLHRGIEKLS EHRTYHQALP LTDRMDYLAP MHNNLGYVLA VEKLLGIEVP ERAETIRVIL AELTRLKSHL VWIACHALDI GAMTVFIYAF REREKIMELY EMVSGARMTS NYFRVGGLSR DLPAGFETAV QEIIDTFPGH FDTYEGLLTK NTIWLQRTIG NGVISADDAI DFGISGPALR GSGVDFDLRR DLPYSGYEKY DFKVPVGENC DTFDRYKVRL VEMREAVKII DQAMKRLKPG PILADAPQVC YPPKESVYNS IEGLIHHFKI ASEGFPVPEG EVYQGVENPK GELGYYMVSD GGSKPYRMRV RPPSFVNLGA IEKMAKGSMI ADLVAVIGTL DIVLGEIDR
|
| |