Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3998 |
Symbol | |
ID | 8139372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4581958 |
End bp | 4583565 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644871614 |
Product | proton-translocating NADH-quinone oxidoreductase, chain M |
Protein accession | YP_003023772 |
Protein GI | 253702583 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.000000000000144113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCAGC TACCGTTACT GAGCATATTA ACCTTCACAC CGCTGATCGG GGCGATTCTC CTCCTCTTTG TGAACAAGAA CAGCCACGGA GTGCTCCGTA CCGTTACCAT GGCGGTGACG GTGGTGACGT TCGTCCTCTC GCTGCCTCTG ATCACGGGGT ACAACGCGCC CGGGACCGAC ATCGGCGGCT TCCAGTTCAT CGAGAATGTG CCCTGGATCG CTGCAGGCCC CTTCCAGATG AGCTATCACC TGGGCATCGA CGGCATCAGC CTCTGGCTCG TCATCCTCAC CACATTCATC ATGCCGATCG CCATCCTCTC CACCTACACG GCGGTCGAAG AGAAGGTGAA GGAGTACATG ATCTGCCTCT TGCTGCTCGA AGTCGGCATG ATCGGCACCT TTATCTCGAT CGACCTCTTC CTCTTCTACA TCTACTGGGA AGTGATGCTG ATCCCGATGT ACTTCATGAT CGGTATCTGG GGGGGCAAGA ACAGGATCTA CGCTGCAGTC AAGTTCTTCA TCTACACCGC GGTCGGTTCG CTCCTCATGC TGGTCGCATT GATCTCCCTT TACTTCAAGG CGGGCGGCGG CGACTTCAGC ATCATCCGCT TCTGGGAGCT TAACCTCGAT CCGGCCACCC AGGTGTGGAT GTTCCTCGCC TTCGCACTGG CCTTCGCCAT CAAGGTTCCG ATGTTCCCGC TGCACACCTG GTTGCCCGAC GCACATACCG AGGCGCCGAC CGCAGGCTCC GTCATCCTGG CCGCCGTCAT GCTGAAATGC GGTACCTATG GTTACATCCG TTTCGCCATG CCGCTCTTCC CGGAAGCGAG CGCGCAGTTC ACCCCGCTCA TCGCAACCCT GTCCGTCATC GGCATCATCT ACGCCTCGCT GGTCGCGATG GTGCAGCAGG ACGTCAAGAA GCTGGTCGCC TACTCTTCCG TGGCGCATCT GGGCTTCGTC ATGCTCGGCC TCTACGCCCT CAACACCCAG GGGGTCACCG GCGGTATGCT GCAGATGCTC AACCACGGTG TTTCCACCGG CGCATTGTTC CTTATCGTCG GATTCATCTA CGAGCGCCGT CACACTCGTC AGATCTCCGA CTTCGGCGGA CTCGCCAAGC AGATGCCCGT TTTCGCCACC ATGTTCATGA TCGTCACCTT CTCCTCCATC GGCCTTCCCG GGACCAACGG TTTCGTCGGC GAGTTCCTGG TGCTCCTGGG CTCCTTCGAG AGCGAGCTCC GCTGGTACGC GATCATCGCC ACCTCCGGCG TCATCCTTTC CGCCGTCTAC ATGCTCTGGA TGTTCCAGAG GGTCATGTTC GGCGAGCTGA AGAACCCGAA AAACCAGACT CTGAAGGACC TGAACGCAAG GGAAGTAGCG ATCATGCTTC CGCTTCTGTT CCTCATCTTC TTCCTGGGCG TCTACCCGCG CCCCATCATC GACTCCATGG CTCCGTCGAT CGACAGGCTG ATCGCTCAGA CCAAGGTGCA GAAGCAGGTG GCACAAGTAG AAGCACCGGC CGCGCCGCAG CTTCCGGCAG GGCACGTAGC AGTTCCGGGC CTTCCCGAAG GGCATCCGGC TCTCCCCGCA ACCCAAGAAG TAAAATAG
|
Protein sequence | MNQLPLLSIL TFTPLIGAIL LLFVNKNSHG VLRTVTMAVT VVTFVLSLPL ITGYNAPGTD IGGFQFIENV PWIAAGPFQM SYHLGIDGIS LWLVILTTFI MPIAILSTYT AVEEKVKEYM ICLLLLEVGM IGTFISIDLF LFYIYWEVML IPMYFMIGIW GGKNRIYAAV KFFIYTAVGS LLMLVALISL YFKAGGGDFS IIRFWELNLD PATQVWMFLA FALAFAIKVP MFPLHTWLPD AHTEAPTAGS VILAAVMLKC GTYGYIRFAM PLFPEASAQF TPLIATLSVI GIIYASLVAM VQQDVKKLVA YSSVAHLGFV MLGLYALNTQ GVTGGMLQML NHGVSTGALF LIVGFIYERR HTRQISDFGG LAKQMPVFAT MFMIVTFSSI GLPGTNGFVG EFLVLLGSFE SELRWYAIIA TSGVILSAVY MLWMFQRVMF GELKNPKNQT LKDLNAREVA IMLPLLFLIF FLGVYPRPII DSMAPSIDRL IAQTKVQKQV AQVEAPAAPQ LPAGHVAVPG LPEGHPALPA TQEVK
|
| |