Gene GM21_3998 details


Gene Information

Locus tag: GM21_3998
Symbol:
ID: 8139372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4581958
End bp: 4583565
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 59%
IMG OID: 644871614
Product: proton-translocating NADH-quinone oxidoreductase, chain M
Protein accession: YP_003023772
Protein GI: 253702583
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
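
The coordinate and length fields above are related by simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence includes its stop codon; both assumptions are consistent with the values reported in this record. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4581958, 4583565)
print(length)                     # 1608
print(protein_length_aa(length))  # 535
```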


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.000000000000144113
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCAGC TACCGTTACT GAGCATATTA ACCTTCACAC CGCTGATCGG GGCGATTCTC 
CTCCTCTTTG TGAACAAGAA CAGCCACGGA GTGCTCCGTA CCGTTACCAT GGCGGTGACG
GTGGTGACGT TCGTCCTCTC GCTGCCTCTG ATCACGGGGT ACAACGCGCC CGGGACCGAC
ATCGGCGGCT TCCAGTTCAT CGAGAATGTG CCCTGGATCG CTGCAGGCCC CTTCCAGATG
AGCTATCACC TGGGCATCGA CGGCATCAGC CTCTGGCTCG TCATCCTCAC CACATTCATC
ATGCCGATCG CCATCCTCTC CACCTACACG GCGGTCGAAG AGAAGGTGAA GGAGTACATG
ATCTGCCTCT TGCTGCTCGA AGTCGGCATG ATCGGCACCT TTATCTCGAT CGACCTCTTC
CTCTTCTACA TCTACTGGGA AGTGATGCTG ATCCCGATGT ACTTCATGAT CGGTATCTGG
GGGGGCAAGA ACAGGATCTA CGCTGCAGTC AAGTTCTTCA TCTACACCGC GGTCGGTTCG
CTCCTCATGC TGGTCGCATT GATCTCCCTT TACTTCAAGG CGGGCGGCGG CGACTTCAGC
ATCATCCGCT TCTGGGAGCT TAACCTCGAT CCGGCCACCC AGGTGTGGAT GTTCCTCGCC
TTCGCACTGG CCTTCGCCAT CAAGGTTCCG ATGTTCCCGC TGCACACCTG GTTGCCCGAC
GCACATACCG AGGCGCCGAC CGCAGGCTCC GTCATCCTGG CCGCCGTCAT GCTGAAATGC
GGTACCTATG GTTACATCCG TTTCGCCATG CCGCTCTTCC CGGAAGCGAG CGCGCAGTTC
ACCCCGCTCA TCGCAACCCT GTCCGTCATC GGCATCATCT ACGCCTCGCT GGTCGCGATG
GTGCAGCAGG ACGTCAAGAA GCTGGTCGCC TACTCTTCCG TGGCGCATCT GGGCTTCGTC
ATGCTCGGCC TCTACGCCCT CAACACCCAG GGGGTCACCG GCGGTATGCT GCAGATGCTC
AACCACGGTG TTTCCACCGG CGCATTGTTC CTTATCGTCG GATTCATCTA CGAGCGCCGT
CACACTCGTC AGATCTCCGA CTTCGGCGGA CTCGCCAAGC AGATGCCCGT TTTCGCCACC
ATGTTCATGA TCGTCACCTT CTCCTCCATC GGCCTTCCCG GGACCAACGG TTTCGTCGGC
GAGTTCCTGG TGCTCCTGGG CTCCTTCGAG AGCGAGCTCC GCTGGTACGC GATCATCGCC
ACCTCCGGCG TCATCCTTTC CGCCGTCTAC ATGCTCTGGA TGTTCCAGAG GGTCATGTTC
GGCGAGCTGA AGAACCCGAA AAACCAGACT CTGAAGGACC TGAACGCAAG GGAAGTAGCG
ATCATGCTTC CGCTTCTGTT CCTCATCTTC TTCCTGGGCG TCTACCCGCG CCCCATCATC
GACTCCATGG CTCCGTCGAT CGACAGGCTG ATCGCTCAGA CCAAGGTGCA GAAGCAGGTG
GCACAAGTAG AAGCACCGGC CGCGCCGCAG CTTCCGGCAG GGCACGTAGC AGTTCCGGGC
CTTCCCGAAG GGCATCCGGC TCTCCCCGCA ACCCAAGAAG TAAAATAG
 
Protein sequence
MNQLPLLSIL TFTPLIGAIL LLFVNKNSHG VLRTVTMAVT VVTFVLSLPL ITGYNAPGTD 
IGGFQFIENV PWIAAGPFQM SYHLGIDGIS LWLVILTTFI MPIAILSTYT AVEEKVKEYM
ICLLLLEVGM IGTFISIDLF LFYIYWEVML IPMYFMIGIW GGKNRIYAAV KFFIYTAVGS
LLMLVALISL YFKAGGGDFS IIRFWELNLD PATQVWMFLA FALAFAIKVP MFPLHTWLPD
AHTEAPTAGS VILAAVMLKC GTYGYIRFAM PLFPEASAQF TPLIATLSVI GIIYASLVAM
VQQDVKKLVA YSSVAHLGFV MLGLYALNTQ GVTGGMLQML NHGVSTGALF LIVGFIYERR
HTRQISDFGG LAKQMPVFAT MFMIVTFSSI GLPGTNGFVG EFLVLLGSFE SELRWYAIIA
TSGVILSAVY MLWMFQRVMF GELKNPKNQT LKDLNAREVA IMLPLLFLIF FLGVYPRPII
DSMAPSIDRL IAQTKVQKQV AQVEAPAAPQ LPAGHVAVPG LPEGHPALPA TQEVK
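
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, so the standard table is used here. The sketch ignores table 11's alternative start codons, which does not matter for this gene because it begins with ATG. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGAACCAGC TACCGTTACT GAGCATATTA ACCTTCACAC CGCTGATCGG GGCGATTCTC"
print(translate(clean(first_row)))  # MNQLPLLSILTFTPLIGAIL
```

Passing the full gene sequence through `clean` and `translate` and comparing the result with the cleaned protein sequence extends the same check to the whole record.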