Gene Emin_1137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1137 
Symbol 
ID6263193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1234701 
End bp1236488 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content39% 
IMG OID642611617 
ProductNADH/ubiquinone/plastoquinone (complex I) 
Protein accessionYP_001876026 
Protein GI187251544 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATAT TATTACCCGT TTTTATTCCT TTAATATGCG CTTGCGGCAT TTTAATTACA 
GGCAGGAAAT ACTCTGCTTA TAATAAAGTG TGGGCGGGCG CGGGAGCGGT TGGCGCGTTA
TTGGGCGTTA TTGTTTTATT TTTAACGCCG TTTTCCATGT TTTATTTAAA CTGGTGCGAA
AGTATTCTTA CGTTTTCCCT TTCAGCGGGC GCTTTTGATA AAGCGGCCCT CGCCGCGCTT
ACTGTTTTTG CCGTTTTAAT TTATGTGTAT TCCTTAAAAA GCAATATTGA ACATGAAAGA
TGGTTCTTTT TCAGTTTTCT GTTTACATTG GCTTTTGCCT GTGGGGCAGT ATTGGCGGAT
AATCTTGTTT TAATGCTGTT TTTTTGGGAA GGACTTCTAA TAAGTTTATA TGTTTTTATA
GTTATTAATA AAAACGGGCA AAGTACCGCT TTTAAAGCTT TTATGATTAA CGTCGTGGGC
GATATTTTGC TGTTGGCGGG CATTATTTTA ACGGGTTACG CCGCAAAAAC TTTTGATTCA
AACGCTATAG TGCTTGCGCC TATAAGCATG CAAACCGGTT TGGGCATGGG CGCGTTTCTT
TTAATAATGC TTGGCGCTTT ATCAAAAGCG GGGGCTGTTC CTTTCCATTC CTGGATACCG
GAAGCCGCGG CAAATACAAA TGTTCCTATG ATGGCGTTAT TGCCCGCTTC GTTTGAAAAG
GTTTTGGCTG TGTTTTTAAT GGGGAAAGCG GTTACAATGT TTAACGTGTC CGTACTTCCC
AACGGGGCGG ACTTTTTAAT TGTGGCGGGT ATAATATCAA TACTTCCCGC GGCGTTTTTA
ATACTTAAAG AAACCGACCT TAAAAAATTT ATTGCATATA ATATTATTTT GCAAATAGGT
TTGCTTGCTT TTGAGCACGG CGCAACAATG GGCGGCGATA TCATGGCACT GCTTAACCAC
GGCATTTATA AAGCTGCCGC TTTAGCTTGT TTATTTTTGT GCGCGGGCGC AGTAGAAGAA
GCCGCCGGCA CAACGGACAC AAATAAACTG GGCGGACTGT TTAAGAATAT GAAAGTTGTT
GCGGTAAGTT TTATTATAGC TTCCGCCGCA CTTTGCAAAG TAACGTTTTT AGATATTTTC
TTTTCATCAA ATTATAACGG GCTCAACCAG CATCATATAG GTTTTATTAT TTTTATGGCC
GTTATAAACC TGCTTGTGTT ATTTTCTTTT GTAAGAGTAA TAATAAATGT GTTTTTTGTA
AAAGGAACCG TTGGCTTTAC ACGCCCGCAT TTTACAAAAA CAATTATTCC CGCGGTGTTT
GCTTTTGTGG TTTTTCTTTA CTCCATAGGC TGGGTTTTAG ATGAAAACAA TTTATTAAAT
TTAGCGCACG CGCACTTTTC TTTTACCTGG ATAAGCGGGC TTAGTTTTGT TGTAATTATT
ATGGCGGGCG GGTTGTTTTT CGCGGGGTTT AAAAAGCACG GACTTCGCTT TGCGGCGGGT
ATTGTTAACA ATGCGCCCGT TATAAAACAA ATTAATAAAT TTAACTCTTC TCATTTATCA
GACTTTTATG AGAATATTAA AAAAGCGGCC AAATTTATTT CCAAAGCGCT TTTTAAAACA
GACAGGATTA TGGATTTTAT TATAGACGAT ATTCCAAGCA CGGTATCAAA AGGATTTTCA
AAAGCGGGCT CGGCCTTACA CACGGGCTAT AGTTTTAGCT ATATAATATG GGCTGTTATA
GGCGGGCTTG TTTTTGCCAT AATCTCAATA AGCGGGGGGT GGATGTAA
 
Protein sequence
MSILLPVFIP LICACGILIT GRKYSAYNKV WAGAGAVGAL LGVIVLFLTP FSMFYLNWCE 
SILTFSLSAG AFDKAALAAL TVFAVLIYVY SLKSNIEHER WFFFSFLFTL AFACGAVLAD
NLVLMLFFWE GLLISLYVFI VINKNGQSTA FKAFMINVVG DILLLAGIIL TGYAAKTFDS
NAIVLAPISM QTGLGMGAFL LIMLGALSKA GAVPFHSWIP EAAANTNVPM MALLPASFEK
VLAVFLMGKA VTMFNVSVLP NGADFLIVAG IISILPAAFL ILKETDLKKF IAYNIILQIG
LLAFEHGATM GGDIMALLNH GIYKAAALAC LFLCAGAVEE AAGTTDTNKL GGLFKNMKVV
AVSFIIASAA LCKVTFLDIF FSSNYNGLNQ HHIGFIIFMA VINLLVLFSF VRVIINVFFV
KGTVGFTRPH FTKTIIPAVF AFVVFLYSIG WVLDENNLLN LAHAHFSFTW ISGLSFVVII
MAGGLFFAGF KKHGLRFAAG IVNNAPVIKQ INKFNSSHLS DFYENIKKAA KFISKALFKT
DRIMDFIIDD IPSTVSKGFS KAGSALHTGY SFSYIIWAVI GGLVFAIISI SGGWM