Gene Moth_2095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2095 
Symbol 
ID3832461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2185054 
End bp2186439 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content58% 
IMG OID637830020 
Productcytochrome bd ubiquinol oxidase, subunit I 
Protein accessionYP_430930 
Protein GI83590921 
COG category[C] Energy production and conversion 
COG ID[COG1271] Cytochrome bd-type quinol oxidase, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCAC TCTTGCTGGC CCGTTGGCAA TTCGGCATAA CCTCGGTTTA CCATTTCCTC 
TTCGTCCCCC TGACCCTGGG ACTTTCTATT CTGGTGGCCA TCATGGAGAC CATCTACGTC
CGTACTGGTG ATGAAACTTA CAAGAACATG GCCCGCTTCT GGGGCCGCCT TTTCCTGATT
AACTTCGCCA TGGGTGTGGT GACCGGTATC GTTCAGGAGT TTCACTTCGG CATGAACTGG
TCCGAGTACT CCCGCTTCGT CGGTGACATT TTCGGTGCCC CCCTGGCTGT GGAAGCCCTG
GCCGCCTTCT TCCTGGAATC CACCTTCCTG GGCCTGTGGA TCTTCGGCTG GGAAAAACTC
TCCCCGGCCC TTCATGCTGC CTGTATCTGG CTAGTAGCCT TTGCCTCTAA CCTTTCCGCC
TTTTGGATCC TGGTGGCCAA CTCCTTCATG CAGGAACCGG TGGGTTTTAC CTTACGTAAC
GGCCGCGCCG AGATGACGGA TTTCTTCGCC CTACTAACCA ACCCCCACGT CCTTTACCAG
TTTCCCCATA CCGTCCTGGC CGGCTTTGTG ACGGCGGCCT TTTTCGTCAT GGGGATCAGT
GCCTACCACC TGCTGCGGCA AAGCCAATTA GAGCCCTTCC GCCGTTCCTT CCGGCTGGCG
CTTATCATGG GCCTCATCGG TAGCCTACTG GTGGCGGGTA TCGGACACCT CCAGGGGCAG
CACCTGGTCG CTACCCAGCC CATGAAAATG GCGGCGGCCG AAGCCCTCTG GGACAGTGCC
GACCCGGCGC CCCTGGCCCT GGTAGCCCTG GTCGACCAGG AAAACCAGCA AAACAACCTG
GAGATTAAAA TCCCTGCCCT GACGAGTTTC CTGGCTTACA ATAGCTTCCG GGGCGAGGTG
AAGGGCCTGA AGGAACTCCA GGCCGCAGCC GCGGAGCAAT ACGGCCCGGG CAATTATATA
CCGCCGGTGG CCCCGGTATT CTGGAGCTTT CGCTTAATGA TTGCCGCCGG GCTGTGGTTG
ATTTTGCTGT CCCTGTATAG CCTGTATCTA TGGCGTAAGG GACTACTGGA AAGTAGGCCC
CTGGTCCTCA AAGCCCTGCT CTGGAGTATC CCGATCCCTT ACCTGGCTAA CACTGCCGGC
TGGTTCGTGG CGGAAGTCGG CCGTTACCCC TGGATTGTTT ACGGTTTGCA ACGGCTCGAA
GCAGCCGTCT CACCAGGGGT ATCCGCTACC GCTATCTTGA CGACCCTGGT GGCCTTTACC
CTGCTTTACG GCCTGCTGGC TGTGGTGGAT GTCTACCTCC TGGCCAAATA CGCCCGCCAG
GGTGTAGTAG AGCAGCCCCC TGCCGGTAAG ATGCATCCTT CCGGGGAGGT GTCGTTATGG
ATCTGA
 
Protein sequence
MDALLLARWQ FGITSVYHFL FVPLTLGLSI LVAIMETIYV RTGDETYKNM ARFWGRLFLI 
NFAMGVVTGI VQEFHFGMNW SEYSRFVGDI FGAPLAVEAL AAFFLESTFL GLWIFGWEKL
SPALHAACIW LVAFASNLSA FWILVANSFM QEPVGFTLRN GRAEMTDFFA LLTNPHVLYQ
FPHTVLAGFV TAAFFVMGIS AYHLLRQSQL EPFRRSFRLA LIMGLIGSLL VAGIGHLQGQ
HLVATQPMKM AAAEALWDSA DPAPLALVAL VDQENQQNNL EIKIPALTSF LAYNSFRGEV
KGLKELQAAA AEQYGPGNYI PPVAPVFWSF RLMIAAGLWL ILLSLYSLYL WRKGLLESRP
LVLKALLWSI PIPYLANTAG WFVAEVGRYP WIVYGLQRLE AAVSPGVSAT AILTTLVAFT
LLYGLLAVVD VYLLAKYARQ GVVEQPPAGK MHPSGEVSLW I