Gene Moth_2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2331 
Symbol 
ID3831083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2450965 
End bp2452164 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content55% 
IMG OID637830255 
Producthypothetical protein 
Protein accessionYP_431161 
Protein GI83591152 
COG category[V] Defense mechanisms 
COG ID[COG0842] ABC-type multidrug transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCAGC TCCTCAACAT CGCCCACTAC GAAATGCTCC ATATTTTTAA AGAGAAAATC 
CTTTTTTTAA TGGTTTTCCT GGTCCCCCTC GGCTACGCCG CCCTTTTCGG CGCCGCCTAT
GTCACCGCCG TCCTTAACAA CGTGCCTATA GCCATCGTCG ACCTGGATGA CTCAAAACTC
AGCCGGGAAA TCGCATCCGC CTTTGCCAAC AGCCCCCACT TCAAGGTGGT GGACGACATA
AAGACCTATC CTGAACTTCA GGAGGGGATG AAAAACGGCA GGGTGCGGGC CGGCGTGGTC
ATCCCGGAAC ACTTTGAGCA GAAGTTGGCC CGGCACGAAT TGACCCGGGT ACTGACCGTT
TACGACGGGT CCAATTTGAT CTGGGGTTAC AATACCCGCA AATACATCCG CGAGGTATTT
AACGAGTTCA GCGCCAGCAG CACGGCCTCC TACCTGGCAG GTATGGGCTA TACTAAAAAC
GAGATCCGTT CCATTATGGA CACGGTTTCC CTGAACACCG AGGTCTGGTA CAACCCCACT
TTCAGCTACA CCAATTTTCT CTTCCTGGGA CTGATCATGA TGATCTTGCA CCAGATCGGC
CTCCTCAGCG TCAGCCTGAC CGTAACCCGG GAGAAAGAGC GCAAAACCTG GCTGCAGTTT
TTGAGCGCGC CCGTACCGGC ATGGAAAATC TTTACCGGTA AGGCTATCCC GTATTTCACC
GCCAACTTCT TTAATTACGC CCTCTTGCTC TGGTTCGCCT CCCGCTTCGT CCACGTGAAG
ATCGGCGGCT CCCTGGGTCT AATCCTTGTG CTCGGCCTTC TCTACGATCT AGTCATCACC
GGTGCCGGTT TCTTAATTTC CCTCCACGCA TCCAACTCCC TGCAGGTTAC CAGGTACGTG
ATGCTTTTGT CCGTACCCTT CTTTATGATT TCCGGTTATA CCTGGCCCGG AACCCATATA
CCGGTTTTTA TCAATTACCT GGCGCGGTTG CTGCCCTCCA CCTGGATGGT TCTGGGCTTC
CGGCAGGTCG CGCTAAAGGA GCTTGATATG AGCTATATGC TGCCCTACAT CCGGGCCCTG
GGCCTGATGG CCGTCCTGGC GCTATTGCCG GCCGTAACCT TTGCCAAGCG GCTCAGGCCG
CGCCCGCAAG GCGGCCCGGT GATAAACAAC GGGCCCTCGT ATCCGGCCCG CTGGAAATAA
 
Protein sequence
MRQLLNIAHY EMLHIFKEKI LFLMVFLVPL GYAALFGAAY VTAVLNNVPI AIVDLDDSKL 
SREIASAFAN SPHFKVVDDI KTYPELQEGM KNGRVRAGVV IPEHFEQKLA RHELTRVLTV
YDGSNLIWGY NTRKYIREVF NEFSASSTAS YLAGMGYTKN EIRSIMDTVS LNTEVWYNPT
FSYTNFLFLG LIMMILHQIG LLSVSLTVTR EKERKTWLQF LSAPVPAWKI FTGKAIPYFT
ANFFNYALLL WFASRFVHVK IGGSLGLILV LGLLYDLVIT GAGFLISLHA SNSLQVTRYV
MLLSVPFFMI SGYTWPGTHI PVFINYLARL LPSTWMVLGF RQVALKELDM SYMLPYIRAL
GLMAVLALLP AVTFAKRLRP RPQGGPVINN GPSYPARWK