Gene Moth_0874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0874 
Symbol 
ID3831512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp904089 
End bp905027 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content61% 
IMG OID637828804 
ProductABC transporter related 
Protein accessionYP_429734 
Protein GI83589725 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID[TIGR01288] ATP-binding ABC transporter family nodulation protein NodI 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.468306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTGGAGGA TTAGGCCATT GACCGAGGTT GTTCGGGCTC GGGGCCTGGT AAAAAGGTAT 
AACGGCTTCC CGGCTGTCAA GGGAATAGAC TTCGACGTGC GGCCAGGTGA GTGTTTTGGC
TTCCTGGGTC CTAACGGCGC CGGGAAGACT ACCACCATTA AAATGATCCA CTGTTTCACC
CCGGTGAGCG ACGGTACCCT GGAGGTGCTG GGGTATGACG TCCGCAGGCA ACCCCGCCAG
ATCAAGGCCC GGCTGGGGGT GGTGCCCCAG GAGGACAACC TGGACCCGGA GCTGACGGTA
GTAGAAAACC TGCTCCTCTA CGCCAGCTAT TTCGACCTTC CCCGTGGAGT CGCCCGGGAG
CGGGTAGGGG AACTCCTGGT CTTCGCCAAC CTGGAGGATA AAGCAGGGGT AGAAGTCGAA
CACCTCTCCG GGGGGATGAA GAGGCGCCTG GCCATCGCCA GGGGGCTCAT CAACAACCCC
GGGCTCCTCA TCCTGGATGA GCCAACCACC GGCCTGGACC CGGAGGCCCG GCATCTGGTG
TGGGAGAAGA TGCGTCAGTT GAAGGCCGGT GGGGTGACCC TGATCCTGAC CACCCATTAT
CTGGAAGAAG CGGCCCAGCT CTGTGACCGC CTAGTGATCA TGGATCACGG GATAATTCTG
GAAGAAGGCT CGCCGCAGGA GCTGGTAGAA CGCCACGTCG GTCGGGAGGT CCTGGAACTT
AGCCCCGTCG ATGGCCGCGG CCGGGAGATC CTGAACCTGG TAGATGGCAT GATCCTGGCT
TCCCAGTTTA TAGGCCGGAC CCTTTACCTG TACACCACCA GGGGGCGGGA GGTCTGGCGC
CGGATCCAGG ATGGTAACGG TCACTTCAGC CACCAGGTGT TGCGGCCGGC CACCCTGGAA
GACGTCTTCC TGAAACTTAC CGGCAGGAAT CTTTCCTAG
 
Protein sequence
MWRIRPLTEV VRARGLVKRY NGFPAVKGID FDVRPGECFG FLGPNGAGKT TTIKMIHCFT 
PVSDGTLEVL GYDVRRQPRQ IKARLGVVPQ EDNLDPELTV VENLLLYASY FDLPRGVARE
RVGELLVFAN LEDKAGVEVE HLSGGMKRRL AIARGLINNP GLLILDEPTT GLDPEARHLV
WEKMRQLKAG GVTLILTTHY LEEAAQLCDR LVIMDHGIIL EEGSPQELVE RHVGREVLEL
SPVDGRGREI LNLVDGMILA SQFIGRTLYL YTTRGREVWR RIQDGNGHFS HQVLRPATLE
DVFLKLTGRN LS