Gene Moth_2330 details

Gene Information

Locus tag: Moth_2330
Symbol:
ID: 3831082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2449812
End bp: 2450951
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 56%
IMG OID: 637830254
Product: hypothetical protein
Protein accession: YP_431160
Protein GI: 83591151
COG category: [V] Defense mechanisms
COG ID: [COG0842] ABC-type multidrug transport system, permease component
TIGRFAM ID:
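
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information record.
span = inclusive_span(2449812, 2450951)
print(span)                           # 1140
print(expected_protein_length(span))  # 379
```

Both results agree with the reported Gene Length (1140 bp) and Protein Length (379 aa).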


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAAC AATGCCTGGC AATTATGCGG CGCGAAGTGT TTTACCTCTG GCGCGATAAG 
GGTTTGCGCC ATATCTTACT TTTCGGCTCT ATATTGGGGC TGCTGCTGTT TTACGCCATC
TACAGCGCCC AGGTATTAAA GGATATCCCT ACTGCAGTCG TCGACCTGGA CAACTCCGGC
GCCAGCCGCG AACTGGTGGA CAAGATAGGC AAGGCGGAGT ATTTAAAGCT GGTCGCCTCG
GTCGCGAGTT ACGACGACCT GCAGGAGTTA ATCAAGCAGG GAAAAGCCGT CGTGGGCGTC
GTCATCCCCG AGAACTTCGC CAGGGACGTG GCCCTCGGCC GGCAGACACG AGTATTGGCG
GTTATTGACG GCAGCAACAT GATCTATGCC ACCAACGCCT CCGCCGCCTT GCTTACCGTC
ACCCGCACCA TCAGCGCCCA GGCCGGCGTC AGCGCCCTGG TGGCCCGGGG AGTTCAATTG
CAACAAGCTA AAGAAGCCTA TCAGGCCATC GATTTCAGCG AGGAACCGTG GTTCAACCCG
GCCCTCAACT ACGCCTACTT CCTCGTCCTG GCCCTGGCTT TAAACATCTG GCAGCAGTGC
TGCACCCTGG CAGCGTGCCT GAACGTCATC GGCGAACGGG GTATGAAGAG CTGGTTGCAA
ATCAAGGCCA GCGGCATTTC CAAATTTCGA TTTTTTGCCA GCAAATCGAT AGCCCAGGTT
TTTATCTTCA TGGCCATCGT TTTGCCCTTG TATATCCTGG CCTTCGGCGT CTTTAAGCTG
CCCCTTAACT GTAGCTGGCC GTCCTTTCTC CTCTTCACCC TGGCTTTCGC CATAGCCATT
CACAGCATCG GCACCCTGGC GTCCAGTTTC GCCCGCAACG CCGTGGACGC CGCCAGGTTC
GGCATGATTA TCGCCCTGCC CTCCTTTGTA CTGTCAGGCT ACACCTGGCC CCTGGAGGCC
ATGCCCTATT ACCTTCAGCG GATCGCCAGG ATACTGCCCC AGACGTGGTT CTTCCAGGGG
TTAAACTACT TCGCCTTCAA AAACGCCGGT TGGAATTTGA TGTCCCATTA TATACTAGCC
ATGCTGGCCG TAGCCGCCGT ATGCTATGGA GCAGCGGCTA TCTTTATCGC GCGGAGTTAG
 
Protein sequence
MLKQCLAIMR REVFYLWRDK GLRHILLFGS ILGLLLFYAI YSAQVLKDIP TAVVDLDNSG 
ASRELVDKIG KAEYLKLVAS VASYDDLQEL IKQGKAVVGV VIPENFARDV ALGRQTRVLA
VIDGSNMIYA TNASAALLTV TRTISAQAGV SALVARGVQL QQAKEAYQAI DFSEEPWFNP
ALNYAYFLVL ALALNIWQQC CTLAACLNVI GERGMKSWLQ IKASGISKFR FFASKSIAQV
FIFMAIVLPL YILAFGVFKL PLNCSWPSFL LFTLAFAIAI HSIGTLASSF ARNAVDAARF
GMIIALPSFV LSGYTWPLEA MPYYLQRIAR ILPQTWFFQG LNYFAFKNAG WNLMSHYILA
MLAVAAVCYG AAAIFIARS
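
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative sketch, not the database's own pipeline: the helper names are hypothetical, the codon table is built from the NCBI amino acid string for the standard code (which translation table 11 shares for internal codons), and alternative start codons are not given special handling because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# NCBI amino acid string for codons ordered TTT, TTC, TTA, TTG, TCT, ...
# Table 11 uses the same assignments as the standard code; only the set of
# permitted start codons differs.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCTTAAAC AATGCCTGGC AATTATGCGG"))  # MLKQCLAIMR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives the value that the GC content field rounds to a whole percent.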