Gene Moth_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2059 
Symbol 
ID3831090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2151461 
End bp2152981 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content51% 
IMG OID637829988 
ProductABC transporter related 
Protein accessionYP_430898 
Protein GI83590889 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0776584 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGG CAGGCATCCA AGTTGAGTCT TTGAGCCTTT TTTATCCCGG GAACCCCAGA 
CCGGCTTTGC AAGGTGTCAA TTTAACCGTA TATCAGGGGG AGATAGCCTT TTTAGTTGGC
GGCAATTTAA GCGGCAAGAC CTCCCTCCTG CGGTGCCTTG CTGGTTTAAT TCCCGGAGTG
CTGCCCGGTA AATGGCGGGG CCGGATCCTG GTGGCCAACA AAAGCCTGAA CGCGGAGGGC
AACGAGCGTG CGCCTGCAGG CGTGATCCTC CAGAATTCCG ATTTATATCT CTTGCCTCGG
GTATACGACG AATTGGCGCT CCCTTTGGTA AATTCCGGGT TGACATCCCG GGAGGTCGAA
AAGAGGATCG TACTCCTGGC GGAAGAACTA GGAATATGTC ATCTACTGAA CAGGCAGATG
AGCAGCCTGT CCGGAGGCGA GCGTCAAAAG GTGGCTTTTG CCGCCGCCAT TGCGGTTGAT
TATCCAGTTA TACTGTGCGA TGAGCCTTTT GAACAGGCGG ATGCGGAAGC GGCTGAGGCG
ATGCTTTCCC TTTTAAGAAG GAAAGCAGCT TCAGGGGGTA CGGTTCTCAT TGCAACTCGC
TATTTTGAGT ATGCCCGTTA TTTTGCTGAC CGGATCATAC TAATCAGGGA CGGGGCCGTG
ATTGCGCAGG ACAGGTCCGG CAATGCTGAA AAAATCGCTG GAATGGTGCC GGAGTGTCGA
ATCAGCTGGC GGCAGCGATC TCAGGCTCCT ATTTTGACCG TCTCCGGTTC CAGGGAGCTC
TTTTTTGAAG GCGTCACGCA TCTTTTTGAC AGCGGGCATG GTATCAAAAA TGTTTCTTTA
GAAGTCTGTA AGGGCGAAGT AGTGGCAATT ATGGGCCCCA ACGGGTCGGG CAAAACTACT
CTGCTCAAGC ATACGGTCGG CCTTTTAAGA CCCCAGGAGG GGCGGGTTTG GTTGCGCGAG
ACAGAAACCT CGCGGCTACC CGTAGGCCTG CTGGCTCAGA AAATCGGCAT GTTATTTCAA
AATCCCGACG ACCAGATATT TAATGAAAGA GTGGACCGCG AAATAGCCTG TAGCTTAAGG
GCGCGTGGCG CACGGTGGAA TGAAGCTCTC AAGGAAGCAG CCCATTGGCT TGCCAAAATT
GGGCAGGCCC AAATAGCCGC TAGCCACCCG CATTCCTTAC CTTATTCGCT GCGGCAAATA
GTATCCTTAG TGTCGGTTTT GATCAACCGG CCAGAGATAA TAGTGCTTGA TGAACCCTTC
AAATCCCTGG ACTATCGGAA TGTAGAAATA CTCATGTCAA TTATACTTGA GCTACGGCAA
GAAAAGGACA ACCCCATAAT CCTAGTGGCT CATGATCCAA CTGTAACCCT GCTTTACGCC
AATCGAGTAG CTTTCTTTGA TCATGGCGAA ATAGTTCTGC AGGGGGTCCC GCAAGATATT
TTCTTTTCTA CCCAGTTCCG GAATTTAAGC TTGAGCAGAC ACCCGTTTAT AAGGGGGCTC
CTAAATAACA ATAACCACTA A
 
Protein sequence
MKEAGIQVES LSLFYPGNPR PALQGVNLTV YQGEIAFLVG GNLSGKTSLL RCLAGLIPGV 
LPGKWRGRIL VANKSLNAEG NERAPAGVIL QNSDLYLLPR VYDELALPLV NSGLTSREVE
KRIVLLAEEL GICHLLNRQM SSLSGGERQK VAFAAAIAVD YPVILCDEPF EQADAEAAEA
MLSLLRRKAA SGGTVLIATR YFEYARYFAD RIILIRDGAV IAQDRSGNAE KIAGMVPECR
ISWRQRSQAP ILTVSGSREL FFEGVTHLFD SGHGIKNVSL EVCKGEVVAI MGPNGSGKTT
LLKHTVGLLR PQEGRVWLRE TETSRLPVGL LAQKIGMLFQ NPDDQIFNER VDREIACSLR
ARGARWNEAL KEAAHWLAKI GQAQIAASHP HSLPYSLRQI VSLVSVLINR PEIIVLDEPF
KSLDYRNVEI LMSIILELRQ EKDNPIILVA HDPTVTLLYA NRVAFFDHGE IVLQGVPQDI
FFSTQFRNLS LSRHPFIRGL LNNNNH