Gene Moth_2101 details

Gene Information

Locus tag: Moth_2101
Symbol:
ID: 3832467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2193384
End bp: 2194565
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 53%
IMG OID: 637830026
Product: CdaR family transcriptional regulator
Protein accession: YP_430936
Protein GI: 83590927
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3835] Sugar diacid utilization regulator
TIGRFAM ID:
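
The length fields in this record follow from the coordinates. The snippet below is a minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based, inclusive positions (an assumption, though it agrees with the listed gene length) and that the last codon of the CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record.
start_bp = 2193384
end_bp = 2194565

# Assumption: both positions are 1-based and inclusive,
# so the span length is the difference plus one.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the two values reproduce the Gene Length and Protein Length fields.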


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTTAAACC AGGAATTAAT CCAGCCCCTG CTGGAACGGG CCGCAGCCCT CTGGCAAACA 
GCCATTGACG TCGTTGACCC TGGGGGTCAG GTAGTAGCCA GCTCGGAACC TTCACGGTTG
CATTTTTACC ATCCGGAGGT GCTGGGACTT TCACTTACGG AAGATGGAGT TCGCGTTCAA
GGTGAAAGTT ACTATCTTCC CCTGGTAATA AATGACCGCG TGGCCGGCTG GCTGGTATGG
CAGGGGGGTG TTAAGGGCGA AGTTGCCACC CTGTTGCAGG CCGCTCTGGA GGAAATCTTG
GCGGCGGCCA GGCGGCGGGA TCAGCAGTAT TTCACCGCCC GGGAGGAAGA AGCCCTGGTT
GCCGATCTTC TGGCCCGGGA GGCGGCCTCC CGGGTTGAGG AATTAAAGGT AAGGGCGCTG
AATTCGGGTT ATGACTTGAA CCTGCCCCGG GCGGTTATAG TATTACAATT GGAACCCAGG
GAAAACCGCT ACTTCAACAT TAATTTGCAC CTGGGGTATG ATGTTACAGA GGAGCAGTTA
AAGGAGGAAA TCCTGGAGCA AATAAAGGGC GATCTTTACC TGACCGCCCA GGATCTAGTG
GCTTATTATG GCAGAGAGAT GCTGGTCATT TTTAAGGCTT TTTTAGAAGT AGAGAATATG
GGCCGCCTCT ACCAGGCCCT GGAGGTTATC TGTCGGCACC TGTATGAGTT GATCAAAGAT
AACCGTCTCT TTGCTGTCCG GGTGGCCTAC GGGAGCATTG TCACGGAAAT CGCTGGTTTA
AGGCAGTCCT ATAAGGAGGC AGCCGAATTG ATTGCCCTGG GCCATTCCTG CGGTCATAAT
AGCGGCTATA TTGATTTCGA AGACATTCTT TTTGAGGCCC TGGTGTGTTC CCTGCCGGGG
CGGTTGAGGG GTAGATATCT GGAACCCCTT TATCAGAAAA TTGTAGCGGC TGGCGAAGAA
GGTATCCAGT TGCTGGAGAC GGTGGAAGCC TGCATCGACA ACAATATGAA TATCAAGCAG
ACGGCTGAGC AACTATACCT GCACCGCAAC ACTGTAACCA ATCGCCTGGA GAGAATCAAG
CTCCTGACGG GCCTGGACCC GGGAACGGGT TTTCGCGCCC TTTTCTGGCT GAAAATGCTG
GCCGTCTACA GGAGATTGGT GCAGACACGG GATGAGGCGT AA
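
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before anything is computed from it. The following is a rough sketch of how a GC fraction could be derived from such text; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself computes or rounds its GC content field is not stated in the record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Assumes a non-empty sequence containing only A, C, G and T.
    """
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative input, not the full gene.
example = "ATGC GGCC"
print(clean_sequence(example))
print(gc_fraction(example))
```

Pasting the complete gene sequence into `gc_fraction` gives a value that can be compared with the GC content field of the record.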
 
Protein sequence
MLNQELIQPL LERAAALWQT AIDVVDPGGQ VVASSEPSRL HFYHPEVLGL SLTEDGVRVQ 
GESYYLPLVI NDRVAGWLVW QGGVKGEVAT LLQAALEEIL AAARRRDQQY FTAREEEALV
ADLLAREAAS RVEELKVRAL NSGYDLNLPR AVIVLQLEPR ENRYFNINLH LGYDVTEEQL
KEEILEQIKG DLYLTAQDLV AYYGREMLVI FKAFLEVENM GRLYQALEVI CRHLYELIKD
NRLFAVRVAY GSIVTEIAGL RQSYKEAAEL IALGHSCGHN SGYIDFEDIL FEALVCSLPG
RLRGRYLEPL YQKIVAAGEE GIQLLETVEA CIDNNMNIKQ TAEQLYLHRN TVTNRLERIK
LLTGLDPGTG FRALFWLKML AVYRRLVQTR DEA
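
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that relationship on the first 30 bases of the gene. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which start codons are permitted). The function names are hypothetical, and the code handles only unambiguous A, C, G and T bases.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(text):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_bases = "ATGTTAAACC AGGAATTAAT CCAGCCCCTG"
print(translate(first_bases))  # MLNQELIQPL
```

The output matches the first ten residues of the listed protein sequence. Passing the whole gene sequence to `translate` allows the same comparison over the full length.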