Gene Moth_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2024 
Symbol 
ID3831399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2112161 
End bp2113207 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content55% 
IMG OID637829953 
ProductLacI family transcription regulator 
Protein accessionYP_430863 
Protein GI83590854 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCAAATA TCTCTGATGT TGCCAGAAGA GCCGGAGTAT CACGAACTAC GGTATCCAGG 
GTCCTCAACG GGAAAGACGA CGTCAACGAA GAAACCCGGC GCCGGGTACT GGAGGCCATT
AAGGAGCTAA ACTACCGCCC CAGTGCCCTG GCCCGCAGCC TGGTGAAACA AAAGACCGAT
ACCATTGGCG TTATCCTGTC GGATATCACC GATCCCTTCT TTTCCCTTAT TATCCAGGGG
GTGGAAGATG TAGCTCATAA ATTCGGTTAC GGCATAGTTT ACGCCAGTAT GCGCTGGGAC
CCGCAAATCA AGCATAACTA TGTAAGTTTC TTGCGCAACG GGCGGGTGGA CGGTCTCCTT
ATGATGGGCC ATACAGTGGG CAATGAAGAC TATGTACGGG AGATGGTAGA GGATAAGTTT
CCCCTGGTCC TGGTTGAATA TTGGATTGAG AATCTCAAGG CCAACTTTAT ATCCATTGAT
AACAAAGGGG GGGGTTACCT GGCCACCAGG CACCTGCTGG GACTTGGGCA CCGCCGCATT
GCCCATGTAG CAGGGCATAA AAACGCCCGG GTATCGCAAG AACGCCTGGC CGGTTACCGC
CAGGCCCTGG AGGAAGCCGG CAGGCCCTAC GACGAAAGCC TGGTAGTTTA CAGCGATTTT
ACCACCGAAG GAGCCATACC GGTTGCGAAA AAACTATTAT CCCTGCCTGA GCGGCCGACA
GCCATCTTTG CGGCCAACGA CCTGATGGCC TACGGCGTCA TCCATGCGGC CCGCGAACTG
GGATTAAGGG TTCCAAAGGA TCTGGCGGTG GTCGGCTACG ACGACATCGA GCTGGCTTCC
CTAGTAACAC CACCCTTAAC AACCATCCAT CAGCCCCGCT ATGAAATCGG CTCCATGGCC
GCCTGGTCCC TGATCCAGCA GATCGAGAAT AAAGAAATGC AGCCGACGGT AACAGAGTTT
AAGACGAGCC TGGTAATTCG GGAATCCTGC GGCGCTGTTT TCCGGCGCGG CGACCAGGGG
TATGTAGAAG GAGGCAAGCC GGCATGA
 
Protein sequence
MPNISDVARR AGVSRTTVSR VLNGKDDVNE ETRRRVLEAI KELNYRPSAL ARSLVKQKTD 
TIGVILSDIT DPFFSLIIQG VEDVAHKFGY GIVYASMRWD PQIKHNYVSF LRNGRVDGLL
MMGHTVGNED YVREMVEDKF PLVLVEYWIE NLKANFISID NKGGGYLATR HLLGLGHRRI
AHVAGHKNAR VSQERLAGYR QALEEAGRPY DESLVVYSDF TTEGAIPVAK KLLSLPERPT
AIFAANDLMA YGVIHAAREL GLRVPKDLAV VGYDDIELAS LVTPPLTTIH QPRYEIGSMA
AWSLIQQIEN KEMQPTVTEF KTSLVIRESC GAVFRRGDQG YVEGGKPA