Gene Moth_0629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0629 
Symbol 
ID3832527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp653126 
End bp654145 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content39% 
IMG OID637828571 
ProductLacI family transcription regulator 
Protein accessionYP_429501 
Protein GI83589492 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.409769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATAACC CTTCGATTGT AGACGTTGCA AAACGTGCCG GAGTATCAAT TACTACCGTT 
TCACGAATAA TTAATAATAC ATCGTACCCA GTGGCTGCCA AGACACGGGA AAAGGTATTG
AAAGCAATTG AGGAATTACA CTATAGTCCA AATAAAATTG CGCAGAGGCT AAAGAAAAGT
ACTAGCGATA TAGTGGGGTT AATTGTCCGG GATGTTTGCG ATCCGTATTT TGCAGAGATA
GCTAGGGGAG TAACGGATAG AGCAAGCCAA GTTGGCTATC TGTGTTTTTT ATGTAATACC
GGACGTAATC TTGAAAACGA ATTTAGATAC CACGATCTCC TCTGGCAACA TAGAGTTAAA
GGCATAATAT TAGCTGGAGG CGGCCTTGAC CAACCGGATT ATAAGCAAAT ACTTCAGAAG
CAGCTTGAGA GACATGAGAA ATATGGGTTA AAAATTATTG CCTTAGCGCC TCAGGGATTG
GAAATGCCGT ATGTTATGAT AAATGATTAT GAGGCCGGAA AGAAAATCAC AGAGTACCTG
ATTGAGCGAG GACATAGAAA AATCGGTTTT ATTGGTGGTC CAGAAAAAGT ATTCACTTCC
AAAGAGAGAC TAAAAAGTTA TAAAGATGCG ATGGATGCTA TCGGAATGAA GTGCGAGGAA
TATATTATTC ACTGCGATTT TTCACGAAAG GGGGGTTATG AAGCCTGTAA TCTACTTCTT
TCCAAGGTAC AAGATTTAAC AGCGATATGC TGCGTAAACG ATAACATAGC TATTGGGGCA
ATTAGTGCTA TTAAAGAACA TGGATATAAA ATTCCGGAAG ATATTTCAAT CATCAGTATA
GGTAACATTA CTGAAGCAAA ATACACAGAT CCTCCTCTTA CAACGCTCTC AATTCCCCAT
TATGAAATGG GTCAGCTCGC TATAGATGTC ATAGCTGAAG GGAAAACGGA TGTTAGAATC
ATTTTAAACA CCCGTATTAT TGAGAGAAAA TCAGTTGATT TTTCTCAGAA AACAAGGTGA
 
Protein sequence
MDNPSIVDVA KRAGVSITTV SRIINNTSYP VAAKTREKVL KAIEELHYSP NKIAQRLKKS 
TSDIVGLIVR DVCDPYFAEI ARGVTDRASQ VGYLCFLCNT GRNLENEFRY HDLLWQHRVK
GIILAGGGLD QPDYKQILQK QLERHEKYGL KIIALAPQGL EMPYVMINDY EAGKKITEYL
IERGHRKIGF IGGPEKVFTS KERLKSYKDA MDAIGMKCEE YIIHCDFSRK GGYEACNLLL
SKVQDLTAIC CVNDNIAIGA ISAIKEHGYK IPEDISIISI GNITEAKYTD PPLTTLSIPH
YEMGQLAIDV IAEGKTDVRI ILNTRIIERK SVDFSQKTR