Gene Moth_2366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2366 
Symbol 
ID3832546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2488965 
End bp2489924 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content58% 
IMG OID637830285 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_431191 
Protein GI83591182 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000181907 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000001 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCCATGC ATGACCAGTA TCAAACCGGG ATATGGGATG GAGGTACGGC CCGGGCGCAA 
AAGCGGCGCA GTTCCCTGTC CTTGATTACC GGCTTATTAA TCCTGGCCGT TTTAACCTTC
GCTGCCTTTG CCTACGCCGG CTACCGGGTA GCCGACCTGT GGCTGGCCGG GAACCTGGCT
ACGGGGCAGG GCCAGCAGGA CGGGAGCGGC ATCGCCGAAG CGGCTCCAGG CAACCCGGAG
ACCATTCTCC TCATCGGCGT CGACCAGCGG GTGCCTGGCG AACCCTCCCG GTCGGATACC
ATTATGCTGG CCGTCCTGGA CCCGCAAAAA CCCGGGGTCG ACCTCCTTTC CATTCCCCGG
GACACGCGAG TCAAGATCCC CGGCCACGGT TACGACAAGA TTAACGCCGC CCACGCCTAC
GGCGGCCCCC AGCTATTGAT GGACACCATC AACGATTTTC TGGGCAGCCA CGTGGACAAG
TACGTGGAGG TCAATTTCCA GGATTTCCAG AAGATCATCG ACATTCTGGG CGGGGTCGAT
ATCGACGTCG ACAAGCGCAT GTACTACCCG GAAGAGAACA TAAACTTAAA GCCGGGGTTC
CAGCACCTGA ACGGCTATGA CGCCCTGGCC TACGTCCGCT ACCGCTACGA CCCCGAGGGG
GATATCACCA GGGTCGGGCG CCAGCAGAAG TTTATGAAAG CCCTCATCGA CCAGACCTTG
AAACTGAGCA CCATACCCAA AATACCCAAA CTGGTCAGCG AAATCAGCAA GGACGTCAAG
ACAAACCTCA GCGTCAAAGA AATGCTCTCC CTGGCCCTGA GCATGAAGGA TTTAAACGGT
AGCGCCGTCA ATACCTACAC GATCCCCGGC GAAGGCAAAT ACGTGGGCGG CGTGAGTTAC
TATATCGTCG ATCAGCAGAA GCTGACTGAT GTTCTAGCGT CCATTCCCAA CCTGCGCTAG
 
Protein sequence
MAMHDQYQTG IWDGGTARAQ KRRSSLSLIT GLLILAVLTF AAFAYAGYRV ADLWLAGNLA 
TGQGQQDGSG IAEAAPGNPE TILLIGVDQR VPGEPSRSDT IMLAVLDPQK PGVDLLSIPR
DTRVKIPGHG YDKINAAHAY GGPQLLMDTI NDFLGSHVDK YVEVNFQDFQ KIIDILGGVD
IDVDKRMYYP EENINLKPGF QHLNGYDALA YVRYRYDPEG DITRVGRQQK FMKALIDQTL
KLSTIPKIPK LVSEISKDVK TNLSVKEMLS LALSMKDLNG SAVNTYTIPG EGKYVGGVSY
YIVDQQKLTD VLASIPNLR