Gene Cthe_3177 details


Gene Information

Locus tag: Cthe_3177
Symbol:
ID: 4809628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3753711
End bp: 3754976
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 41%
IMG OID: 640108611
Product: monogalactosyldiacylglycerol synthase
Protein accession: YP_001039565
Protein GI: 125975655
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
TIGRFAM ID:
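
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3753711, 3754976)
print(length, protein_length(length))  # prints: 1266 421
```

Both values agree with the Gene Length and Protein Length fields of this record.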


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAGA ATATTCTTAT ATTATCTTCG AACAATACGG GGCATGGCCA CAAGAGCATA 
ACCGAATCCC TGCTGGAACA GTTTTCCCAT TATCCCGACG TCAATGTCCA TGTGATTGAC
GGTTTTACCC TGGCAGGAAA TTTTGGCCTT AGAATCGGGA AACTTTACGG TTCCGTCACA
AGAAATGCAA AAGAACTGTG GAAACTGGTG TGGGAATTAT CTTTAAAAAA ACCCAGTCTG
TTAAACGACT TTACTGAAGT TGCTATTAAA GATAACTTTT TGAAACTCAT ATGCAATATC
AAACCGGATC TTATACTGTC GGTGCATCCA AACTTCAACG GTTCGGTACT GAACATTCTC
GAAGACTATA ACATTAAAGT TCCCTTTGTA ACGCTGCTGG CAGACATTGT AAGTATTACT
CCCCTCTGGG CGGATCCAAG GGCTGACTAT ATCATATGCC CTTCAAAAGA ATCAAAGTTT
AAATGCCTTG AGTTCGGGGT TTCGGAATCA AAGCTTATTG AAACGGGATT TCCGGTAAGG
CAGAAGTTTT TGAAACATCT TGAAAAAAAC GGCGAAAACA ATACGCAGAA CATTAAAAAA
TACACCGGCG ACAGGCCTTT AGAATGCCTT ATCATGAGCG GCGGCGAAGG TTCCGGAAAT
ATGAGCAGAA TCGCATCGAT ACTGCTTAAG AATTTCAACT GCCGGGTTAA AATAGTCACA
GGCCGTAACA GACTCTTAAA AAGAAGGCTC GAGCGCACCA TCGGGGAACG ATTCGGCGAC
AGGGTTGAAA TATACGGTTT TACCGAAAAC ATTCAAGATC TTATGCTATC CTCCGATATT
GCTTTCACCA GGGGCAGCCC GAATGTCATG ATGGAAGCAG TTGCCTGCAA TGTTCCCCTT
ATTATCACAG GCAATCTCCC CGGTCAGGAA GAAGGCAATC CCGCCTACAT GCAAAAGTAC
AATCTTGGTG TTGTATGCAA AGACGTCAGA AAATTAAGGC ATACAGTCAA CGAACTTCTT
GAAAACAACG GTGAAAAACT GAACAGAATA AAACAGTCTC AAAAAGAGTT TTTAAATCCA
AATGTTGCAA AAGAAATCGT AAGCTTCCTT TTAAGCATAG ATAAACAAGA GGAACCCTGC
TTCCCTGAAG ATTTGTTTTC CGGCGATTTC CCTAGATTTT GGGAATTACA CAAAAGAATA
TCGATGTCGC ACAACAAAAA AATAATACTC AAAAGGCATA AAAAAATCGG AACGAAGCAA
AAATAA
 
Protein sequence
MIKNILILSS NNTGHGHKSI TESLLEQFSH YPDVNVHVID GFTLAGNFGL RIGKLYGSVT 
RNAKELWKLV WELSLKKPSL LNDFTEVAIK DNFLKLICNI KPDLILSVHP NFNGSVLNIL
EDYNIKVPFV TLLADIVSIT PLWADPRADY IICPSKESKF KCLEFGVSES KLIETGFPVR
QKFLKHLEKN GENNTQNIKK YTGDRPLECL IMSGGEGSGN MSRIASILLK NFNCRVKIVT
GRNRLLKRRL ERTIGERFGD RVEIYGFTEN IQDLMLSSDI AFTRGSPNVM MEAVACNVPL
IITGNLPGQE EGNPAYMQKY NLGVVCKDVR KLRHTVNELL ENNGEKLNRI KQSQKEFLNP
NVAKEIVSFL LSIDKQEEPC FPEDLFSGDF PRFWELHKRI SMSHNKKIIL KRHKKIGTKQ
K
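
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It assumes the standard codon assignments, which translation table 11 shares for all internal codons; the helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs only in
# which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(block_text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(block_text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGATTAAGA ATATTCTTAT ATTATCTTCG AACAATACGG GGCATGGCCA CAAGAGCATA"
print(translate(clean(first_line)))  # prints: MIKNILILSSNNTGHGHKSI
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Applying `gc_content` to the full cleaned gene sequence gives a value that can be compared with the GC content field.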