Gene Cthe_2336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2336 
Symbol 
ID4809264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2785043 
End bp2786170 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content37% 
IMG OID640107743 
Productglycosyl transferase, group 1 
Protein accessionYP_001038731 
Protein GI125974821 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000420527 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATTA TGTATGTGAT TGATGCCGGT CCGGTTAACG GTGGTGCACC GATTTCCACT 
TCAATATTGG CCAATCAGTT TGCGGGTGAT GATAATGAGG TTATTATGGT CATGCCTAAA
AATAAAGATA CTGAGATTTT GGATAAAAGA ATAAAGAGGA TTGAACTTGC ACGGTTTTCT
GACTATTTTC CTCTTGATGT TTTTCATCCA ATAAAAGCTT TGCTTCTTGC AAAAGATTTG
AAAGCAGTCA TTGAAAAGGA AAAACCCGAT GTTATACATG CAAATATGCC TCGCGGAGCA
AGAGCAATTG GGTTATTGAA ATTGCTTGGA ATGATATCTG ATAAAATAAA GCTTGTTTAC
ACAGACAGGG AGCATATTTC ACAATTCAGT CCCCTGGTAC GGATGCTGTA TATTTTCTTT
ATTGCAAGAA GATATGATGC TATAATATGT ATCACGGAGA AGAGCATGGA ATACTGGAGA
AAAAAAGCGA GGAAAGCCAA GATAAGTGTA GTACCCAATA CAGCGGGAAA ATATTATGAG
ACTTATGAAC CTGATATGCA TTCTATAGTC CGAAAAAAGC TAATGATTCC TGACAAAAAA
TTGACGTTAA TGTTTGCCGG AAGAATGATT GAAGCAAAGA ACTGGCCATT GGCTAAAGAA
ATTGTGAGCA AACTGTCTAA GGAGGATGTT CACATTATCA TTGCAATTTC GTACTTTAAT
CAGGAGCAAG AGTGTAAGAC AAAAGACTTT CTGGAAAGTA TCCGAAGGCT TGGTGTGAGT
TACACCTTTA AAGAGAATAT TCCGCAAGAA GAAATGAATG AACTGTATTA TGCGGCCGAT
ATTTTTGTTT TAACTTCAAA CAGGGAATCT TTTGGCAGAA CAGCAATAGA AGCAATGAGC
AGAAAATGTG CTGTTTTGGG GCGTAATGTT GGAGGACTTC CCGAGGTAAT ACAAAAAGAG
GCAAACATAT TTGATTGTGA TGCCGACAAA TTTGTAAACC GTATATTGGA GTACAAAAAA
AACACGGAGG AATTGGAGAA AGACAAAGAT TGGTTTTATG AGCGTTTTGC AAATAATTAT
ACGGCTGAAA TATATAAAAG AAAACACGAA GATGTTTACC GGTTTTAA
 
Protein sequence
MKIMYVIDAG PVNGGAPIST SILANQFAGD DNEVIMVMPK NKDTEILDKR IKRIELARFS 
DYFPLDVFHP IKALLLAKDL KAVIEKEKPD VIHANMPRGA RAIGLLKLLG MISDKIKLVY
TDREHISQFS PLVRMLYIFF IARRYDAIIC ITEKSMEYWR KKARKAKISV VPNTAGKYYE
TYEPDMHSIV RKKLMIPDKK LTLMFAGRMI EAKNWPLAKE IVSKLSKEDV HIIIAISYFN
QEQECKTKDF LESIRRLGVS YTFKENIPQE EMNELYYAAD IFVLTSNRES FGRTAIEAMS
RKCAVLGRNV GGLPEVIQKE ANIFDCDADK FVNRILEYKK NTEELEKDKD WFYERFANNY
TAEIYKRKHE DVYRF