Gene Cthe_2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2341 
Symbol 
ID4808975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2791172 
End bp2792338 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content38% 
IMG OID640107748 
Productglycosyl transferase family protein 
Protein accessionYP_001038736 
Protein GI125974826 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGATAT TCATTAAAGT TTTATTTTAT GTAAGCGGAT TTATCATATT TTGGGCCATG 
ATAGGATATC CCGTATCACT TAAATTAATT GGAAAATGCT ATAAATCGCG TAAGTTGGAA
AAAGACTATA ATCACCAGCC TACTGTAACG GTTATGGTTG TTGCACATAA CGAGGAAAAA
GTTATACTGG AAAAACTGAA TAATATTCTG GAACTGGACT ATCCTCAAGA CAAAATTGAG
ATTCTGGTTG CTTCCGACAA CAGTACAGAT CAGACCAACA ATATTGTAAA AGAATTTATT
AAAAAGCATC CCGAGCGAAA AATCAGGCTC TATGAGGTTA AAGCCCGGAA GGGAAAAACA
AATGCGCAAA ATGAAGCTCA AAAGACTGTA ACAACGGAAT ACCTGGTTAT GACGGATGCC
AACTCAATGC TTGACAGAAA TGCGGTAAAA GAATTAATGG CGGCGTTTAC ATCGGATGAT
ATTGCGTATG TTTGCGGAAG GCTATCAATT GTGAATCGGG AAGCCAGCGA TGTCAGCAGT
GCGGAGGCCG GTTACTGGGA CAGTGACCTT GCAACCCGTG AAATTGAAGG AAGAATTCAG
ACAATAACGG CCGGAAACGG TGCTCTGTAT GCTTGCAGAA ACAGCGAATA TCATGATTTT
GATCATATAC AATGCCATGA TGCTGCAATG CCCCTATATT ATGCGTTAAA AGGAAAAAGG
GCCATATGCA ACCACGATGC TGTGGCATAT GAAAAAGCGG GAGAAGTAAT AGAGGATGAA
TTTAAAAGAA AAGTACGTAT GAATCGTACG ATATTAATGG CTATTTTGCC TGATATAAGG
ATACTTAATG TTTTTAAATA CAAGTGGTTC TCATACTTTT ATTTCGGACA CAGGACATGC
AGGTATCTGT TATGGATAGC ACATTTAATT GTGCTGCTTT CCAATGCTTT ATTGCTGGCA
AATTCAAAAT TTTATTTATT AACTTTTACC GGACAGTTGC TTTTTTATTT GATTAGCCTG
ATAGGAACTG TCACCAGGAC AAAAAATAAA TATGTATCTC TTATTTATTA TTATACAGTG
ACGATAATTG CCCAGTGGTT TGGAGTTTAT AATATTGTAA CCGGGAGGGC AAAACCCTTC
TGGGAGAAAG CGGAGAGCAC AAGATAG
 
Protein sequence
MGIFIKVLFY VSGFIIFWAM IGYPVSLKLI GKCYKSRKLE KDYNHQPTVT VMVVAHNEEK 
VILEKLNNIL ELDYPQDKIE ILVASDNSTD QTNNIVKEFI KKHPERKIRL YEVKARKGKT
NAQNEAQKTV TTEYLVMTDA NSMLDRNAVK ELMAAFTSDD IAYVCGRLSI VNREASDVSS
AEAGYWDSDL ATREIEGRIQ TITAGNGALY ACRNSEYHDF DHIQCHDAAM PLYYALKGKR
AICNHDAVAY EKAGEVIEDE FKRKVRMNRT ILMAILPDIR ILNVFKYKWF SYFYFGHRTC
RYLLWIAHLI VLLSNALLLA NSKFYLLTFT GQLLFYLISL IGTVTRTKNK YVSLIYYYTV
TIIAQWFGVY NIVTGRAKPF WEKAESTR