Gene Cthe_1303 details


Gene Information

Locus tag: Cthe_1303
Symbol:
ID: 4809555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1581724
End bp: 1582950
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 44%
IMG OID: 640106726
Product: glycosyl transferase, group 1
Protein accession: YP_001037728
Protein GI: 125973818
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
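
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1581724, 1582950)
print(length)                              # 1227
print(expected_protein_length_aa(length))  # 408
```

Both values match the Gene Length and Protein Length fields reported in the record.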


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGATTT TGATGCTTTC ATGGGAATAT CCTCCCAGAA TTGTCGGAGG GATATCGAGA 
GTCGTTCACG GCCTGGCGCA GAAGCTTGGA GCGCGGGGCT GTGACGTTCA TGTTATAACA
TGTTGGGAAA TGGGAACTCG GGAGTTTGAA AGGGATAAAT ATGTAAAGGT ACACAGACTT
CATTCCTATG ACGTAACTCC AAATAATTTT GTTGACTGGG TTCTTCATTT GAATTTTGCC
ATTGTTGAGC ATGCCACCCG GCTTATAAAT GAAACCGGAA AGTTTGACAT TATACATGCC
CATGACTGGC TGGTTGCTTT TGCGGCGAGA GTTTTGAAGC ATGCCTATTC AACACCTCTT
GTTGCAACTA TACATGCCAC GGAACATGGC AGGAACTGGG GTATACATAA TGACACCCAG
CGCTATATAA ACAATGTGGA ATGGTGGCTT GCGTTCGAGG CCTGGAGGCT GATTGTCAAC
AGCGAATATA TGAAGAATGA AGTTATGTCC ATATTCAAGA TTCCCAATGA CAAAATAGAC
GTGATTCCCA ATGGGGTTGA TTTGGATAAA TTCAAAGGCT ACGAGAAGGA TATGGAATTT
AGAAGACGGT TTGCGCAGGA CAACGAGAAA ATAGTGTTCT TTGTTGGAAG ACTGGTAAAC
GAGAAAGGTG TACATGTACT TATAGATGCG CTCCCGAAGG TGTGCCATTA TTACAATGAT
GTCAAGTTTG TGATTGCAGG GAAAGGTCCG CAGTTTGACC ATCTGAAGTG GAAGGCCGAG
AGCATGGGAA TGGCGCACAA GGTCTACTTC ACCGGATACA TAAGTGACGA GGAACTTTTA
AAGCTTTATA AATGTGTTGA TGTTGCAGTT TTTCCAAGTC TTTACGAACC TTTTGGAATT
GTTGCTTTGG AAGGGATGGT TGCAAATGTT CCGGTTGTCG TTTCCGACAC CGGAGGCCTT
GGAGAGATTG TGGAACACGG CGTCGACGGC ATGAAGTCTT ACACGGGAAA TCCCAATTCC
CTTGCAGACA GTATATTGGA AATACTTCAC AATCCCGATA AAGCGGAGAG AATGAAGAAA
AAAGCGTTGG AGAAAGTTCG TTCAATTTAT AATTGGGATG TGGTTGCGGA AAAAACGCTA
AATGTGTATA AAACCATTTT GGAAGAAAAC AAGCATATTT ATTGGGGTTC CCCGATTATG
AAGGAGGAAA CGGAAAGGCT CAACTGA
 
Protein sequence
MRILMLSWEY PPRIVGGISR VVHGLAQKLG ARGCDVHVIT CWEMGTREFE RDKYVKVHRL 
HSYDVTPNNF VDWVLHLNFA IVEHATRLIN ETGKFDIIHA HDWLVAFAAR VLKHAYSTPL
VATIHATEHG RNWGIHNDTQ RYINNVEWWL AFEAWRLIVN SEYMKNEVMS IFKIPNDKID
VIPNGVDLDK FKGYEKDMEF RRRFAQDNEK IVFFVGRLVN EKGVHVLIDA LPKVCHYYND
VKFVIAGKGP QFDHLKWKAE SMGMAHKVYF TGYISDEELL KLYKCVDVAV FPSLYEPFGI
VALEGMVANV PVVVSDTGGL GEIVEHGVDG MKSYTGNPNS LADSILEILH NPDKAERMKK
KALEKVRSIY NWDVVAEKTL NVYKTILEEN KHIYWGSPIM KEETERLN
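
The protein sequence corresponds to a translation of the gene sequence under translation table 11, and the GC content is the share of G and C bases in the gene sequence. The sketch below shows one way to reproduce both from the blocks above, using only the Python standard library. It assumes the sequence is read in frame from its first base and that whitespace between the 10-base groups is only formatting. Table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits; because this gene begins with ATG, that difference is ignored here. The names `translate` and `gc_percent` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon order T, C, A, G at each position; amino acids in the matching order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGCGGATTT TGATGCTTTC ATGGGAATAT"))  # MRILMLSWEY
# Last 9 bases: two codons followed by the TGA stop.
print(translate("CTCAACTGA"))                         # LN
```

Passing the full gene sequence block to `translate` and `gc_percent` gives values that can be compared with the Protein sequence block and the GC content field.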