Gene Cthe_2222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2222 
Symbol 
ID4811087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2651422 
End bp2652486 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content43% 
IMG OID640107628 
Productglycosyltransferase 28-like protein 
Protein accessionYP_001038617 
Protein GI125974707 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase 
TIGRFAM ID[TIGR03590] pseudaminic acid biosynthesis-associated protein PseG 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAATA TAGGAATCAG AGTTGACGGA AGTGCCAATA TCGGCATGGG ACATATAATG 
CGCTGCCTGT CGCTGGCAAA AGGATTTAGA AATGCCGGCG CCAATGTATA TTTCTTAAGC
CGGTTTGAAC AGGGAATTTC AAGGATAAGG CAGGACAACT TTGAAGTTTT GGAAATGCCG
TACCGAAAAA GCAGGAATTC GGGAGGCTTT TTCTATGGAG ATGCTTCGGA GCTGGAGGAA
GACGCGGAAG AAATAATCTG CCGAATTAGA GCATTTAATC TGGATGTGCT GATTATTGAC
TCCTATAACG TCAGCCGGGA GTTTTTTTTG AAGCTGAAGC CGCATGTAAG AAAGCTTTGC
TACATTGATG ATCTTAATAA ATTTGTATAT CCTGTGGATG TGCTGATAAA CGGAAACATT
ACAGCCCCAG CATTAAATTA TGCCAAATAC AGCGATGACG AGCTTATGCT TTTGGGCTTG
AAATATAATC TCATAAGGGA TGAATTTAAA AATTTGCCCG AGAGAATAAT AAACAGGGAT
GTGCGGGAAA TAATGATAAC AACAGGAGGC TCAGACCCTT TTAACCTGAC TCTGAGGCTT
GCAAATGCCA TCCTGCCGGA AGAAGAATTT AAAGATGTGA GAATCAATAT TGTTGTGGGC
AGCGGTTTTA CCAATGCGGA CAAGTTTAGA GAGCTGTCCG AAAGAAACCC GAATGTTGTA
TTGCATGAAA ATGTTTTGCG AATGTCGGAA GTAATGCTAA AATCCGATGT TGCAATATCT
GCAGGGGGAA GCACATTGTA TGAGCTTTGC GCCTGCGGGA CACCTGCCCT GGCTGTTGTT
ATTGCTGATA ACCAAAGGGA AATGGTGGAT ATGTTGTCTT CCGAAGGTTA CATAATCAGC
CTGGGCTGGC ATGAAGAGCT TGATGACAGG GAGCTTTTGC GAAAGGTTAA GTCTTTGTGC
GGGGATTATG AAAAAAGAGT GCTTTTCAGC AGAAAGATGC AAAAGCTGGT GGACGGAGAA
GGGGTAAAAC GTGTGGTTGA GGAAATAATG AAAATAACTT CGTGA
 
Protein sequence
MLNIGIRVDG SANIGMGHIM RCLSLAKGFR NAGANVYFLS RFEQGISRIR QDNFEVLEMP 
YRKSRNSGGF FYGDASELEE DAEEIICRIR AFNLDVLIID SYNVSREFFL KLKPHVRKLC
YIDDLNKFVY PVDVLINGNI TAPALNYAKY SDDELMLLGL KYNLIRDEFK NLPERIINRD
VREIMITTGG SDPFNLTLRL ANAILPEEEF KDVRINIVVG SGFTNADKFR ELSERNPNVV
LHENVLRMSE VMLKSDVAIS AGGSTLYELC ACGTPALAVV IADNQREMVD MLSSEGYIIS
LGWHEELDDR ELLRKVKSLC GDYEKRVLFS RKMQKLVDGE GVKRVVEEIM KITS