Gene Cthe_2335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2335 
Symbol 
ID4809263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2783469 
End bp2784941 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content33% 
IMG OID640107742 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001038730 
Protein GI125974820 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000025001 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAAA AGATAACTAA AAATGAAGTT TTGGCGTCAC TTTTTTGGAA GCTTATAGAA 
AGAAGTGGAA CCCAAGGAAT AAACTTTTTG GTATCAATAG TATTGGCGAG GCTGCTTCAG
CCTCAAGAAT ATGGATTGAT TGCTTTAATA TCAATTTTTA TTGCTTTAGC GAATGTATTT
ATACAAACCG GATTAAACAC TGCCTTAATA CAAAAGAAAG ATGTCGATGA GAAAGATTAT
TCTACGGTAT TTTATGCAAG TTTGGGAGTG GCAGGGTTTT TGTATGTTAT ACTGTTTTTT
GCTTCTCCTT TTATTGCAAG CTTTTATGAT CAGGATTTAC TTGTACCTGT TTTAAGAGCA
CTGTCAATTA CATTGTTTTT TGGAGCGGTA AATTCTATAC AAATTGCTGT AATATCTCGC
AACATGCAGT TTAAAAAACT GTTTTACAGT AACTTTGGTG CCATAATAAT CTCCGGTTTA
GTAGGTATTT TTATGGCTTT CAATGGATTT GGAGTATGGG CGTTGGTAGC ACAGCAACTC
GTGAACCAAT TTTTTTCAAC TGCCATTATG TGGTTTACAG TTAAATGGAG ACCAAAATTG
TTATTTAAAT TTGAACGATT GCGAGGTTTA TTTTCTTATG GATGGAAAAT ACTTGCGTCC
AATTTGATAA CTACACTGTT TTTGGATTTG AGAAGTTTAA TAATAAGTAA AATATATAGC
ACTGACTTGC TGGGTTACTT CAATAAAGGA AAACAATTTC CATATGTAAT CATAACCAAT
ATTAACGGTT CAATTCAGTC TGTGATGCTG CCGGCATACT CGTCCATGCA GGATAATAAA
GAGCGGATAA AAGGAATGGT TCGACGTTCC ATAACTACAA GTACATTTAT TATCTTTCCG
ATGATGATAG GCCTTGCATT GGTAGCCGAG CCGTTGGTGA AAATAGTCCT GACAGATAAA
TGGCTGCCGT GTGTGCCTTT TTTACAAATT TTTTGCATGT CCTATATGTT TATGCCTTTT
CATACAGCCA ATCTTGAGGC AATCAAAGCA CTAGGGTACA GCGACTTGAT TTTAAAAATA
GAAATAATCA AGAAAGTTCT CGAACTTTTA ATTTTGCTGA TCAGTTTAAA ATTTGGAGTA
TATGCAATTG CATTTGGAGC GTTTATTACA AGTTTAATTT CTACAATAAT AAATTCATAC
CCTAATACAA AGCTCCTTAA TTATAGCTAT AAAGAACAAT TGGCAGATAT AATGCCTTCG
TTGATGTTGT CTGTTGTGAT GGGATTGGTG GTTTACAGTG TTAAATTAAT GGTATTGTCT
GCTTGGTTGA CTCTTTTGAT TCAAGTTTGT GTAGGTGTAA TTATATATAT GGTTTTGGCA
CAAATTTTTA AAATTGAAAC TTCGATTTAC TTATTTAACA CTCTCAAAAG TGTTTTAGAA
TTCAGTAGAA ATAAAAGAAA AATCCAAGAT TAA
 
Protein sequence
MGKKITKNEV LASLFWKLIE RSGTQGINFL VSIVLARLLQ PQEYGLIALI SIFIALANVF 
IQTGLNTALI QKKDVDEKDY STVFYASLGV AGFLYVILFF ASPFIASFYD QDLLVPVLRA
LSITLFFGAV NSIQIAVISR NMQFKKLFYS NFGAIIISGL VGIFMAFNGF GVWALVAQQL
VNQFFSTAIM WFTVKWRPKL LFKFERLRGL FSYGWKILAS NLITTLFLDL RSLIISKIYS
TDLLGYFNKG KQFPYVIITN INGSIQSVML PAYSSMQDNK ERIKGMVRRS ITTSTFIIFP
MMIGLALVAE PLVKIVLTDK WLPCVPFLQI FCMSYMFMPF HTANLEAIKA LGYSDLILKI
EIIKKVLELL ILLISLKFGV YAIAFGAFIT SLISTIINSY PNTKLLNYSY KEQLADIMPS
LMLSVVMGLV VYSVKLMVLS AWLTLLIQVC VGVIIYMVLA QIFKIETSIY LFNTLKSVLE
FSRNKRKIQD