Gene Cthe_2688 details


Gene Information

Locus tag: Cthe_2688
Symbol:
ID: 4808860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3171893
End bp: 3173446
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 41%
IMG OID: 640108107
Product: polysaccharide biosynthesis protein
Protein accession: YP_001039080
Protein GI: 125975170
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID: [TIGR02900] stage V sporulation protein B
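
The start, end, and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the function names are illustrative and not part of the source database.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumption: Start bp / End bp are 1-based, inclusive positions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3171893, 3173446)
print(length, protein_length(length))  # 1554 517
```

Both values match the Gene Length and Protein Length fields reported for Cthe_2688.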


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAAAA AATCGTTTAT AGGCAGTGCT GTCATTTTGA TGATAGCCAG TTTCATAGTT 
AAAATAATTG GTTTTATTTA CAGAATATAC CTTTCAAACC TTATCGGCGC AGAAGGCATG
GGGTTGTTCC AGCTTATTTC CCCTGTGTAT TCCCTCATTA TCCTTACTTT GACTTCGGGA
GTGTCGATAG CTGTCTCCAA GATGGTGGCG GAAGAAATGG CAAAAGGCCA CCATGTCAAT
TTAAGGAGGA TTACGGGCTG CGCCCTGGTT ATTGTTTTAT TGGCAGGCTT GGCGGTTTCC
CTGCTGATTC TTATTTTTAT AAATCCTATA GTCAATGTAA TATTAAAGGA TTCCAGGACC
TATTATTCAA TGCTTCTTTT GATACCCTGT ATACCTGTGA TTGCAGCTGC ATCCGCCCTC
AAAGGGTACT TTTATGGTAT ACAGGATGTG GTGCCCACCG CATGCTCTCA AGTTGTGGAA
CAGCTTGTGA AAACATTTTT GGTTATGGCC ATGGCGGGCT ACTTCGTAAA TGTGGGACTG
GAATATGCCT GTGCTCTTGC GACTGTCGGA ATGGCACTCG GCGAGATTTC AAACCTGTTG
GTTTTGGTTG TAGTGTACAA ATTCAAGAAA AAGCGGGCTT GTGCGAATGC ATCCAAAAAA
GGCTTTATGA GAAAGCGAGT TATAGTTAAG GAGATTGTAA AAATATCAAT TCCTGTGTCT
TTCAACAGGT TTATCACTTC CATCATGTCC ACTGTAGAGT TTATTTTAAT CCCGAGAATG
CTTGTTTTGG GGGGCATGAC CTATCAAAAC AGCATACAGG AATATGGCAA ACTTACGGGA
ATGGCCATGC CGCTGGTTTT CTTTCCGTCC CTTGTGACAT CAGCTCTGGC GACGACTCTT
GTTCCGGCGA TTTCGGAAGC AATGTCCGTA AAAAGATACA AAACGGTCAA TTACAGAATG
TCAAAATCAA TACAACTTAC ATTTATAATG GGTTTTATAT TTTCAGCCAT TTTTATGCTC
TTTCCCGATA CAATAGGGGA TTTAATTTAC AGGAAGGAAA ATATCGGGCA TATATTGTAT
CTTCTCTCCT TTACCGGAAT ATTCATTTAT CTTCAGCAGA CCCTCCTGGG CATAATGAAC
GGCCTTGGGA AACAGGGAAT TCTTCTTAGA AACTCTATTG TGGGTTATGT AATAAGAATA
CTTTTTGTGA TTTACTTTGT TCCTTCATAC GGAATTGCAG GATATATTGC GGGTATGGTG
GTAAGTTCCA TATGCGTTTG CATACTGGAT ATTTCAACGG TAATCAAGAC AACAGGAATG
GCGCTTGATT TTAGAAATTG GATAATAAAA CCCGGCCTTG CGGGGGCAAT AATGCTTGTT
ATTGGGAAAT ACGTGCAAAG CTTCTTCACC ATATTTCACC TGGGACATTC ATGGACGGTT
GTACTCACTG TCTTTGGAAA TATTGTAATC GGTTTTTTGC TGATGTTTGT GCTGGGAGTT
CTGGATAAAG ATGAGATGTT GGCCATGGTA GGCTTAAAAA AAGTGCAGAG GTAA
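
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage calculation of the kind behind the GC content field; the helper names are hypothetical, and only the first line of the sequence is used here, so the result is not expected to equal the 41% reported for the full gene.

```python
# Join a blocked sequence listing and compute its GC percentage.
# Helper names are illustrative; they do not come from the source database.

def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing (six blocks of 10 bases).
first_line = "ATGGCCAAAA AATCGTTTAT AGGCAGTGCT GTCATTTTGA TGATAGCCAG TTTCATAGTT"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_percent(seq))   # GC percentage of this 60-base fragment only
```

Passing the full 1554-base listing through `clean_sequence` and `gc_percent` would give the whole-gene figure.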
 
Protein sequence
MAKKSFIGSA VILMIASFIV KIIGFIYRIY LSNLIGAEGM GLFQLISPVY SLIILTLTSG 
VSIAVSKMVA EEMAKGHHVN LRRITGCALV IVLLAGLAVS LLILIFINPI VNVILKDSRT
YYSMLLLIPC IPVIAAASAL KGYFYGIQDV VPTACSQVVE QLVKTFLVMA MAGYFVNVGL
EYACALATVG MALGEISNLL VLVVVYKFKK KRACANASKK GFMRKRVIVK EIVKISIPVS
FNRFITSIMS TVEFILIPRM LVLGGMTYQN SIQEYGKLTG MAMPLVFFPS LVTSALATTL
VPAISEAMSV KRYKTVNYRM SKSIQLTFIM GFIFSAIFML FPDTIGDLIY RKENIGHILY
LLSFTGIFIY LQQTLLGIMN GLGKQGILLR NSIVGYVIRI LFVIYFVPSY GIAGYIAGMV
VSSICVCILD ISTVIKTTGM ALDFRNWIIK PGLAGAIMLV IGKYVQSFFT IFHLGHSWTV
VLTVFGNIVI GFLLMFVLGV LDKDEMLAMV GLKKVQR
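
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two differ in which codons may act as initiators); it does not special-case alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon-wise translation of a coding sequence, stopping at the first stop codon.
# Assumption: table 11 uses the standard codon-to-amino-acid assignments.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the 64-entry codon table from the compact TCAG-ordered string.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace allowed) up to, not including, the first stop."""
    cds = "".join(cds.split()).upper()
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGCCAAAA AATCGTTTAT AGGCAGTGCT"))  # MAKKSFIGSA
```

The final 12 bases of the gene, GTGCAGAGGTAA, translate to VQR followed by the TAA stop codon, matching the end of the protein sequence.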