Gene Information |
Locus tag | Cthe_2688 |
Symbol | |
ID | 4808860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3171893 |
End bp | 3173446 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108107 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001039080 |
Protein GI | 125975170 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | [TIGR02900] stage V sporulation protein B |
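
The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal Python sketch of that relationship, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; both assumptions reproduce the listed values. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the Start bp and End bp fields of this record.
    length = gene_length(3171893, 3173446)
    print(length, protein_length(length))
```

For this record the sketch gives 1554 bp and 517 aa, which agrees with the Gene Length and Protein Length fields.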
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAA AATCGTTTAT AGGCAGTGCT GTCATTTTGA TGATAGCCAG TTTCATAGTT AAAATAATTG GTTTTATTTA CAGAATATAC CTTTCAAACC TTATCGGCGC AGAAGGCATG GGGTTGTTCC AGCTTATTTC CCCTGTGTAT TCCCTCATTA TCCTTACTTT GACTTCGGGA GTGTCGATAG CTGTCTCCAA GATGGTGGCG GAAGAAATGG CAAAAGGCCA CCATGTCAAT TTAAGGAGGA TTACGGGCTG CGCCCTGGTT ATTGTTTTAT TGGCAGGCTT GGCGGTTTCC CTGCTGATTC TTATTTTTAT AAATCCTATA GTCAATGTAA TATTAAAGGA TTCCAGGACC TATTATTCAA TGCTTCTTTT GATACCCTGT ATACCTGTGA TTGCAGCTGC ATCCGCCCTC AAAGGGTACT TTTATGGTAT ACAGGATGTG GTGCCCACCG CATGCTCTCA AGTTGTGGAA CAGCTTGTGA AAACATTTTT GGTTATGGCC ATGGCGGGCT ACTTCGTAAA TGTGGGACTG GAATATGCCT GTGCTCTTGC GACTGTCGGA ATGGCACTCG GCGAGATTTC AAACCTGTTG GTTTTGGTTG TAGTGTACAA ATTCAAGAAA AAGCGGGCTT GTGCGAATGC ATCCAAAAAA GGCTTTATGA GAAAGCGAGT TATAGTTAAG GAGATTGTAA AAATATCAAT TCCTGTGTCT TTCAACAGGT TTATCACTTC CATCATGTCC ACTGTAGAGT TTATTTTAAT CCCGAGAATG CTTGTTTTGG GGGGCATGAC CTATCAAAAC AGCATACAGG AATATGGCAA ACTTACGGGA ATGGCCATGC CGCTGGTTTT CTTTCCGTCC CTTGTGACAT CAGCTCTGGC GACGACTCTT GTTCCGGCGA TTTCGGAAGC AATGTCCGTA AAAAGATACA AAACGGTCAA TTACAGAATG TCAAAATCAA TACAACTTAC ATTTATAATG GGTTTTATAT TTTCAGCCAT TTTTATGCTC TTTCCCGATA CAATAGGGGA TTTAATTTAC AGGAAGGAAA ATATCGGGCA TATATTGTAT CTTCTCTCCT TTACCGGAAT ATTCATTTAT CTTCAGCAGA CCCTCCTGGG CATAATGAAC GGCCTTGGGA AACAGGGAAT TCTTCTTAGA AACTCTATTG TGGGTTATGT AATAAGAATA CTTTTTGTGA TTTACTTTGT TCCTTCATAC GGAATTGCAG GATATATTGC GGGTATGGTG GTAAGTTCCA TATGCGTTTG CATACTGGAT ATTTCAACGG TAATCAAGAC AACAGGAATG GCGCTTGATT TTAGAAATTG GATAATAAAA CCCGGCCTTG CGGGGGCAAT AATGCTTGTT ATTGGGAAAT ACGTGCAAAG CTTCTTCACC ATATTTCACC TGGGACATTC ATGGACGGTT GTACTCACTG TCTTTGGAAA TATTGTAATC GGTTTTTTGC TGATGTTTGT GCTGGGAGTT CTGGATAAAG ATGAGATGTT GGCCATGGTA GGCTTAAAAA AAGTGCAGAG GTAA
Protein sequence | MAKKSFIGSA VILMIASFIV KIIGFIYRIY LSNLIGAEGM GLFQLISPVY SLIILTLTSG VSIAVSKMVA EEMAKGHHVN LRRITGCALV IVLLAGLAVS LLILIFINPI VNVILKDSRT YYSMLLLIPC IPVIAAASAL KGYFYGIQDV VPTACSQVVE QLVKTFLVMA MAGYFVNVGL EYACALATVG MALGEISNLL VLVVVYKFKK KRACANASKK GFMRKRVIVK EIVKISIPVS FNRFITSIMS TVEFILIPRM LVLGGMTYQN SIQEYGKLTG MAMPLVFFPS LVTSALATTL VPAISEAMSV KRYKTVNYRM SKSIQLTFIM GFIFSAIFML FPDTIGDLIY RKENIGHILY LLSFTGIFIY LQQTLLGIMN GLGKQGILLR NSIVGYVIRI LFVIYFVPSY GIAGYIAGMV VSSICVCILD ISTVIKTTGM ALDFRNWIIK PGLAGAIMLV IGKYVQSFFT IFHLGHSWTV VLTVFGNIVI GFLLMFVLGV LDKDEMLAMV GLKKVQR
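
Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before any analysis. The sketch below is illustrative Python and not part of the original record. It removes the spacing, computes a GC fraction, and translates a nucleotide sequence with the standard codon-to-amino-acid assignments. Translation table 11 uses these same assignments and differs from the standard code only in which codons may act as start codons; the sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three display blocks of the gene sequence in this record.
    prefix = "ATGGCCAAAA AATCGTTTAT AGGCAGTGCT"
    print(translate(prefix))
```

Run on the first 30 bases, the sketch yields MAKKSFIGSA, the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would allow the GC content field to be checked in the same way.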