Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3064 |
Symbol | |
ID | 4809938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3604489 |
End bp | 3606096 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108488 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001039453 |
Protein GI | 125975543 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | [TIGR02900] stage V sporulation protein B |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000038595 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAC AGTCAATCAC AAAAGGTTTT GCGGTACTGT CGGCGGCAGG ACTTATAACA AAAATATTAT CGGTGCTTTA TATTCCTTTT TTGCTTGCCA TTATCGGAGA TGAGGGAAAT GGTATTTATG CTGCCGCGTA TCAAGTATAT GTTTTTATTT ATGTTATTGC CAATTCAGGC ATTCCCGTTG CGATAGCAAA GTCCGTGTCT GAGCTTACCG CTGTGGGAAA CTACAAGGAT GCTTTGAGAA TCTTTAAAAT ATCGCGTTTT TTCCTTATTA TAATAGGTAC TGTTTTAACA GTGCTTATGT TTGTCACGGC AAAGCCATTG GCTGTCATGA TTAACTCGGA AAAATCATTC CTTGCAATAG CGGCATTGTC TCCGACACTG TTTTTTACCG CCCTGGCCTC TGCGTACAAG GGATATTTCC AGGGCATGAG CAACATGACT CCGACTGCCG TGTCCCAGGT GGTTGAACAG ATATTCAATA TGATATTCAC AGTGCTTTTT GCAGCGTTGC TAATAAATAA AAGCCTTGAG GCTGCGTGCG CCGGAGGTAC CGTAGGAACA ACTGTGGGTG CGCTGGCTTC CGTTATTGTC CTTATATTTA TATACAACAG AAGAAGAGAA GAAATTAACA ATCTGAAGGA ACACAGGAAG ACTGCAAAGA GATACTCATA CAAGCAGCTT GCGACAAGAA TATTTTATTA CAGCCTACCC ATAACTGTTT GTGTGGCTGC TCAATATGCT GGAAATCTCA TTGATGTGGC AAATATAAGA GGGCGTCTTT TGGCCGGCGG CTATACGCTG GAAATGGCAT CGGTCATGCA CAGCTATTTG TCCAAATACC AGCAAATAAT GAACGCACCG ATTTCCATAG TTTCGGCTCT TGCGGCGGCG GTGCTGCCTT CCATTTCGGG AGCTGCGGCG GAACAAGATA TAAAGCAGGT TAAGGATAAA TCCAACCATG CTTTCAGGCT TTGCATGCTG ATAGTAATTC CGTCGGCTGT GGGGTTGTCC ATATTGAGTG AACCTATTTA CGCCGTATTG AAATACGGAG CGGGTTCCCA CCTTATGCGC TACGGCTCAA TAGTACTCGT TCTCATGTCC ATTGTACAAA TACAGTCGTC AATTTTGCAG GGTGCAGGAA AACTGTACAA AGCAACGATA AATGTAATTT TAGGTATTAT CGCAAAGATA ATTTTCAATT ATATACTTAT AGCAAATCCC AATATAAATA TCATGGGAGC AGTGATAGGA AGTATAGTGG GATACGGTTT GACCATTATT CTCAATGTTA TGACAGTAAG AAAAGAGTTG AAAATAAAAA TAAATATACT GAAACAGGCG GTAAAACCGG CTGTTTCATC AGTGGTAATG GGTATTTTTG TATGGATTGT ATACAAGGGT TTATACTTTG TTTTAGGATT TATTAAGAGC GCATATCTTG TAAACGCATT ATCTACAGTT GTTTCAGTTC TGTTCGGAAT GGCAATATAT TTTTATATAA TGATACTTGT CAGGGGAATA ACAAAAAATG ATTTTGACGT ATTGCCGGAA AAAATCAGAA GAATGATACC CAAATTCGTA TTAAACAAAG CCGTATGA
|
Protein sequence | MKKQSITKGF AVLSAAGLIT KILSVLYIPF LLAIIGDEGN GIYAAAYQVY VFIYVIANSG IPVAIAKSVS ELTAVGNYKD ALRIFKISRF FLIIIGTVLT VLMFVTAKPL AVMINSEKSF LAIAALSPTL FFTALASAYK GYFQGMSNMT PTAVSQVVEQ IFNMIFTVLF AALLINKSLE AACAGGTVGT TVGALASVIV LIFIYNRRRE EINNLKEHRK TAKRYSYKQL ATRIFYYSLP ITVCVAAQYA GNLIDVANIR GRLLAGGYTL EMASVMHSYL SKYQQIMNAP ISIVSALAAA VLPSISGAAA EQDIKQVKDK SNHAFRLCML IVIPSAVGLS ILSEPIYAVL KYGAGSHLMR YGSIVLVLMS IVQIQSSILQ GAGKLYKATI NVILGIIAKI IFNYILIANP NINIMGAVIG SIVGYGLTII LNVMTVRKEL KIKINILKQA VKPAVSSVVM GIFVWIVYKG LYFVLGFIKS AYLVNALSTV VSVLFGMAIY FYIMILVRGI TKNDFDVLPE KIRRMIPKFV LNKAV
|
| |