Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3095 |
Symbol | |
ID | 4809721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3650006 |
End bp | 3652912 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108523 |
Product | glycosyl transferase family protein |
Protein accession | YP_001039483 |
Protein GI | 125975573 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4346] Predicted membrane-bound dolichyl-phosphate-mannose-protein mannosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.42978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGAAAA AACTTTTTAA GGTCTGCCTT ATATTATTAA TCTCAATTCA GTTCATTTTT TCTTCTCCCG TGTTTGCTGA AAGTGAAAAT CTTGTAAAAA ACCCCGGGTT TGAAGAAGGA AATGATGAGA GTGTATATTT TTGGCAGACC CACTGTTGGG AAAAAGCTGA AGGAGTTACA GAATTTTTTA TTGATGAATC CGTATACCAT TCCGGAGGCA AAAGTGCGTG CATAGTTAAC CATTCTGAAA ATGATTCCAG ATACATGCAG CCAATCAAGG TAAAAGGAGA TACATATTAT AGATTGTCCT GTTGGGTAAA GACAGAAAAT GTCGGAACCA AAACCAAAGG AGCCAACATC TCCATTGAAG GCAGCCTGGA TACTTCCAGG GATATCCGTG AAACAAGCGA TAACTGGGAA TATCTCGAGC TATATGGTAA GACCAGTCCA AACCAGGAAA CCTTTACTCT TACCATAGGT CTTGGCGGAT ACGGCAATAC CAATACCGGT AAAATATGGA TTGATGACGT TGAAGTTGTC GAGCTTGAAA GTCTTCCTGC CGGCAAAACA GCCATCAACC TTGATCCGAA GTATACAGGA GGAAATCAGA ACACGTCTGA AAATACCGGC GGTAAAATAT TTATGATTTT TATCGCACTA ATTTTCTTTC TCATTTTGCT TGCCGTGATA TTTGTCATAT TCTTTAAAGA CAAAATGCCC GGCTTTAACG CCAAAGCGGC GCAGACCGAC AAGCCGGCAG CCAAAAAAAA TCAGAAAGAT AATGATGAGA AATGGATAAA AGTCAAATTT GACAAAATTG ATTTTATCAT AATGGGCGTT ATGACCCTTG TTTATCTCTG CGTTGCTTTG TACAACCTCG GAGATTTTAA AGTTCCCACC ACTTCATGGG AACCAACTCT GCCCGGTGAA TATTTTACCG TAGATCTTGG CAAAGAAGCC ACTCTGTCAA GAATATACTA CTACTGCGGA CTGGGCAGTA ACAAATACAA TCCTTTTTCC AAATTAAGAG TTGAATATCT TAATGAAAAC CAGGAATTTG AATATCTTGC AACTATTGAT AAGAAAGAGC TTCAAGACAT CTTTAAATGG AAATACGTTG ACACCGCACC GGTAACCACC ACACAACTCA AGTTTATTGT AGACTCCACC GGAGGCGCGT TCAATGAAAT AGCCATAGTT GAACAAAACA GCAAAGAGCC TTTGAAAAAT ATCAAAATTA TTGACAGCGT ACTGGAAGAT GAAAGCAAGG CTACAATAGG AAATCTTTTT GATGAACCGG ACAAGTTTGC ATTCAAGCCT TCCTACATGA ACGGAATGTA TTTCGATGAA ATTTATCATG CAAGGACGGC TTTCGAACAC ATTCACAGAA TGTCTCCTTA TGAAACAACC CACCCACCTC TCGGGAAAAT CTTCATAGCA TTGGGTATCC TCGTATTTGG AATGGTGCCT TTCGGCTGGA GAATAGTCGG CACTCTGTTT GGTGTTGCCA TGGTGCCGGT AATGTATATG TTCGGAAAAA AGGTCTTCCA TGACAGATTT TATGCGTTTT GCACTGCATT TTTAATGATG TTTGACTTTA TGCATTTCAG TCAGACGCGA ATAGCAACAA TAGACAGCTA TGTAACTTTA TTTGTCATAC TCATGTATTA CCACATGTAT GACTACTTTA TAAACAAATC CTACAACACA ACCCTTAAGG AATCTTTGAA GCCTTTGTTC CTAAGCGGAC TGTTCTTTGG GTTGGGCGCC GCCAGCAAGT GGATTGCAAT TTACGGAGCC GCCGGACTTG CCCTTCTGTT CTTTGTAAAC AAGGGAACCG AGTTTTGGGA ATATAGAAAG ATATCAACCG GCAAATCCAA AAAGAAACCT CTCTGGATTG GCGATTATCC TTCAAACTTA GGCATTACGA TAGGAGCGTG TGTTTTATTC TTTATTATAA TTCCTCTGGT AATTTACATC CTGTCATATA TTCCTTATAT GCTGGTACCC GGGGACAACA AAGGAATCAC TGTATTTATT GAAAACTCCA AGCACATGTT TGAATATCAC TCAAAACTGG AAGCGGAACA CCCGTACCAG TCCGCATGGT GGGAATGGCC TATTATGGCA AAACCCATGG CGTTCTACTT CGGTAGCGAC CTTGAACCCG GTATGACTTC AAAAATATTT ACAATGGGAA ATCCGGCCGT ATGGTGGGTT GGACTTCTGG CTTTATTGAT AGTTGCAATA TTTGCCCTTT CAAAAGTCAA CAAAAATCTT GTCGTACTGT TTACTTTGGC AGCAACCACC TTCGGATATG TTGCACTGCC TAAAACAATC TTCACAGAAA CTCTGAAAGC CCAAAGCGCC GAATTATGCT GGGCCGTAAT CTTCTTCGCA TTAATTGCTA TCCTGCTGGT TCTGTCAAAG CTGGATACGG ACATTTTAAT AACCAGCGCT GTTTCCTCGG TTGCATTTGG AACAATACTT TACCTCTTTA AAGATATCGA AAGAGGAGAT GCATACTTAA AGAGCGGCAA TACACAGCTT ATTATATGGG CCTGTCTTTT AATATGTATA ACATTGCTGT TTATCGGAAT ATACAGGTAC GACAAAAAAA TGATGGTGCC TTTGGCCGGA CTTATATTCC AGTACGTACC ATGGATTGCC GTCCCCAGAA TAGCATTTAT ATATCACTAT TTCTCCATTG TACCGTTTTT AATACTGTTA ATTGTTTATG TCATTAAAAA AGCTGCCGAT AAACATAGTG GCGTAAAATA TTTCGCATGT GTTTACCTTG GAATTGTTAT GGCTCTGTTT ATACTCTTCT ATCCTGGATT GTCAGGACTT GAAGTACCTA CATCTTATAT GAGATATCTT AAATGGTTTA ACACATGGTA TTTCTAA
|
Protein sequence | MSKKLFKVCL ILLISIQFIF SSPVFAESEN LVKNPGFEEG NDESVYFWQT HCWEKAEGVT EFFIDESVYH SGGKSACIVN HSENDSRYMQ PIKVKGDTYY RLSCWVKTEN VGTKTKGANI SIEGSLDTSR DIRETSDNWE YLELYGKTSP NQETFTLTIG LGGYGNTNTG KIWIDDVEVV ELESLPAGKT AINLDPKYTG GNQNTSENTG GKIFMIFIAL IFFLILLAVI FVIFFKDKMP GFNAKAAQTD KPAAKKNQKD NDEKWIKVKF DKIDFIIMGV MTLVYLCVAL YNLGDFKVPT TSWEPTLPGE YFTVDLGKEA TLSRIYYYCG LGSNKYNPFS KLRVEYLNEN QEFEYLATID KKELQDIFKW KYVDTAPVTT TQLKFIVDST GGAFNEIAIV EQNSKEPLKN IKIIDSVLED ESKATIGNLF DEPDKFAFKP SYMNGMYFDE IYHARTAFEH IHRMSPYETT HPPLGKIFIA LGILVFGMVP FGWRIVGTLF GVAMVPVMYM FGKKVFHDRF YAFCTAFLMM FDFMHFSQTR IATIDSYVTL FVILMYYHMY DYFINKSYNT TLKESLKPLF LSGLFFGLGA ASKWIAIYGA AGLALLFFVN KGTEFWEYRK ISTGKSKKKP LWIGDYPSNL GITIGACVLF FIIIPLVIYI LSYIPYMLVP GDNKGITVFI ENSKHMFEYH SKLEAEHPYQ SAWWEWPIMA KPMAFYFGSD LEPGMTSKIF TMGNPAVWWV GLLALLIVAI FALSKVNKNL VVLFTLAATT FGYVALPKTI FTETLKAQSA ELCWAVIFFA LIAILLVLSK LDTDILITSA VSSVAFGTIL YLFKDIERGD AYLKSGNTQL IIWACLLICI TLLFIGIYRY DKKMMVPLAG LIFQYVPWIA VPRIAFIYHY FSIVPFLILL IVYVIKKAAD KHSGVKYFAC VYLGIVMALF ILFYPGLSGL EVPTSYMRYL KWFNTWYF
|
| |