Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2838 |
Symbol | |
ID | 4809675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3354477 |
End bp | 3356318 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108258 |
Product | hypothetical protein |
Protein accession | YP_001039230 |
Protein GI | 125975320 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0208595 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTC TAACGTTATA TGCCGGAAAA ATCAACCAAA TGCCCGGATT GATAAATGAA GTCAAGAAAT CTGTGGTGGA TTACAAGTCA GAATTATCCG CATTGAGAAA GAAAACTTTG AACATCAACA GAAGTGTATG CAATTTGGAT GAAGTAATAA GTTCCATACA GGCATCTTCC CAGACTCAGG ATAGAAAAAT TGATTCACTT GAGAAATTCT GCAGTGAAAG TGAGAAGTTT ATATCGGAAG TAATACGTAT CGATGAAGAA GTGGCTGAGC TTATCAATAA ACGGAAAGAA AATTTTTACA AAGAATATTA TTATTTAAAA CCGGAAAGCG AGAAAAGCGG CTGGGAAAAA ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCCATTGCC AAGATAGTGG CTGCCGCAGT AGTTATTACC GGGTTAGGGA TAGCGGCGGC ATTGACAGGA GGGGTATTGG GAGTCATACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG AGGATTGATA GGAGGAGCGG TTGGAGGAAT AGCCGCGGCG ATAAATGGAG GTTCGTTTCT GGAAGGATTT GCGGACGGAG CATTGAGCGG AGCGATTTCC GGAGCGGTAA CAGGAGCGGC ATGTGCCGGG CTTGGTGCTT TAGGAGCTCT AGCAGGGAAA AGCATCCAAT GTATGAGCAC AGTGGGAAAA GCGATAAATG TTACATCAAA GGTTACGGCA GCACTTTCTT TTGGTATGGA TGGATTTGAC ATGCTGGCAA TGGGAATATC ATTGTTTGAT CCATCCAATG CATTGGTTGA ATTTAACCGG AAGCTGCATT CCAATGCACT TTACAATGGA TTCCAGATTG CAGTAAACGC GCTGGCTGTG TTTACTGCCG GAGCGGCATC CACAATGAAG TGCTTCGTTG CAGGCACGCT GATATTGACT TCGGCAGGCT TGGTTGCGAT AGAAAATATC AAGGCAGGAG ACAAGGTAAT TGCGACGAAT CCTGAAACTT TTGAAGTAGC GGAAAAGACG GTGCTTGAGA CATATGTGAG AGAGACAACG GAGCTTTTGC ATTTGAGAAT CGGAGGCGAA GTAATCAAAA CAACCGTCGA CCATCCATTT TATGTTAAAG ATGTAGGTTT TGTTGAAGCG GTGAATCTGC AAGTCGGAGA CAAATTGGTT GATTCAAGAG GCAACGTTTT GGTAGTGGAA GAGAAAAAGC TCGAAATAAC TGGTGAACCT GTGAAAGTTT ACAACTTTAA AGTTGATGAC TTTCATACTT ATCATGTTGG GAATAAAGGG ATATTGGTAC ATAATGCGAA TTATAATCCT AAAACTACCT TTGAAAATCT GGATCTGGAA ACCGCCAGTA ACAAGCAAAA GGGTAATTAT GGAGAATATC GTGCGAACGA TAATTTAATT AACAATCAAA GTCTGAAAGA AGAAAGATAT AATTTAAAAC GAAAGGGGAG AAGTGCACCG ACATCTCCGG ATGATAAAAT TGTAAAGGGG ATAGATGGAA TATATGTAAA CGAGGATCCA AACTCAAATA TTAAATATGT AATTAATGAG TCAAAGTTTA ATAGTGCACA ATTGGGGAAA ACGAAAAAAG GCATAAAACA AATGTCGGAT GAGTGGCTCC TTGAGAAACA AGGTAAAAGA ATTTTAAAAG CAGTTAATGG CGATAGAAAG CTGCAAAAAG ACATATTGCA AGCGTTAGAT GATGGTCAAA TAGAAAAAGT TTTATCACGA GTTGGCAAAG ATGGAAAAGT GATAACATAT AGACTGGGCA GCAATGGTGA AATAATCGGA CTTTGGCCAT AA
|
Protein sequence | MATLTLYAGK INQMPGLINE VKKSVVDYKS ELSALRKKTL NINRSVCNLD EVISSIQASS QTQDRKIDSL EKFCSESEKF ISEVIRIDEE VAELINKRKE NFYKEYYYLK PESEKSGWEK IKDGLKSVAE WCKENWKSIA KIVAAAVVIT GLGIAAALTG GVLGVILAGA FWGALAGGLI GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGAACAG LGALGALAGK SIQCMSTVGK AINVTSKVTA ALSFGMDGFD MLAMGISLFD PSNALVEFNR KLHSNALYNG FQIAVNALAV FTAGAASTMK CFVAGTLILT SAGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRETT ELLHLRIGGE VIKTTVDHPF YVKDVGFVEA VNLQVGDKLV DSRGNVLVVE EKKLEITGEP VKVYNFKVDD FHTYHVGNKG ILVHNANYNP KTTFENLDLE TASNKQKGNY GEYRANDNLI NNQSLKEERY NLKRKGRSAP TSPDDKIVKG IDGIYVNEDP NSNIKYVINE SKFNSAQLGK TKKGIKQMSD EWLLEKQGKR ILKAVNGDRK LQKDILQALD DGQIEKVLSR VGKDGKVITY RLGSNGEIIG LWP
|
| |