Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2047 |
Symbol | |
ID | 4811016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2436871 |
End bp | 2438571 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107452 |
Product | hypothetical protein |
Protein accession | YP_001038447 |
Protein GI | 125974537 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.632657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTA TAAAATTATA TGCCGGTAAA ATTAACCAAA CACCTGAACT GATAAGGGAT GTAAAAAAGT CTGTAATTGA TTTTAAATCA GAGTTATCAG CATTAAAGAA GAAAACTCTA AATATCAATA GAAGTGTATG CAATCTGGAC GATGTAACAA GCTCCATACA GGCGTCTTCC CAGACCCAGG ACAGGAAAGT CACTTCTCTT GAAACAGTTT GTAAAGAAAC CGAAGAATTC ATCTCGGAAG TAGTCAGTAT CGACGGCGAA GTGGCTCAGC TTATTAATGA ACGAAAAGAA AATTTTTATA AAGAGTACTA TTACTTGAAA CCGGAAAATG AAAAAAGCGG CTGGGAAAAA ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCAATTGTC AAGATAGTGG CTGCCGCGGT AATTATTACG GGGTTAGGGA TAGGGGCAGC ATTGACAGGC GGAGTATTGG GAGTTGTACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG TGGATTGATA GGAGGAGCGG TCGGAGGAAT AGCTGCGGCG ATAAATGGTG GTTCATTTTT AGAAGGATTT GCTGACGGGG CATTAAGCGG AGCAATTTCC GGAGCTGTGA CAGGATCGGC ATGTGCCGGG CTTGGTGCTT TGGGAGCAGC GGTAGGAAAA GGCATCCAAT GCTTGAGTAC AGTGGGAAAG GCGATAAATG TTACATCAAA AGTGACTGCA GCACTTTCCT TAGGTATGGA TGGATTTGAC ATATTGGCGA TGGGGGTATC GTTATTTGAT CCGTCCAACG TGTTGGTTGA ATTTAACCAG AAGCTACATT CCAATACACT TTACAACGGA TTTCAGATTA TAACTAATGC GCTGGCTGTT TTCACTGCCG GGGCGGCATC AACAATGAAG TGCTTTGTTG CAGGCACGAT GATATTGACT GCGACAGGTT TGGTTGCGAT AGAGAATATC AAGGCAGGAG ACAAGGTAAT TGCAACGAAT CCAGAGACTT TTGAAGTAGC CGAGAAGACG GTGCTTGAGA CATATGTGAG AGAGACAACG GAGCTTTTGC ATTTGACAAT TGGTGGAGAG GTAATCAAGA CAACCTTTGA TCATCCGTTT TATGTAAAAG ATGTAGGCTT TGTCGAAGCA GGAAAACTGC AGGTAGGAGA TAAACTGCTT GATTCAAGAG GCAATGTTTT AGTGGTGGAA GAGAAAAAGC TAGAGATTGC AGATAAACCT GTTAAAGTTT ATAATTTTAA AGTAGATGAC TTCCATACTT ATCATGTTGG CGATAATGAA GTATTGGTGC ATAATGCAAA TTATGTTGAA GGAGACTTAG ACGGTATTAC TATTATTAAT AAGAAGTATG CAGGGCAAAC ATATAAGTTA AGTGGTGATT TAGCATTAAA GTATCCAGAT GGTGTTAAAT TTACGAATGA AGGTTTTCCA GATTTTAGTC CCTATAGTAA GAAGACAGTC AAAGTTGAAG GATTACAAGG TGACACATAC TATGATTTTA TTAAAGCTAA TCAAGCAGCA GGATATAAAT CAACACCAAA AGGGTATACT TGGCATCATG TCGAAGATGG AATTACTATG ATGCTTGTAC CATCTGATTT ACATGGAGCA GTGAAACATA CGGGTGGCGC TGCATTAATA AGGAAGGGAA TAAGGCCATA A
|
Protein sequence | MATIKLYAGK INQTPELIRD VKKSVIDFKS ELSALKKKTL NINRSVCNLD DVTSSIQASS QTQDRKVTSL ETVCKETEEF ISEVVSIDGE VAQLINERKE NFYKEYYYLK PENEKSGWEK IKDGLKSVAE WCKENWKSIV KIVAAAVIIT GLGIGAALTG GVLGVVLAGA FWGALAGGLI GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGSACAG LGALGAAVGK GIQCLSTVGK AINVTSKVTA ALSLGMDGFD ILAMGVSLFD PSNVLVEFNQ KLHSNTLYNG FQIITNALAV FTAGAASTMK CFVAGTMILT ATGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRETT ELLHLTIGGE VIKTTFDHPF YVKDVGFVEA GKLQVGDKLL DSRGNVLVVE EKKLEIADKP VKVYNFKVDD FHTYHVGDNE VLVHNANYVE GDLDGITIIN KKYAGQTYKL SGDLALKYPD GVKFTNEGFP DFSPYSKKTV KVEGLQGDTY YDFIKANQAA GYKSTPKGYT WHHVEDGITM MLVPSDLHGA VKHTGGAALI RKGIRP
|
| |