Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2834 |
Symbol | |
ID | 4809671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3350172 |
End bp | 3351857 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108254 |
Product | hypothetical protein |
Protein accession | YP_001039226 |
Protein GI | 125975316 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0278532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTA TAACGTTATA TGCCGGAAAA ATCAACCAAA TACCCGGATT GATAAATGAA GTCAAGAAAT CTGTGGTGGA TTACAAGTCA GAATTATCCG CATTGAGAAA GAAAACTTTG AACATCAACA GAAGTGTATG CAATTTGGAT GAAGTAATAA GTTCCATACA GGCATCTTCC CAGACTCAGG ATAGAAAAAT TGATTCACTT GAGAAATTCT GCAGTGAAAG TGAGAAGTTT ATATCGGAAG TAGTACGTAT CGATGAAGAA GTTGCTGAGC TTATCAATAA ACGGAAAGAA AATTTTTACA AAGAATATTA TTATTTAAAA CCGGAAAGCG AGAAAAGCGG CTGGGAAAAA ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCCATTGCC AAGATAGTGG CTGCCGCAGT AGTTATTACC GGGTTAGGGA TAGCGGCGGC ATTGACAGGA GGGGTATTGG GAGTCATACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG AGGATTGATA GGAGGAGCGG TTGGAGGAAT AGCCGCGGCG ATAAATGGAG GATCGTTTCT GGAAGGATTT GCGGACGGCG CTTTAAGCGG AGCAATTTCC GGAGCGGTGA CAGGAGCGGC ATGTGCCGGG CTTGGTGCTT TAGGAGCTCT AGCAGGGAAA AGCATCCAAT GTATGAGCAC AGTGGGAAAA GCGATAAATG TTACGTCAAA GGTTACGGCA GCACTTTCTT TTGGTATGGA TGGATTTGAC ATGCTGGCAA TGGGAATATC ATTGTTTGAT CCATCCAATG CATTGGTTGA ATTTAACCGG AAGCTGCATT CCAGTGCACT TTACAACGGA TTCCAGATTG CTGTAAACGC GCTGGCTGTT TTCAGTGCCG GGGCGGCATC GACAATGAAG TGCTTTGTTG CAGGTACAAT GATATTGACT GTGGCAGGCT TGGTTGCGAT AGAGAATATC AAGGCAGGGG ACAAGGTAAT TGCGACGAAT CCGGAGACTT TTGAAGTAGC GGAAAAGACG GTGCTTGAGA CATATGTGAG AGAAACAACG GAGCTTTTGC ATTTGACAAT CAATGGAGAG GTAATCAAGA CAACCTTTGA GCATCCGTTT TATGTAAAAG ATGTGGGTTT TGTTGAAGCG GGAAAACTGC AAGTAGGAGA TAAGTTGGTT GATTCAAGAG GCAATCTTTT GGTGGTGGAA GAGAAAAAGC TTGAAATAAC AGATAAGCCT GTAAAGGTTT ACAATTTTAA GGTCGATAAT TTTCATACGT ATCATGTTGG CGAAAATAGG GTATTGGTTC ATAATGCGAA TAAGTATGTT AAGGGAACGC GTAGTACTCA GTTGACGTTT GATGAAGCAC TGAAAAAGTT AGACAAGTCA GGCTTACGAC CGGGTCAAAC AGAAATTTCA AAGAGTAGGG TTATGGAAAT CGTAGAGAAT TATGATCCTA TGAAAGCACA AAGCAGTGTG TATACTGATT CAACGGGTAG ATATTTAGTT GAAGGCCATC ATACAACTGT CGCAAATACA ATGCTAGGAA AAGGATCTGG GGTGAATATG AATATACCTA CACAGCAGAT ACCATCTGCT ACAAATGTCT ATTGGACAAA AAAGTGGTAT GAATTTTGGA AAACACAAAT AAAAGTAACA AAATAA
|
Protein sequence | MATITLYAGK INQIPGLINE VKKSVVDYKS ELSALRKKTL NINRSVCNLD EVISSIQASS QTQDRKIDSL EKFCSESEKF ISEVVRIDEE VAELINKRKE NFYKEYYYLK PESEKSGWEK IKDGLKSVAE WCKENWKSIA KIVAAAVVIT GLGIAAALTG GVLGVILAGA FWGALAGGLI GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGAACAG LGALGALAGK SIQCMSTVGK AINVTSKVTA ALSFGMDGFD MLAMGISLFD PSNALVEFNR KLHSSALYNG FQIAVNALAV FSAGAASTMK CFVAGTMILT VAGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRETT ELLHLTINGE VIKTTFEHPF YVKDVGFVEA GKLQVGDKLV DSRGNLLVVE EKKLEITDKP VKVYNFKVDN FHTYHVGENR VLVHNANKYV KGTRSTQLTF DEALKKLDKS GLRPGQTEIS KSRVMEIVEN YDPMKAQSSV YTDSTGRYLV EGHHTTVANT MLGKGSGVNM NIPTQQIPSA TNVYWTKKWY EFWKTQIKVT K
|
| |