Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2030 |
Symbol | |
ID | 4811000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2406778 |
End bp | 2408454 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107439 |
Product | hypothetical protein |
Protein accession | YP_001038434 |
Protein GI | 125974524 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00279579 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTA TAACGTTATA TGCCGGAAAA ATCAACCAAA TGCCCGGATT GATAAAAGAA GTCAAGAAAT CTGTGGTGGA TTACAAGTCA GAATTATCCG CATTGAGAAA GAAAACTTTG AACATCAACA GAAGTGTATG CAATTTGGAT GAAGTAATAA GTTCCATACA GGCATCTTCC CAGACTCAGG ATAGAAAAAT TGATTCACTT GAGAAATTCT GCAGTGAAAG TGAGAAGTTC ATATCGGAAG TAGTACGTAT TGATGAAGAA GTTGCTGAGC TTATCAATAA ACGGAAAGAA AATTTTTACA AAGAATATTA TTATTTAAAA CCGGAAAGCG AGAAAAGCGG CTGGGAAAAA ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCCATTGCT AAAATAGTAG TTGCGGCGGT AGTTATTGCA GGATTGGGGA TAGCGGCGGC TTTGACAGGC GGGATATTGG GAGTCGTACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG AGGATTGATA GGGGGAGCGG TTGGAGGAAT AGCCGCTGCG ATAAATGGAG GATCGTTTCT GGAAGGATTT GCGGACGGCG CTTTAAGCGG AGCAATTTCC GGGGCTGTGA CAGGAGCGGC ATGTGCCGGG CTTGGTGCTT TGGGAGCTCT AGCAGGGAAA AGTATCCAAT GCATGAGCAC AGTGGGAAAA GCGATAAATG TTACATCAAA GGTTACGGCA GCACTCTCGT TTGGTATGGA TGGATTTGAC ATGCTGGCAA TGGGAGTATC ATTGTTTGAA CCATCCAATG CATTGGTTGA ATTTAACCGG AAGCTGCATT CCAATGCATT TTATAACGGA TTCCAGATTG CTGTAAACGC GCTGGCTGTT TTCACTGCCG GGGCGGCATC GACAATGAAG TGCTTTGTTG CAGGTACGCT GATATTAACT GCGACAGGCT TGGTTGCGAT AGAGAATATC AAGGCAGGGG ACAAGGTAAT TGCGACGAAT CCGGAGACTT TTGAAGTAGC CGAGAAGACA GTGCTTGAGA CATATGTGAG AGAGACGACG GAGCTTTTGA ATTTGACAAT CAATGGAGAG GTAATCAAGA CAACCTTTGA GCATCCGTTT TATGTTAAAG ATGTGGGTTT TGTTGAAGCG GGAAAACTGC AAGTAGGAGA TAAGTTGGTT GATTCAAGAG GTAATGTTTT AGTATTGGAA GGTAAAAAGC TTGAAATAAC AGATAAGCCT GTAAAGGTTT ACAATTTTAA GGTCGATAAT TTTCATACGT ATCATGTTGG CGAAAATAGG GTATTGGTTC ATAATGCGAA TAAGTATGTT AAGGGAACGA GTAAGACTGA GATAATTGGT AAGCCACATG CTTCAGCTCA ACATCAAGCT TTTACTATGG ACGAAGTAAA TAAGTTATCT TCAACAGGTG AATTCTCAAA AATATACATA AATAAGTCTT TAAAAACTGC AGGTTTTAAT GGAACACAAA AACCTGATAT AATTGCAGTA GGGAAAAACG GAAATGGACA TATTGTTGAG ATTGCCGGTC CTAGTCAGTT ATCAGGTAAA CCTAAGTATG CTCTAAAGAA CAAATTCAAT ACTATGTTAC AAAATAACCC TGGAATGACT GGTGATTTGA TATTCCCTGA ATATTAA
|
Protein sequence | MATITLYAGK INQMPGLIKE VKKSVVDYKS ELSALRKKTL NINRSVCNLD EVISSIQASS QTQDRKIDSL EKFCSESEKF ISEVVRIDEE VAELINKRKE NFYKEYYYLK PESEKSGWEK IKDGLKSVAE WCKENWKSIA KIVVAAVVIA GLGIAAALTG GILGVVLAGA FWGALAGGLI GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGAACAG LGALGALAGK SIQCMSTVGK AINVTSKVTA ALSFGMDGFD MLAMGVSLFE PSNALVEFNR KLHSNAFYNG FQIAVNALAV FTAGAASTMK CFVAGTLILT ATGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRETT ELLNLTINGE VIKTTFEHPF YVKDVGFVEA GKLQVGDKLV DSRGNVLVLE GKKLEITDKP VKVYNFKVDN FHTYHVGENR VLVHNANKYV KGTSKTEIIG KPHASAQHQA FTMDEVNKLS STGEFSKIYI NKSLKTAGFN GTQKPDIIAV GKNGNGHIVE IAGPSQLSGK PKYALKNKFN TMLQNNPGMT GDLIFPEY
|
| |