Gene Information |
Locus tag | Cthe_2825 |
Symbol | |
ID | 4809662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3340946 |
End bp | 3342793 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108245 |
Product | hypothetical protein |
Protein accession | YP_001039217 |
Protein GI | 125975307 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
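The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the stop codon is counted in the gene length but not translated.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Cthe_2825.
length = gene_length_bp(3340946, 3342793)
protein = protein_length_aa(length)
print(length, protein)
```

Under these assumptions the result agrees with the record: 1848 bp and 615 aa.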
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000521887 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTA TAACGTTATA TGCCGGAAAA ATCAACCAAA TGCCCGGATT GATAAATGAA GTCAAGAAAT CTGTGGTGGA TTACAAGTCA GAATTATCCG CATTGAGAAA GAAAACTTTG AACATCAACA GAAGTGTATG CAATTTGGAT GAAGTAATAA GTTCCATACA GGCATCTTCC CAGACTCAGG ATAGAAAAAT TGATTCACTT GAGAAATTCT GCAGTGAAAG TGAGAAGTTT ATATCGGAAG TAATACGTAT CGATGAAGAA GTTGCTGAGC TTATCAATAA ACGGAAAGAA AATTTTTACA AAGAATATTA TTATTTAAAA CCGGAAAGCG AGAAAAGCGG CTGGGAAAAA ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCCATTGCC AAGATAGTGG CTGCCGCAGT AGTTATTACC GGGTTAGGGA TAGCGGCGGC ATTGACAGGA GGGGTATTGG GAGTCATACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG AGGATTGATA GGAGGAGCGG TTGGAGGAAT AGCCGCGGCG ATAAATGGAG GATCGTTTCT GGAAGGATTT GCGGACGGGG CATTAAGCGG AGCGATTTCC GGAGCGGTAA CGGGAGCCGC ATGTGCCGGG CTGGGTGCTT TGGGAGCAGC GGCAGGAAAA GGAATCCAAT GTATGAGCAC AGTGGGAAAA GCGATAAATG TTACATCAAA GGTTACGGCA GCACTCTCGT TTGGTATGGA TGGATTTGAC ATGCTGGCAA TGGGAGTATC ATTGTTTGAT CCATCCAACG CATTGGTTGA ATTTAACCGG AAGCTGCATT CCAATGCACT TTATAACGGA TTCCAGATTG CTGTAAACGC GCTGGCTGTT TTCAGTGCCG GGGCGGCATC GACAATGAAG TGCTTTGTTG CAGGTACAAT GATATTGACA GCGGCAGGTT TGGTTGCGAT AGAGAATATC AAGGCAGGAG ACAAGGTAAT TGCGACGAAT CCGGAGACTT TTGAAGTAGC GGAAAAGACG GTGCTTGAGA CATATGTGAG AGAGACAACG GAGCTTTTGC ATTTGACAAT CAATGGAGAG GTAATCAAGA CAACCTTTGA GCATCCGTTT TATGTAAAAG ATGTGGGTTT TGTTGAAGCG GGAAAACTGC AAGTAGGAGA TAAGTTGGTT GATTCAAAAG GCAATGTTTT GGTGGTGGAA GAGAAAAAGC TTGAGATAAC AGATGAACCT GTTAAGGTTT ATAACTTCAA AGTGGATGAT TTTCATACTT ATCATGTTGG GAAAAAAGGG ATATTGGTAC ATAATGCAGA CTATAACCCC AAAATGGGAT TTGATGATTT GGACCTTGAG AAAGCTACGA ACAAACAAAA AGGCAATTAT GGAGAGTATC TGGCAGATGA TAATCTTATT AATAATCCAA AATTGAAAGA AGCAGGGTAT GATTTGGAGC GGATAGGAGG TAAGGTTCCG ACCTCACCGG ATGATAAAAT TACAAAAGGG ATAGATGGGA TATATATAAA TAAGAATCCT GACTCAAATG TTAAATATGT AATTGATGAG GCGAAATTTG GAAAAGCGGG ACTTAGTACA AAGACAAGAG ATGGAAAACA AATGTCGGAT TCTTGGCTGA TAGGTGATAA AACAGGTAAT GATAGAATTT TAGAAGCAGT GAATAATGAT AAACAATTAG CAGCTGGTAT ACTCGATGCA TTACAAAACA ACCAAGTAGA AAGAGTGTTG TCAAAAGTGG ATGCAAACGG AAATGTAACG 
ACATATAGAC TGGATAGTGA TGGTAATATA ATTGGAGTTT GGCCATAA
|
Protein sequence | MATITLYAGK INQMPGLINE VKKSVVDYKS ELSALRKKTL NINRSVCNLD EVISSIQASS QTQDRKIDSL EKFCSESEKF ISEVIRIDEE VAELINKRKE NFYKEYYYLK PESEKSGWEK IKDGLKSVAE WCKENWKSIA KIVAAAVVIT GLGIAAALTG GVLGVILAGA FWGALAGGLI GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGAACAG LGALGAAAGK GIQCMSTVGK AINVTSKVTA ALSFGMDGFD MLAMGVSLFD PSNALVEFNR KLHSNALYNG FQIAVNALAV FSAGAASTMK CFVAGTMILT AAGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRETT ELLHLTINGE VIKTTFEHPF YVKDVGFVEA GKLQVGDKLV DSKGNVLVVE EKKLEITDEP VKVYNFKVDD FHTYHVGKKG ILVHNADYNP KMGFDDLDLE KATNKQKGNY GEYLADDNLI NNPKLKEAGY DLERIGGKVP TSPDDKITKG IDGIYINKNP DSNVKYVIDE AKFGKAGLST KTRDGKQMSD SWLIGDKTGN DRILEAVNND KQLAAGILDA LQNNQVERVL SKVDANGNVT TYRLDSDGNI IGVWP
|
| |
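The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any value such as the GC content can be recomputed from it. The following is a minimal sketch of that step, assuming the sequence has been copied as plain text. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first three blocks of the gene sequence, not the full gene, so its GC fraction is not expected to match the 41% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence shown above (illustration only).
first_blocks = "ATGGCAACTA TAACGTTATA TGCCGGAAAA"
seq = clean_sequence(first_blocks)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Applying the same two functions to the complete gene sequence would give a value to compare against the GC content field in the record.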