Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2610 |
Symbol | |
ID | 4809032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3080278 |
End bp | 3082032 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108024 |
Product | hypothetical protein |
Protein accession | YP_001039003 |
Protein GI | 125975093 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000187127 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGGTG CAAGGCTTTT TGACATTAAA ACTTTCAAAC AAATAACGGA ACACAGTATA TATATGGACG GTACAATTAA AGATAAGATA AAAAATAATT TGGAATCACT TCTTAACGTA CGGATTATAG AAACAGACTT TGACTCCGGG GAAAATGAAA AAGTGGATGT TATCGCCTTA GGCAGTGATA ACAATCCTTT GATTGTGTTG TTTAAGACAG CACAAAACGA AAGTGTGGCG GCAAGAGCCG TGTTTTATCT TGACTGGCTT GTAAACAACA AAAATTTGTT TAACAAGCTC GTCCAAAAAA CTTTCCCAGG TATGAATGTT GACGGCATAG ACTGGCAAAA GGCAGGTGTC TGCTGTGTGG GAAGTGACTT TTCCAAGTAT GACCGCTACA TGCTGTCGCA TGTCGGGCGC AATATTGAAT TGATTCGATG TAAGAAATAT GACAGGGATC TTATGGTTAT TGAAACAGTT TACAAACCAA CAGGCATAAG TTTTGGAAAG CCTGTGTCTT TATCTTTGAC AGCCACTTCA AATGCTGAAC AAAACAACAA AACAAAAAAA TATATAAAGG TAAGTCAGAC TCCAAGAATT GGTTATTGCC AGATAGAAGA CGGAAAACAT TATGTGGTGT TCCCCGGAAA GGAAAAGGGC GAGATACTTG ACTTGCCTGT TTCCATATAT CTTGCCCAAA ACCAGTTTGT TCTGGTTGAC GAATACAACC GGTTCCAGTA TGCTTTCTCC TATTGGCTGA ATGACAATGA AATCTTAAGT TCAAATATTG CAAGTTTTGC CGTGGTTGTG CTGAAAGATT CGGAGATTTT TATCGACAAA GGCGACAATG TATTGCTTAA GCTCAACAAT ATTCCCCCAA ATGTTCAGCT TAGGGATAAA AGCATTGTGT CTGTGGACAG TAACAACAAC TTTTTAAGAT TCTACAAGCC GGTAAAACAC AACGCCGACA GCTTTATGAT GTCGGCAAAA GCCAAGGGGC ATACTCTGGC CTTTGTGCTC AAAATTCTGG ATAACGGGGT TTTGCTTCGG GATATTGAAA CTGGCAGGGA ATTTTTTAAA GAAATGGATA CAGACGGCAT AACTTTTAAG GAGCAGCAAA TCCTGTGTCT TCATGAAGGA AACGTGGTAC ATACTCTGAC GTCGTGCAAA TTTTACACTT TGTCTTCGTA CTATGACAAG TTTGAATACG GCACAGTGGA AATAAAGGAC GGACTCGCTT TTCTTAAAAA ACTGTCCGGT GAGATTGTGA TAATAAATGA CGCGCCGGAC TACTTAAAGC CCGGTCAGGT GGCCTATGTG GATGAGAATA ACAATTTCTG CGGCATTGAA GATGACGGGG AAGCTCAGGA GACTGACACC GTAAAGAGAA ATGTTTCCAA TATCAGCACT TTTAAGAGAG TAAGCAAGAA GGAAAGAATC GAGGTAACCA AGCAGGTTTT GATTCTCGGA AATAAAGCTT ATGAAAATTC CTACAAATTG TGCCTTTTAA AATTCGGGTA CAAGGCAGAA GTGCTGGAAG GATTCGAACC ATGGGCAAAA ATCAGTAATG TGCTTAGGGA CACGGATGTG GTAGTGGTGG TAACTTCGCA TATATCCCAT GACAACATGT GGAGAGTAAA AAAGGAAATA ACGGATATTC CTGTTATCTA TTCAGAATTT GACGGAGCAA ACAGAATATT GGAGAAGGTG ATTGCAGCGG AGAACAACTG GAAAGAAGTG CGTACGGCAA GGTAA
|
Protein sequence | MEGARLFDIK TFKQITEHSI YMDGTIKDKI KNNLESLLNV RIIETDFDSG ENEKVDVIAL GSDNNPLIVL FKTAQNESVA ARAVFYLDWL VNNKNLFNKL VQKTFPGMNV DGIDWQKAGV CCVGSDFSKY DRYMLSHVGR NIELIRCKKY DRDLMVIETV YKPTGISFGK PVSLSLTATS NAEQNNKTKK YIKVSQTPRI GYCQIEDGKH YVVFPGKEKG EILDLPVSIY LAQNQFVLVD EYNRFQYAFS YWLNDNEILS SNIASFAVVV LKDSEIFIDK GDNVLLKLNN IPPNVQLRDK SIVSVDSNNN FLRFYKPVKH NADSFMMSAK AKGHTLAFVL KILDNGVLLR DIETGREFFK EMDTDGITFK EQQILCLHEG NVVHTLTSCK FYTLSSYYDK FEYGTVEIKD GLAFLKKLSG EIVIINDAPD YLKPGQVAYV DENNNFCGIE DDGEAQETDT VKRNVSNIST FKRVSKKERI EVTKQVLILG NKAYENSYKL CLLKFGYKAE VLEGFEPWAK ISNVLRDTDV VVVVTSHISH DNMWRVKKEI TDIPVIYSEF DGANRILEKV IAAENNWKEV RTAR
|
| |