Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2233 |
Symbol | |
ID | 4809971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2660402 |
End bp | 2661814 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107639 |
Product | hypothetical protein |
Protein accession | YP_001038628 |
Protein GI | 125974718 |
COG category | [S] Function unknown |
COG ID | [COG2604] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAATGC TGAAGAATAA TCTGGAGTTG CTTAAAAAAA GATATCCGGA AATATATAAT GAAATAAAAG ACATTCATGT GGATTCCAAT GCGTATCAGA TTATACAAAA CAATGAAGGA CAAAAGACTT TAAAGGTGAC TTTGGACCTG TATGGCGAGA ATAAAAGCTT CTTTTTGCAC AGTAAATATC ATCCGGATGT TGAAGCAGAC AGATTTGCGC GGGAACAATA CAAACCGGAA TTTTCCATAC AAATTCTCTA CGGATTTGGA TTGGGTTATC ATGTTGAAAA AATTGCCGGA CTTTTAAAAC CGGACAGCAT GCTGTATGTT ATAGAAAATA ATTTGATGGT TTTCAGGTCG GCATTGGAAA ACATGGACTT ATGCCCTATT TTGGAAAATC GGAATGTTTC CCTGATTGTT TCAAAGGATG TTGGCTATAT ATCGGAAAAA GTTAAGGAAT TAATTGATAA CAACCTTGAC AAGGTCAGTT TTATAACACA TCCCGCTTCA TTGAAAGCAA TTCCTGAGGA AAACGAATAT TTCAAATTTG TAATGGAGAA CTGGAATCTT AAGAAAAGTA TTACCGATGA TTATGACAAT ACTTTGCGCA ATAATGCAAA AGAAAACTTA AAGTTAAACA GTCCGAATGT AGGCATCTTT TTTGACAAGT TCAAAGATGT TCCGATAATC ATTGTGTCTA CAGGCCCTTC CCTTGATAAA AACATAGATT TGCTTAAAGA GGCCAAGGGA AGGGCATTGA TTATTTCAGC CGGTTCTGCC TTAAGACCTC TGCTTATGAG GAATATAAAG CCGGATTTCT TTGCCATTAT TGACCCGCAG GATATAACCT ACAACCAGAT AAAGGGATAT GAAAATATCG GTATTCCTTT TATTTATCTG GTTACTGCCG CTTCCTATAC CGTTTCACGT TACCTGGGGC CGAAACTGGT GGCTTATTAC GGAAAGTACA ATAATAGTTC GGAACATTTG GTGGATTCGG GAGGCTCAGT TGCGACCACT ATACTGGACA TAGCCATTAA AATGGGAGGA AATCCTATCA TATTAGTGGG ACAGGACCTG GCGTATGTCG ACGGAAAAAA TCATGCCCAA TATGGGAGCC ATGCCAGCAT TTACTCACCT GAGCTTAAAA ACATGAGAAG GGTAAAAGGG CAAAACGGAG AGATGCTGTA TACATCCCTG GGACTGCTAA GTTATAAATA CTGGATTGAA AACAGGATAC AAAAAGAGAA AAGAATATTC ATAAATGCCA CCGAAGGCGG AGCTTATATC GAAGGAATGA AACATATCAA GTTAAGGGAC GTAATTTCCG ATTATCTAAA AGAAAGTTTC GATTTTGAAA ATAAAATAAA ATCCATACTG AAAGAGAGCG GGATCCAACA TGTTCAAGGA TAA
|
Protein sequence | MTMLKNNLEL LKKRYPEIYN EIKDIHVDSN AYQIIQNNEG QKTLKVTLDL YGENKSFFLH SKYHPDVEAD RFAREQYKPE FSIQILYGFG LGYHVEKIAG LLKPDSMLYV IENNLMVFRS ALENMDLCPI LENRNVSLIV SKDVGYISEK VKELIDNNLD KVSFITHPAS LKAIPEENEY FKFVMENWNL KKSITDDYDN TLRNNAKENL KLNSPNVGIF FDKFKDVPII IVSTGPSLDK NIDLLKEAKG RALIISAGSA LRPLLMRNIK PDFFAIIDPQ DITYNQIKGY ENIGIPFIYL VTAASYTVSR YLGPKLVAYY GKYNNSSEHL VDSGGSVATT ILDIAIKMGG NPIILVGQDL AYVDGKNHAQ YGSHASIYSP ELKNMRRVKG QNGEMLYTSL GLLSYKYWIE NRIQKEKRIF INATEGGAYI EGMKHIKLRD VISDYLKESF DFENKIKSIL KESGIQHVQG
|
| |