Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0254 |
Symbol | |
ID | 4808602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 311724 |
End bp | 313064 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105666 |
Product | hypothetical protein |
Protein accession | YP_001036686 |
Protein GI | 125972776 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAGG GTGTCTCAAA AACGGGTATT GTGTTTTTGT TACTGATTTG TGTGGGTTTT CTTCTTGCCA ACAACATACT GATACTTGTT TCCATTATTC CATTTACGTT AATGGCTTTT GGGTATTATT TAAAAATGCC CGACGGTATC AGGGTTGACA AGACTGTGTC CAAAAACAGG GTTACGGTCG GAGAACTGCT GGAGGTATCT GTAAGAGTAC TGGTAGAGTC GGGATTTGGT TCAATGGAAA TATGCGATAT TGTGCCCCCG CATTTTGAAC TGGTGGAAGG AACTAATTAC TGTGCAGTGT GGAAAGGGTT TGAGCCGAAA GAAATACTCT TAAATTATAC TGTCCGCTGT ACAGCATCGG GAACTTATAC ATTCAGGACC ACTGGCTGGA GAGCCAGACA TGCTGTGGGA GCTTTTTCGA TAAACAGAAA ATATGAGACG GATTTGACGG TAGAAGTGAC CCCAAGGCTT ATTGAACTCA AGAAGGTAAG GGGCATGTCC ACGGTGTGCA AAGTTCCGAT GCCGGAGGGA GCTTTGGCAA GCATGGGAAT GACAACCCAG GAATTTAAGG AACTCAGGCT CTATTCTCCC GGTGACCCGT TTAAGGCGAT AAACTGGAAG GTTACGTCGA GAAATTTGGT CAGGGGCAGT ATCTGGCCTG TGGTAAATGA GTTTGAAAAA GAAGGAAAGA AGTCTGTGTG GATATTTTTG GATACGTCAA AAATAATGTC CTTCGGTTCC AACATAAAAA ATGTCAAAGA GTATTCTGTT GAGGCTGTAA ACAGTCTTAG CGACTATTAT ATAAAACACA ACTGCAGTGT GGCTTTTCAT ACCTTTGGAG GAAGCGATGT CTTTATAAAT CCCGGGTCGG GAAGGCAGCA GCATTACAGG ATTTTAAGGG AGCTTATGAA AATAAGGAAT TTCACCGGAG TATCCCGGGA AAATTCCGGG GAGCGTCAAA AAAACTCCAA GGAGCACAAA AAACTGGAAG AGGCAGTGTA TTCGTGCAGA AATTATTTTA ACGGACTGAG ACCAATGTTT ATAATTATTA CAAGATTTTG CACAAAAAAT TCCGAAGAAA TTTTCAAAGG TATAAACCTT ATGTCAAAAT ACACTTCGCT TCGAAAGGGC TATGTTCCCA GCATAATGTT GATAAATATA ATGGGGTATG GTCTTATGGC TGAAAATGAG AATGAAATGA TGGCGGCAAA TCTTCTTGAG GCCATGAACA AAGTGCTTTC GGAAAAGATA AGAAAAAATT GTATCTGGAT TGACTGGGAC CCTAACAAGG AAAGCCTTAC AGGTGCATTA TTAAAACAGG TGGTGGGTTA A
|
Protein sequence | MPKGVSKTGI VFLLLICVGF LLANNILILV SIIPFTLMAF GYYLKMPDGI RVDKTVSKNR VTVGELLEVS VRVLVESGFG SMEICDIVPP HFELVEGTNY CAVWKGFEPK EILLNYTVRC TASGTYTFRT TGWRARHAVG AFSINRKYET DLTVEVTPRL IELKKVRGMS TVCKVPMPEG ALASMGMTTQ EFKELRLYSP GDPFKAINWK VTSRNLVRGS IWPVVNEFEK EGKKSVWIFL DTSKIMSFGS NIKNVKEYSV EAVNSLSDYY IKHNCSVAFH TFGGSDVFIN PGSGRQQHYR ILRELMKIRN FTGVSRENSG ERQKNSKEHK KLEEAVYSCR NYFNGLRPMF IIITRFCTKN SEEIFKGINL MSKYTSLRKG YVPSIMLINI MGYGLMAENE NEMMAANLLE AMNKVLSEKI RKNCIWIDWD PNKESLTGAL LKQVVG
|
| |