Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3211 |
Symbol | |
ID | 4809513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3803185 |
End bp | 3804786 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640108645 |
Product | hypothetical protein |
Protein accession | YP_001039599 |
Protein GI | 125975689 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAAAT GTATTCAGTT TGACATTCAG GCACTGGAGC CTTTGAAACT TGGCAAATTT GAAAGGGATT CAAATAATGA GTACAGCTAT TCATATATTC CGGGTTCTGT AGTAAAAGGA GCCGTTGCTT GGGGATTGGT TGAAGAAAAA GGCTTTGTTC CGAAGGAGAT ATTGAACGGC AGTTCAATAT TTTACAATGC TTATCCTTTG CTGGATGGTA ATCCTGCAAT ACCAATGATG CAGGGGTATA TGGGTGATAA GCAGGAGATA CGTTCAAATA AAGATAATGT CCGTGTGGTA CACTCATTTA ACAGAAAAAT AGAAAACAGC ATTCCTTTAA ACAATTATGA ATTTGTGGTT AAAGATGCGG AAAATGAAAA GCTCCTGCTG GGATATAATC CGGGAAAGAT CGAAAATCTT CATATCAATA AAAAAGATGC GTTAAATGAC AGTAGTAAAT TAAAAATGTT CAGATATGAG GCAATAAGAA AAGGTGAATG CTTTAGGGCC TATATTCGAG TTCCTGAGGA GCATTATGAT GACATAGTAA AGATATTGAG TAAAGATTAT ATTTATTTCG GCGGTTCAAG GGGAAGTGGC TATGGAAAGT GCAGAGTAAC CGGTATAAAA GAAGTTTCTG CTGTTTGTTT GTATGAATCG GATTTGGATA TTAAAGAGGA TTTATATATT TACTTTTTAT CCGATGCAAT ACTGTATTAC GACGGTAAAG TTAATACATA CATACCGGAA AATGAATTAA AAGAAATGCT TGGAATTGAA GGTAAATGTG AATATGTAGA GTCCTATTCC GGTCTTGGAG TTGCGGCGTC TTACAATACT TTATACAGAA CAAATACAGT TTGCTACACT GCTGTATTGA AGGGAAGTGT AATAAAATAC AGAGTGGAAG GAAAGATTAA CCCGGATAGA ATAAAAAAAT TGGCATCTGA AGGTGTTGGG CTAAGAAGAG AGGACGGATA TGGCCAAATT GCCGTATTGG GCAAAATTCA AGATGATATG GTCGTTTCAA GATATAGGAA AAATGATATT GTTGAAAAAT CAGATATTGA ACTTACAGAT GAGGACAAAC GTGTTGTAAA TATGATTTTA TGTAATATTT TTAAGACCCG GTCAAAACTT CAAATTGAAA AAATGGTAAT AGAACTGCTA AAAGATGTAA ACAGACCGCA AGAGAGTCTT CAAAGTCAAA TAGGAAAGCT TCTTAATATT TTTCAGAATG GTTTCTACCA GACGGAAGAT GACTTTAGAA AATATTTGAA TGAATATCTT GAACATATAA GAAAGAAAAA GGGAAAGGCC GTATGGCATA AATTGTATGA CTTTACTGTT GCAAACACTT TGGACAAGTT CAGATCAGAA AGATTAAGTA TTCAAAAAAT GCTTGAGGAT TTTGTACAGA ATAAATTCAA TTCTGTTTTC CGTGAAATTG AGAATATTGT GAATAAAGGC ATTACTTTGG GTATTTACAA ATATCCTGTT GAAAGTCAAA AAGCTCAAGT TCTACATAAT CTTAAAATGG ACTTCTTTAT CAGCTTTTTT GAATACTGTC TGAGGATGAA AGAAAAAGAG GTGATATCAT GA
|
Protein sequence | MHKCIQFDIQ ALEPLKLGKF ERDSNNEYSY SYIPGSVVKG AVAWGLVEEK GFVPKEILNG SSIFYNAYPL LDGNPAIPMM QGYMGDKQEI RSNKDNVRVV HSFNRKIENS IPLNNYEFVV KDAENEKLLL GYNPGKIENL HINKKDALND SSKLKMFRYE AIRKGECFRA YIRVPEEHYD DIVKILSKDY IYFGGSRGSG YGKCRVTGIK EVSAVCLYES DLDIKEDLYI YFLSDAILYY DGKVNTYIPE NELKEMLGIE GKCEYVESYS GLGVAASYNT LYRTNTVCYT AVLKGSVIKY RVEGKINPDR IKKLASEGVG LRREDGYGQI AVLGKIQDDM VVSRYRKNDI VEKSDIELTD EDKRVVNMIL CNIFKTRSKL QIEKMVIELL KDVNRPQESL QSQIGKLLNI FQNGFYQTED DFRKYLNEYL EHIRKKKGKA VWHKLYDFTV ANTLDKFRSE RLSIQKMLED FVQNKFNSVF REIENIVNKG ITLGIYKYPV ESQKAQVLHN LKMDFFISFF EYCLRMKEKE VIS
|
| |