Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2446 |
Symbol | |
ID | 4809825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2916929 |
End bp | 2918047 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107860 |
Product | ABC-type sugar transport system periplasmic component-like protein |
Protein accession | YP_001038841 |
Protein GI | 125974931 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0237495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAGT TAAAAAAAGT AATAGCTCTT CTTGTTACAA CAATGCTTGT TTTATCCGTG TTAGTTGGAT GTGGAAACAA TACAACCAAC AATAACAGCG GCACAAGCGG ACCAAGTACA AACAGCGGAA CAAGCGGATC GGGCACAAAC AGTGGTACTA CGTCGAACAA TGTAACCGGA AAAACCGATT TGGCTGACAC CAATTTTGAC ACAAGCTATA CGCCAAAGAG AACATCATAC AAGATTTACT GCACATACAA GAATATTCAT GCCTGGTATG ATGCTATCAA ATGCGGTATT GATGCTGCTG TAAAAGAGCT GGCAGAAAAA GGCGTTACAG TAGATTATGA ATGGTATGGA CCTGCCCAGC CGGATGCCGT TGACCAGGTA AATTCCATTG AGACTGCAAT CGGACAAGGT TGGGACCTTA TCGCTGTCGA CGTTAACCAG CCCGAATTGA CAGGAGAAGC AATCAACAAT GCCGTCGCAA AAGGTATTCC TGTTGCTGTA TTCGGAACTT CAGACGTACC GAACTGTGAC CGTGCATTCT TTGTAGGAAA TACTGACCCG TATGGTGATG GCTGTGCCCT TGCAAAAGCA GTTTGTGAAA AGATGGGTGG CAAAGGTCAG ATTGCAATTC TGGCAGGTAC TATAGGAGCT TTGGCTCACG AAGAAAGATT GCGTGGATTT AAGGATACAA TTGCAAAATA TCCTGATATA GAAATCGTTG ACGAGCAGCG CGACAACGAC GAAGTTGAAA AGGCAATCAG TATTACAGAA TCCTGGCTCC AGGCTTATCC TAACCTGGGA GGTATTCTCT GCAACAACAT GTCCAACCCG GTTGGTGCAT GCCAGGCTGT AGCAGATGCC GGTAAATCAG GCAAGATCGT TATCGGCGGT ATGGACCATG ACCTTCGTGC TTTGAATGCT CTGAAAGATG GTACTTTGTA TGTGGCACAA GTTCAGAACT GCTATGACAT GGGTTACAAA CTTATCTACA ATGCAATAAA GACGATTGAC GGTGAAAAAG TTGAAGAGTC AACAGCAGTA GGTTCCACTT CAGTGTATGC ACAAGACGCA GATAAATTCA TCAATATGTT ATATGGAGAG GCAAATTAA
|
Protein sequence | MLKLKKVIAL LVTTMLVLSV LVGCGNNTTN NNSGTSGPST NSGTSGSGTN SGTTSNNVTG KTDLADTNFD TSYTPKRTSY KIYCTYKNIH AWYDAIKCGI DAAVKELAEK GVTVDYEWYG PAQPDAVDQV NSIETAIGQG WDLIAVDVNQ PELTGEAINN AVAKGIPVAV FGTSDVPNCD RAFFVGNTDP YGDGCALAKA VCEKMGGKGQ IAILAGTIGA LAHEERLRGF KDTIAKYPDI EIVDEQRDND EVEKAISITE SWLQAYPNLG GILCNNMSNP VGACQAVADA GKSGKIVIGG MDHDLRALNA LKDGTLYVAQ VQNCYDMGYK LIYNAIKTID GEKVEESTAV GSTSVYAQDA DKFINMLYGE AN
|
| |