Gene Information |
Locus tag | Cthe_2447 |
Symbol | |
ID | 4809826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2918049 |
End bp | 2919536 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107861 |
Product | ABC transporter related protein |
Protein accession | YP_001038842 |
Protein GI | 125974932 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
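
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table of this record.
start_bp = 2918049
end_bp = 2919536
gene_length_bp = 1488
protein_length_aa = 495


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Both checks rely only on the listed fields: the gene is marked as not spliced, so the CDS length equals the genomic span, and the listed gene sequence ends in the stop codon TGA.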
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000548266 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGAAT ATGTTGTTGA GATGAAAGGT ATCTCGAAAT CCTTTCCCGG CACAAAGGCG CTGGACGATG TTGCATTGCA GCTGAAGAAG GGTGAAATTC ACGCCCTTGT CGGTGAAAAC GGAGCGGGAA AAAGCACTTT GATGAATATT CTGACTGGAC AGATTTCAAT GGATATCGGA GAAATCTTTA TGGAGGGAAA ACCGGTTCGG TTTTCCTCTC CTAAAGATGC TTTAAAAAAG AGCATTGTCC TGGTGCCTCA GGAATTGAAT CTGGTGCCGG AACTGAGCAT AGCGGAAAAT ATCTTTTTGG GCAATGAAAT ATTAAAAATT AGGTTAATTG ACTGGAAAAG TACATGTAAA GAAGCGGAGA AGCTTCTGGA ATTGTTGGGT GTGCATGTGG ATGTGACCCA ACCTGTTAAA AAGCTGTCGG CGGCTTATCA GCAGCTGGTC TCTATTGCCA GGGCATTGGC TTATTCCCCA AAATTGTTGA TTTTGGATGA GCCGACGGCG GTATTGACTA AGAATGAAAA AGAGAATTTG TTCAAATCCA TGAGAAAACT AAAAGAAAAT GGGACAACCA TGGTGTTTAT CAGCCATCAT CTCGATGAAG TAATGGAGCT TACCGACCGT GTCACCATCA TGCGTGACGG TCATGTAGTC AAGGTTGTAA ACACAAATGA AATTACAAAA GATGAAATGA TTAATTTGAT GGCAGGCAAA AAAGTTGAAA AAACAAAACG GATAAAGCGT AAGGTTTCCG ATGAAATCTT TTTCGAAGTC AGAAATCTTA CAAGAAAAGG TGAATTTGAA GATATCAGCT TTCATGTAAA GAAAGGCGAA ATTTTGTGTG TGGCAGGCCT GGTTGGAGCA GGAAGAACCG AGATATTTAA ATGTGCCTTT GGAATTACGG AAAAGGAACC CGGCGGAAAG ATTTTTATCG AAGGCAGGGA AGTAAACATA AAATCTCCTA TTGACGCAAT CAAATATGGT ATTGGGTATG TTTCCGAAGA AAGAAGACAT GACGGCATTA CACCCAATAT GTCGGTTATG GAAAATATGA TGTTGCCGTC GTATGGAGAG TTAAAGAAAT ATGGTCTGAT TGATTATGAA AAGGCAGTTT CCATTACAAA TGACTACATT CAATCTTTTA GAATCAAGAC ACCTTCCAGG GACACTCTGA TTAAGAATTT ATCCGGTGGA AATCAGCAGA AAGTTATCGT AGCAAGATGG ATGGCCAAGG GAATTAAAAT GTTGATTTTG GATGAACCTA CCAGGGGAAT TGATGTTAAT GCTAAAGGTG AAATCCATCA GCTTATAAGG GAACTGGCTG ATAAAGGAGT GGCTGTTGTT GTAATCTCCT CGGAGATAGA AGAAGTATTG GCATTGGCAG ACAGAATCAT GGTTATCCAA CGGGGTAAAA TTGGTGGATA TATTAACGAT GTCGATATGA CAACACAGGA AGATGTGCTG AAGGTGGCAT TTCAATGA
|
Protein sequence | MYEYVVEMKG ISKSFPGTKA LDDVALQLKK GEIHALVGEN GAGKSTLMNI LTGQISMDIG EIFMEGKPVR FSSPKDALKK SIVLVPQELN LVPELSIAEN IFLGNEILKI RLIDWKSTCK EAEKLLELLG VHVDVTQPVK KLSAAYQQLV SIARALAYSP KLLILDEPTA VLTKNEKENL FKSMRKLKEN GTTMVFISHH LDEVMELTDR VTIMRDGHVV KVVNTNEITK DEMINLMAGK KVEKTKRIKR KVSDEIFFEV RNLTRKGEFE DISFHVKKGE ILCVAGLVGA GRTEIFKCAF GITEKEPGGK IFIEGREVNI KSPIDAIKYG IGYVSEERRH DGITPNMSVM ENMMLPSYGE LKKYGLIDYE KAVSITNDYI QSFRIKTPSR DTLIKNLSGG NQQKVIVARW MAKGIKMLIL DEPTRGIDVN AKGEIHQLIR ELADKGVAVV VISSEIEEVL ALADRIMVIQ RGKIGGYIND VDMTTQEDVL KVAFQ