Gene Cthe_2447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2447 
Symbol 
ID4809826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2918049 
End bp2919536 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content40% 
IMG OID640107861 
ProductABC transporter related protein 
Protein accessionYP_001038842 
Protein GI125974932 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000548266 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGAAT ATGTTGTTGA GATGAAAGGT ATCTCGAAAT CCTTTCCCGG CACAAAGGCG 
CTGGACGATG TTGCATTGCA GCTGAAGAAG GGTGAAATTC ACGCCCTTGT CGGTGAAAAC
GGAGCGGGAA AAAGCACTTT GATGAATATT CTGACTGGAC AGATTTCAAT GGATATCGGA
GAAATCTTTA TGGAGGGAAA ACCGGTTCGG TTTTCCTCTC CTAAAGATGC TTTAAAAAAG
AGCATTGTCC TGGTGCCTCA GGAATTGAAT CTGGTGCCGG AACTGAGCAT AGCGGAAAAT
ATCTTTTTGG GCAATGAAAT ATTAAAAATT AGGTTAATTG ACTGGAAAAG TACATGTAAA
GAAGCGGAGA AGCTTCTGGA ATTGTTGGGT GTGCATGTGG ATGTGACCCA ACCTGTTAAA
AAGCTGTCGG CGGCTTATCA GCAGCTGGTC TCTATTGCCA GGGCATTGGC TTATTCCCCA
AAATTGTTGA TTTTGGATGA GCCGACGGCG GTATTGACTA AGAATGAAAA AGAGAATTTG
TTCAAATCCA TGAGAAAACT AAAAGAAAAT GGGACAACCA TGGTGTTTAT CAGCCATCAT
CTCGATGAAG TAATGGAGCT TACCGACCGT GTCACCATCA TGCGTGACGG TCATGTAGTC
AAGGTTGTAA ACACAAATGA AATTACAAAA GATGAAATGA TTAATTTGAT GGCAGGCAAA
AAAGTTGAAA AAACAAAACG GATAAAGCGT AAGGTTTCCG ATGAAATCTT TTTCGAAGTC
AGAAATCTTA CAAGAAAAGG TGAATTTGAA GATATCAGCT TTCATGTAAA GAAAGGCGAA
ATTTTGTGTG TGGCAGGCCT GGTTGGAGCA GGAAGAACCG AGATATTTAA ATGTGCCTTT
GGAATTACGG AAAAGGAACC CGGCGGAAAG ATTTTTATCG AAGGCAGGGA AGTAAACATA
AAATCTCCTA TTGACGCAAT CAAATATGGT ATTGGGTATG TTTCCGAAGA AAGAAGACAT
GACGGCATTA CACCCAATAT GTCGGTTATG GAAAATATGA TGTTGCCGTC GTATGGAGAG
TTAAAGAAAT ATGGTCTGAT TGATTATGAA AAGGCAGTTT CCATTACAAA TGACTACATT
CAATCTTTTA GAATCAAGAC ACCTTCCAGG GACACTCTGA TTAAGAATTT ATCCGGTGGA
AATCAGCAGA AAGTTATCGT AGCAAGATGG ATGGCCAAGG GAATTAAAAT GTTGATTTTG
GATGAACCTA CCAGGGGAAT TGATGTTAAT GCTAAAGGTG AAATCCATCA GCTTATAAGG
GAACTGGCTG ATAAAGGAGT GGCTGTTGTT GTAATCTCCT CGGAGATAGA AGAAGTATTG
GCATTGGCAG ACAGAATCAT GGTTATCCAA CGGGGTAAAA TTGGTGGATA TATTAACGAT
GTCGATATGA CAACACAGGA AGATGTGCTG AAGGTGGCAT TTCAATGA
 
Protein sequence
MYEYVVEMKG ISKSFPGTKA LDDVALQLKK GEIHALVGEN GAGKSTLMNI LTGQISMDIG 
EIFMEGKPVR FSSPKDALKK SIVLVPQELN LVPELSIAEN IFLGNEILKI RLIDWKSTCK
EAEKLLELLG VHVDVTQPVK KLSAAYQQLV SIARALAYSP KLLILDEPTA VLTKNEKENL
FKSMRKLKEN GTTMVFISHH LDEVMELTDR VTIMRDGHVV KVVNTNEITK DEMINLMAGK
KVEKTKRIKR KVSDEIFFEV RNLTRKGEFE DISFHVKKGE ILCVAGLVGA GRTEIFKCAF
GITEKEPGGK IFIEGREVNI KSPIDAIKYG IGYVSEERRH DGITPNMSVM ENMMLPSYGE
LKKYGLIDYE KAVSITNDYI QSFRIKTPSR DTLIKNLSGG NQQKVIVARW MAKGIKMLIL
DEPTRGIDVN AKGEIHQLIR ELADKGVAVV VISSEIEEVL ALADRIMVIQ RGKIGGYIND
VDMTTQEDVL KVAFQ