Gene Cthe_0391 details


Gene Information

Locus tag: Cthe_0391
Symbol:
ID: 4808468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 486278
End bp: 487777
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 40%
IMG OID: 640105805
Product: ABC transporter related protein
Protein accession: YP_001036822
Protein GI: 125972912
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
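
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(486278, 487777)
print(length)                  # 1500
print(protein_length(length))  # 499
```

Both results agree with the Gene Length (1500 bp) and Protein Length (499 aa) fields listed in the record.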


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.302991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAATG CTTTACTTCT GGAAGTTAAA GGGCTTTCAA AAGTATTCCC TGGAACACGT 
GCACTAAATC GTGTCAGCCT TGAAGTAAGA AAAGGTGAAG TACATGGTTT GTGCGGAGAA
AACGGTGCAG GGAAAAGCAC TTTAATGAAC ATTGTAGGTG GAGTTTTTCC TGCAACTGAA
GGAATTATGA AATTCGACGG CAAAATTTTC AATCCCAAAA ACCCTAAGGA CTCTCAGGAT
GCAGGAATAG GTTTTGTTCA TCAAGAGCTT TGTTTGTGTC CGCATCTTTC GGCAGCTGAA
AACATTTTTA TTGGAAGGCT GCCAAGCAAA GGGGATAAAA TTGACTTTAA AAAGCTTTAT
GAAGATGCCG ATGCGATACT TAAAACTTTA GATGCCAATT TCAGCAGCAA AACTCTTGTT
AGTAATTTAA CGGTCAGTGA ACAGCAACTT GTGGAAATAG CAAAATCTAT TTCACTTAAT
TGTAAGCTCC TAATTTTAGA TGAGCCTACC AGTTCCCTGA CTGACAAAGA GACAGCAAGA
CTGTTCCAAG TGGTCAGAGA CCTCAAGAAG AAAGGTATAT CCATACTGTT TATATCTCAC
CGTATGAAAG AAGTGTTCGA AATATGTGAC CGGGTTACAG TTCTAAAAGA TGGTTCGTAT
GTTTGCTGTA TGAATATTTC GGAAATTACA CATGAAGATG TAATAAGAGC GATGGTAGGG
CGTGACCTTG GCGAGCTTTA CCCGCCTAAA TCTTCAAAAA TTGAGGAAGA CAACTATATT
TTAAAGGTTG AAAATTTAAG CGGAAATGGA TTTGAAAATG TAAGCTTTAC ATTGAAAAAA
GGAGAAATCT TAGGTTTCGC AGGCTTGGTA GGAGCAGGTC GCAGTGAGGT CATGAGAGGA
TTGTGTGCAA TAGATCCCGT TAAACGTGGA GAGGTTTATC TAAATGGTGA AAAACAAAAG
TTTAAACGGT ACAAGGACGC AGTAAAAAAA GGAATTTGCT ATCTTACAGA AGACAGAAAA
AATTCGGGTT TGTTTTTGCA CATGAGCATA GCCCAAAATA TCAGCAGTGC AAACCTTGAA
GCTGTTTCAA AACGTGGTTG GATTATAAAG AAACGAGAGT ACAGCTTGTC TGAAAAATAT
GTGAAGGCAT TGTCAATAAA GATTCCGGGT TTGTCATATC CTATCAGTAA TTTGTCCGGC
GGCAACCAAC AGAAATGCCT GATCGGAAAA TGGCTTTCAA CAAATCCCAA AGTCATAATT
ATGGATGAAC CTACACGAGG CATTGACGTC GGGGCAAAAC GTGAAATCCA TAATCTGTTG
CGTTCATTAA GTGAGCAAGG CGTTGGTGTG ATAATTGTCT CCAGTGAACT GCCTGAAATA
ATCGGTGTTG CTGACAGAAT TGCGGTGATG CATGAAGGGA GGCTGGCAGG ATTTTTACAG
GGAGAACAAG TTTCTGAGGA AAACATAATG AAATTGGCTT CCGGAGCAAA TTTATCATAG
 
Protein sequence
MDNALLLEVK GLSKVFPGTR ALNRVSLEVR KGEVHGLCGE NGAGKSTLMN IVGGVFPATE 
GIMKFDGKIF NPKNPKDSQD AGIGFVHQEL CLCPHLSAAE NIFIGRLPSK GDKIDFKKLY
EDADAILKTL DANFSSKTLV SNLTVSEQQL VEIAKSISLN CKLLILDEPT SSLTDKETAR
LFQVVRDLKK KGISILFISH RMKEVFEICD RVTVLKDGSY VCCMNISEIT HEDVIRAMVG
RDLGELYPPK SSKIEEDNYI LKVENLSGNG FENVSFTLKK GEILGFAGLV GAGRSEVMRG
LCAIDPVKRG EVYLNGEKQK FKRYKDAVKK GICYLTEDRK NSGLFLHMSI AQNISSANLE
AVSKRGWIIK KREYSLSEKY VKALSIKIPG LSYPISNLSG GNQQKCLIGK WLSTNPKVII
MDEPTRGIDV GAKREIHNLL RSLSEQGVGV IIVSSELPEI IGVADRIAVM HEGRLAGFLQ
GEQVSEENIM KLASGANLS
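
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last few codons of the record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). CODON_TABLE and translate are hypothetical names introduced for illustration, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGATAATG CTTTACTTCT GGAAGTTAAA"))  # MDNALLLEVK

# Last 18 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GGAGCAAA TTTATCATAG"))  # GANLS
```

The two outputs match the first ten and last five residues of the protein sequence shown above.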