Gene Ccel_1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1750 
Symbol 
ID7312276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2096302 
End bp2097624 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content39% 
IMG OID643608679 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_002506081 
Protein GI220929172 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGT ACGAAATAAT TGAGAAAAAA CGAGATGGCT TCGAACTTAG CTTGGATGAA 
ATAAATTTCT TTATTACGGA ATATTGTAAT AATAGTATTC CTGACTATCA GGCGGCTGCA
CTTTTAATGG CTATTTTCCT TAGGGGTATG AATGAAAGGG AAACGGCTGA TTTAACAAAC
GTAATGGCAA ATTCAGGTGA CAGGATAGAT CTTTCTTCAA TTGCGGGTAT AAAAGTTGAT
AAACATAGTA CAGGCGGTGT AGGTGACAAG ACCACGCTGA TTCTCGCACC TATTGTAGCT
GCTTGCGGAA TACCTGTTGC CAAAATGTCA GGAAGAGGAC TTGGGCACAC AGGCGGAACA
ATAGACAAGC TTGAGTCAAT ACCCGGATTT AATACAAGCC TTTCAACAGA ACAGTTTATA
GATAATGTAA AAAGCATAGG CATATCAATA GCCGGACAAA CAGGTAATCT GGCTCCTGCT
GATAAAAAAC TATATGCTTT GAGAGATGTT ACTGCAACTG TAAATAATAC ATCACTTATT
GCAAGCAGTA TTATGAGTAA GAAGCTTGCT TCGGGTGCCG ACAGGATAGT ACTGGACGTA
AAAACCGGAA GCGGAGCATT TATGAAAACT TTTGAGGATT CTGTGGAACT TGCCAAAACA
ATGGTTAAAA TAGGTAAAAA CACAGGAAGA AAAACCATTG CGGTAGTTAC CGACATGGAC
ATACCACTTG GTTTTGCAGT AGGAAATTCA CTCGAAATAA TTGAAGTCAT TGATACTCTG
AAAAGTAAAG GGCCAGAGGA TTTGAAAGTT GTTTCATTTG AACTGGCTGC CAGAATGCTT
GAGCTTAGCG GGATAGGTAA TATTGATGAG TGCAGGGCCA GAGTTTCCAA CGCAGTTGAA
AGCGGTAAAG CACTATCTAA ATTTGCCGAG CTGATAGAAA ATCAGGGAGG AAATAAAGAC
GTTATAAATG ACAGTACTTT GTTCCCACAG CCTGCTTATA AGCTGGACTT TATATGTGAA
AAGCCTGGTT ATATACAATT TATGAAAACC GACCAAATAG GTATAGCTTC CCTGGTGCTT
GGTGCAGGAA GAGAAACAAA GGAAAGTAAA ATCGACTATA GTGCTGGTAT TATGTTTAAT
AAAAAAACCG GAGACAGGGT CGAAGCCGGA GAGAGTGTTG CAGTGTTGTA TACAAATAGG
GAAGAGACGC TTTTACATGC TGTTTCCCTT TTAAAAGAAG CGGTTGTAAT AAGTGAACTG
CCTCATGAAA GAAAACCGCT GATTCTGGCG TATATCGACA GTGAAGGAAA CATAAAAAAA
TAA
 
Protein sequence
MRMYEIIEKK RDGFELSLDE INFFITEYCN NSIPDYQAAA LLMAIFLRGM NERETADLTN 
VMANSGDRID LSSIAGIKVD KHSTGGVGDK TTLILAPIVA ACGIPVAKMS GRGLGHTGGT
IDKLESIPGF NTSLSTEQFI DNVKSIGISI AGQTGNLAPA DKKLYALRDV TATVNNTSLI
ASSIMSKKLA SGADRIVLDV KTGSGAFMKT FEDSVELAKT MVKIGKNTGR KTIAVVTDMD
IPLGFAVGNS LEIIEVIDTL KSKGPEDLKV VSFELAARML ELSGIGNIDE CRARVSNAVE
SGKALSKFAE LIENQGGNKD VINDSTLFPQ PAYKLDFICE KPGYIQFMKT DQIGIASLVL
GAGRETKESK IDYSAGIMFN KKTGDRVEAG ESVAVLYTNR EETLLHAVSL LKEAVVISEL
PHERKPLILA YIDSEGNIKK