Gene Cagg_3238 details

Gene Information

Locus tag: Cagg_3238
Symbol:
ID: 7267385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3923320
End bp: 3924627
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 57%
IMG OID: 643568059
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_002464532
Protein GI: 219850099
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
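
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3923320
end_bp = 3924627

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1308
print(protein_length)   # 435
```

Both results agree with the reported Gene Length (1308 bp) and Protein Length (435 aa).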


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGGA TGGTTGAAAT TATTGCCGCG AAACGTGATG GTCGCGCATT GACTACGGCA 
GAGATCGAGT GGGTCGTGCA AGGATACGCT GCCGGTGAGA TTCCCGATTA TCAGATGGCG
GCCTTGGCGA TGGCGATAGT GTTACGCGGC ATGGACGATC GCGAAACCGC CGATTTGACG
ATGGCAATAG CGCGCAGCGG CGACATGCTC GATTTACACG ATGTCGCACC TTTAACGGTT
GATAAGCATT CAACCGGTGG GGTCGGCGAT AAAACAACGC TCGTGTTGGC ACCGATGGTG
GCGGCGGTTG GTTTGCCGGT GGCGAAGATG AGCGGTCGCG GGCTTGGCTT TAGTGGCGGT
ACTATTGATA AGCTGGAAAG TATTCCCGGG TTTCGCGCCA ACCTGAGCGC TGAAGAGTTT
CGCCGCGCCG TCCGCGAGCC TGGCCTGGTC GTTGCTGCGC AAAGTGGTGA TCTGGCACCG
GCCGATAAAA AGCTGTATGC GTTGCGTGAT GTGACCGCAA CCGTCGAGTC CATTCCTCTG
ATTGCGGCAA GTGTGATGAG CAAAAAGCTG GCTGCCGGGG CCGATTGTAT CGTACTTGAC
GTAAAGTATG GACGTGGTGC CTTTATGCAT ACCCTCGACG ATGCGCGGCA ATTGGCACGC
ACAATGGTCG CGATCGGGCG ACACGCCGGC CGGAAGGTGA CGGCCGTGAT CAGCAGTATG
CAGCAGCCGC TCGGTTTCGC CATCGGTAAT GCGCTGGAAG TGCGTGAAGC AGTGGCGGCA
TTGTGTGGCT CAGGCCCCTC CGATTTGGTG GAGTTGTGTC TCGTTCTTGG CAGTGAATTG
GTGTGCATGG CCGGCTTGCG GGATGACCCT GACACAGCGC GTGCCTTGTT GGTCGAAGCG
TTGAGCAGTG GTGCAGCATG GGAAAAGTTC CGGGCGATGG TTGCCAATCA GGATGGTGAT
GTGACAACGC TTGACCATCC TGATCGATTG CCGAATGCAC CGGTTCAAGT TGATCTGGCA
GCACCGCGCG CTGGATTTGT CACTGCAATT GATGGTCAAA CGTTGGGGTT CGTCGTGAAT
GCGTTAGGTG GTGGTCGGGT GCGGAAGGAA GATACGATTG ACCACTCAGT CGGGTTGGTT
TTACGGGCAA AGGTTGGTGC GCGCGTTGCC GCCGGTGATC CGCTGGTAAC CATCCATGCA
GCGCGTCAGA GTGATGTAGA CGCAGTTGCC GAACGGCTTG CCAACGCCTA TACTATTCAT
GACACGCCAC CACCATCGTT ACCCCTCGTC GAGGAGATTA TTCGGTGA
 
Protein sequence
MMRMVEIIAA KRDGRALTTA EIEWVVQGYA AGEIPDYQMA ALAMAIVLRG MDDRETADLT 
MAIARSGDML DLHDVAPLTV DKHSTGGVGD KTTLVLAPMV AAVGLPVAKM SGRGLGFSGG
TIDKLESIPG FRANLSAEEF RRAVREPGLV VAAQSGDLAP ADKKLYALRD VTATVESIPL
IAASVMSKKL AAGADCIVLD VKYGRGAFMH TLDDARQLAR TMVAIGRHAG RKVTAVISSM
QQPLGFAIGN ALEVREAVAA LCGSGPSDLV ELCLVLGSEL VCMAGLRDDP DTARALLVEA
LSSGAAWEKF RAMVANQDGD VTTLDHPDRL PNAPVQVDLA APRAGFVTAI DGQTLGFVVN
ALGGGRVRKE DTIDHSVGLV LRAKVGARVA AGDPLVTIHA ARQSDVDAVA ERLANAYTIH
DTPPPSLPLV EEIIR
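
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the pipeline that produced this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The `translate` helper and the choice of fragments are illustrative assumptions.

```python
# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring the spaces used for display."""
    seq = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First 30 bases and the final line of the gene sequence above.
first = translate("ATGATGCGGA TGGTTGAAAT TATTGCCGCG")
last = translate("GACACGCCAC CACCATCGTT ACCCCTCGTC GAGGAGATTA TTCGGTGA")

print(first)  # MMRMVEIIAA
print(last)   # DTPPPSLPLVEEIIR*
```

The first fragment reproduces the opening ten residues of the protein sequence, and the last fragment reproduces its final residues followed by the TGA stop codon.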