Gene Csal_0830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0830 
Symbol 
ID4027393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp925171 
End bp926559 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content70% 
IMG OID637965996 
Productthymidine phosphorylase 
Protein accessionYP_572886 
Protein GI92112958 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCATTC AAGACATCAT CCGCCGTAAG CGCGACGCCG AGACCCTCGA CGCCGACGCC 
ATCCACGCCT TCATGCGCGG CGTCGCCGAC GGCAGCGTGG GCGACGCCCA GATCGGCGCC
TTCGCCATGG CGGTCGTGCT CAACGGCATG ACCCGCGAGG AGGCCATCGC GCTGACCGAG
GCCACCCGCG ATTCCGGCCA GGTCCTGCGC TGGCACGACC TGCACCTCGA CGGCCCGGTG
CTCGACAAGC ACTCCACCGG CGGCGTGGGC GATCTCGTCT CGCTGGTACT GGGGCCGTGG
ATCGCCGCCT GCGGCGGTCA CGTACCCATG ATCTCCGGGC GCGGACTCGG CCATACCGGC
GGCACACTGG ACAAGCTCGA AGCGATTCCC GGCTATGACG TCACCCCCGA CGACGACCTC
TTCCGCCGGC TGGTCAAGGA CGTCGGCGTG GCGATCATCG GTCAGACCGG CACCCTCGCC
CCCGCCGACA AGCGTCTGTA CGGCGTGCGC GACGTGACCG CCACGGTCGA GTCCCTGCCA
TTGATCGTGG CCTCGATTCT GGGCAAGAAA CTGGCCTGCG GACTCGACAC CCTGGTCATG
GACGTCAAGG TCGGCAACGG CGCCTTCATG CCCACGCCCG ACGCCTCGCG GGAACTGGCC
GAGGCCATCG TCGCCATCGG CAGCGGCGCC GGCACGCCCA CCAGCGTGCT GCTGACCGAC
ATGAACCAGC CGCTGGCCGA CTGCGCCGGC AATGCCTTGG AAGTCCACGA GGCGTTGCGC
CTCTTGCGCG GGGACGGGCG CAATAAAGAG GTGCGCGGCG ACGGGCGCAA TAAAGAGCTG
CGCGGCGACG GGCACGATAG CCGCCTCTAC CAGGTCACCC ATGCCCTGGC CACGGAAATG
CTCGTGCAAG CCGGCCTCGC CGCCGATGCC GCCGACGCCG CGACGCGTCT GGAAACCGCC
CTGGCCTCCG GCGAGGCACT GGAACGCTTT TCGCGCATGG TGCATGGCCT CGGTGGCCCG
AGCGATCTGG CCGAACGCCC CGAACACTAT CTCGCGTCAG CTCCCTTCAC GACCGATGTC
GTCGCACCTC GGGCCGGTAC CGTCAACGCC ATCGACACCC GCGCGCTTGG GCTCGGCGTC
GTCGAACTGG GCGGCGGTCG CCGTAACGCC GGGGATGCCA TCGACCATCG CGTCGGTCTT
TCACGGATCG CCGGGCTCGG CCAGCGCGTC GAGCGCGGCC AGCCCCTGCT GCGCCTGCAT
GCCGCCAGCC GAGCCGAGGC CGACGCCGTC TCTCGGCGTC TGCGCGAGGC ATTCACCCTG
GGCGAACCGG GTCACGCCGT GCCGCCCGCG CTGATCCACG CCACCCTGCG TCAGGAGACA
TCGTCATGA
 
Protein sequence
MLIQDIIRRK RDAETLDADA IHAFMRGVAD GSVGDAQIGA FAMAVVLNGM TREEAIALTE 
ATRDSGQVLR WHDLHLDGPV LDKHSTGGVG DLVSLVLGPW IAACGGHVPM ISGRGLGHTG
GTLDKLEAIP GYDVTPDDDL FRRLVKDVGV AIIGQTGTLA PADKRLYGVR DVTATVESLP
LIVASILGKK LACGLDTLVM DVKVGNGAFM PTPDASRELA EAIVAIGSGA GTPTSVLLTD
MNQPLADCAG NALEVHEALR LLRGDGRNKE VRGDGRNKEL RGDGHDSRLY QVTHALATEM
LVQAGLAADA ADAATRLETA LASGEALERF SRMVHGLGGP SDLAERPEHY LASAPFTTDV
VAPRAGTVNA IDTRALGLGV VELGGGRRNA GDAIDHRVGL SRIAGLGQRV ERGQPLLRLH
AASRAEADAV SRRLREAFTL GEPGHAVPPA LIHATLRQET SS