Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0830 |
Symbol | |
ID | 4027393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 925171 |
End bp | 926559 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637965996 |
Product | thymidine phosphorylase |
Protein accession | YP_572886 |
Protein GI | 92112958 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCATTC AAGACATCAT CCGCCGTAAG CGCGACGCCG AGACCCTCGA CGCCGACGCC ATCCACGCCT TCATGCGCGG CGTCGCCGAC GGCAGCGTGG GCGACGCCCA GATCGGCGCC TTCGCCATGG CGGTCGTGCT CAACGGCATG ACCCGCGAGG AGGCCATCGC GCTGACCGAG GCCACCCGCG ATTCCGGCCA GGTCCTGCGC TGGCACGACC TGCACCTCGA CGGCCCGGTG CTCGACAAGC ACTCCACCGG CGGCGTGGGC GATCTCGTCT CGCTGGTACT GGGGCCGTGG ATCGCCGCCT GCGGCGGTCA CGTACCCATG ATCTCCGGGC GCGGACTCGG CCATACCGGC GGCACACTGG ACAAGCTCGA AGCGATTCCC GGCTATGACG TCACCCCCGA CGACGACCTC TTCCGCCGGC TGGTCAAGGA CGTCGGCGTG GCGATCATCG GTCAGACCGG CACCCTCGCC CCCGCCGACA AGCGTCTGTA CGGCGTGCGC GACGTGACCG CCACGGTCGA GTCCCTGCCA TTGATCGTGG CCTCGATTCT GGGCAAGAAA CTGGCCTGCG GACTCGACAC CCTGGTCATG GACGTCAAGG TCGGCAACGG CGCCTTCATG CCCACGCCCG ACGCCTCGCG GGAACTGGCC GAGGCCATCG TCGCCATCGG CAGCGGCGCC GGCACGCCCA CCAGCGTGCT GCTGACCGAC ATGAACCAGC CGCTGGCCGA CTGCGCCGGC AATGCCTTGG AAGTCCACGA GGCGTTGCGC CTCTTGCGCG GGGACGGGCG CAATAAAGAG GTGCGCGGCG ACGGGCGCAA TAAAGAGCTG CGCGGCGACG GGCACGATAG CCGCCTCTAC CAGGTCACCC ATGCCCTGGC CACGGAAATG CTCGTGCAAG CCGGCCTCGC CGCCGATGCC GCCGACGCCG CGACGCGTCT GGAAACCGCC CTGGCCTCCG GCGAGGCACT GGAACGCTTT TCGCGCATGG TGCATGGCCT CGGTGGCCCG AGCGATCTGG CCGAACGCCC CGAACACTAT CTCGCGTCAG CTCCCTTCAC GACCGATGTC GTCGCACCTC GGGCCGGTAC CGTCAACGCC ATCGACACCC GCGCGCTTGG GCTCGGCGTC GTCGAACTGG GCGGCGGTCG CCGTAACGCC GGGGATGCCA TCGACCATCG CGTCGGTCTT TCACGGATCG CCGGGCTCGG CCAGCGCGTC GAGCGCGGCC AGCCCCTGCT GCGCCTGCAT GCCGCCAGCC GAGCCGAGGC CGACGCCGTC TCTCGGCGTC TGCGCGAGGC ATTCACCCTG GGCGAACCGG GTCACGCCGT GCCGCCCGCG CTGATCCACG CCACCCTGCG TCAGGAGACA TCGTCATGA
|
Protein sequence | MLIQDIIRRK RDAETLDADA IHAFMRGVAD GSVGDAQIGA FAMAVVLNGM TREEAIALTE ATRDSGQVLR WHDLHLDGPV LDKHSTGGVG DLVSLVLGPW IAACGGHVPM ISGRGLGHTG GTLDKLEAIP GYDVTPDDDL FRRLVKDVGV AIIGQTGTLA PADKRLYGVR DVTATVESLP LIVASILGKK LACGLDTLVM DVKVGNGAFM PTPDASRELA EAIVAIGSGA GTPTSVLLTD MNQPLADCAG NALEVHEALR LLRGDGRNKE VRGDGRNKEL RGDGHDSRLY QVTHALATEM LVQAGLAADA ADAATRLETA LASGEALERF SRMVHGLGGP SDLAERPEHY LASAPFTTDV VAPRAGTVNA IDTRALGLGV VELGGGRRNA GDAIDHRVGL SRIAGLGQRV ERGQPLLRLH AASRAEADAV SRRLREAFTL GEPGHAVPPA LIHATLRQET SS
|
| |