Gene Information |
Locus tag | Ksed_08120 |
Symbol | |
ID | 8372321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Kytococcus sedentarius DSM 20547 |
Kingdom | Bacteria |
Replicon accession | NC_013169 |
Strand | + |
Start bp | 824940 |
End bp | 826241 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644991096 |
Product | thymidine phosphorylase |
Protein accession | YP_003148631 |
Protein GI | 256824671 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
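The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming a trailing stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(824940, 826241)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results agree with the Gene Length (1302 bp) and Protein Length (433 aa) fields.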
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 0.80887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.487259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGAA CGCAGCACGA CGCCGTGGAC ATCATCCGCA CCAAGCGCGA CGGAGAGCGC CTCTCCGACG AGCAGATCGA CTGGGTGATC GACGCCTACA CCCGGGGGGA CGTCGCCGAC GAGCAGATGT CGGCCCTGGC CATGGCCATC TACCTCAACG GCATGGAGCC CGACGAGATC TCCCGCTGGA CCTCCGCCAT GATCGCATCG GGGGAGCGGA TGGACTTCTC CACCCTGTCG CGCCCCACCG CGGACAAGCA CTCGACCGGG GGTGTTGGCG ACAAGATCAC CCTCCCGCTG GCCCCCCTGG TGGCGGCCTG CGGGGTCGCG GTCCCGCAGC TCTCCGGACG CGGGCTGGGC CACACCGGCG GCACCCTGGA CAAGCTCGAG GCCATCCCCG GCTGGCAGGC CGACCTGACC AACGAGGCGA TGATGCAGCA GCTCGAGGAG GTCGGGGCCG TCATCTGTGC CGCGGGCGCC GGGCTGGCCC CCGCGGACAA GAAGCTCTAC GCCCTGCGCG ACGTGACCGG CACGGTCGAG GCGCTGCCCC TCATCGCCAG CTCGATCATG TCCAAGAAGA TCGCCGAGGG CACCGGCGCG CTGGTGCTCG ACGTGAAGGT GGGCTCCGGC GCGTTCATGA AGACCGAGGC AGACGCCCGG GCCCTGGCCG AGCGTATGGT GGCCCTGGGC GATGCCGCCG GTGTCACCAC CGTCGCGCTG CTCACCGACA TGTCTGCCCC GCTGGGCCGG ACGGCGGGCA ACGGCCTCGA GGTCATCGAG TCCGTCGAGG TCCTCTCCGG CGGGGGACCG GCGGACGTGC GCGAGCTGAC GATCGCCCTG GCCCGCGAGA TGGTGCTGGC CTCCGGCATC GAGGGGGCCG ATGCGGTGGA CGTCGCCGAG GTGCTGGACT CCGGCAAGGC GCTTCAGGTG TGGAAGCAGA TGATCGCCGC CCAGGGTGGG GACCCCGAGG CCGAGCTGGC TCTGGGGGAG CACACCGCCG AGGTCACGGC CGAGGCCGGT GGCGTCATCA CGGAGATGGA CGCCTACAAG GTGGGTGTCG CCGCGTGGCG CCTGGGTGCG GGCCGTGCCC GCAAGGAGGA CCCGGTGCAG GCCGGTGCCG GGGTGCGCTG GCACGCGGGC GTCGGGGAGC GCGTGGAGGC CGGGCAGGTG CTGTTCACCT GCTACACCGA GACGTCGGAG CGCCTGCAGC GCGGGGTGGA GACCCTCGCC GGCGCGGTGA CGATCGACAC CGGCGCCGAG CCCTTCGAGC GCACACTGGT GATCGACCGC ATCACGGCCT GA
|
Protein sequence | MTGTQHDAVD IIRTKRDGER LSDEQIDWVI DAYTRGDVAD EQMSALAMAI YLNGMEPDEI SRWTSAMIAS GERMDFSTLS RPTADKHSTG GVGDKITLPL APLVAACGVA VPQLSGRGLG HTGGTLDKLE AIPGWQADLT NEAMMQQLEE VGAVICAAGA GLAPADKKLY ALRDVTGTVE ALPLIASSIM SKKIAEGTGA LVLDVKVGSG AFMKTEADAR ALAERMVALG DAAGVTTVAL LTDMSAPLGR TAGNGLEVIE SVEVLSGGGP ADVRELTIAL AREMVLASGI EGADAVDVAE VLDSGKALQV WKQMIAAQGG DPEAELALGE HTAEVTAEAG GVITEMDAYK VGVAAWRLGA GRARKEDPVQ AGAGVRWHAG VGERVEAGQV LFTCYTETSE RLQRGVETLA GAVTIDTGAE PFERTLVIDR ITA
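The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before further analysis. The following is a minimal sketch, not part of the original record, that removes the display whitespace and computes a GC fraction; the helper names are hypothetical and the example input is only the first two blocks of the gene sequence.

```python
def clean_sequence(displayed: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as an example.
example = clean_sequence("ATGACCGGAA CGCAGCACGA")
print(len(example), gc_fraction(example))
```

Applying the same two functions to the full gene sequence would allow a comparison against the Gene Length and GC content fields.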