Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_1274 |
Symbol | |
ID | 8664549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 1311295 |
End bp | 1312575 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Thymidine phosphorylase |
Protein accession | YP_003337015 |
Protein GI | 271962819 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.643295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCGA TCGACGTCAT CGTCACCAAG CGGGACGGCC GGGAGCTGTC CGTCGAGCAG ATCGACTGGG TGATCGACGC CTACACCAAG GGTGTCGTCG CCGACGAGCA GATGTCCTCC CTTGCGATGG CGATCCTGCT CAACGGCATG ACCCGGCGGG AGATCGGCGA GTGGACCCAG GCCATGATCC GGTCCGGGGC CCGGATGGAC TGGTCGGCGC TGCCCGGCCG TACCACCGAC AAGCACTCCA CCGGCGGTGT CGGCGACAAG ATCACGCTGC CGCTGGCCCC GCTGGTCGCC GCGTGCGGCG GCTACGTGCC CCAGCTCTCC GGCCGGGGAC TCGGCCACAC CGGCGGCACC CTGGACAAAC TGGAGTCCAT TCCGGGCTGG CGGGCGTCCC TGTCCAACCA GGAGATGCTC GACGTGCTCG GCGAGGCCGG CGCGGTCATC TGCGCGGCGG GTGACGGGCT GGCCCCGGCC GACAAGAAGC TCTACGCGCT GCGCGACGTC ACCGGCACCG TCGAGTCGAT CCCGCTGATC GCCTCCTCGA TCATGTCGAA GAAGATCGCG GAGGGCACCG GGGCGCTGGT GCTGGACGTC AAGGTCGGCT CCGGGGCCTT CATGAAGACC GTGGAGCGGG CCCGCGAGCT GGCCACGACC ATGGTCGAGC TGGGCACCGA CGCCGGTGTC GAGACCGTCG CGCTGCTCAC CGCCATGGAC CGGCCGCTGG GCCGCGCCGT GGGCAACGCC CTGGAGGTCG CCGAGTCCGT CGAGGTGCTC GCCGGCGGCG GGCCCGACGA CGTGGTCGAG CTGACCGTAC GGCTGGCGCG CGAGATGCTG CAGGCCGCGG GCCTGTCCGG GGGCAAGGAC CCGGAGCAGG CGCTGAAGGA CGGTTCGGCG ATGGACGTCT GGCGCCGGAT GATCAGCGCG CAGGGCGGCG ACCCGGATGC CCTGCTGCCG AAGGCCGCCG AGACCCTGGA GATCACCGCG CCCTCGTCCG GGGTGCTGGC CGGACTCGAC GCGTACGGCG TGGGCCTGGC CGCCTGGCGG CTCGGCGCGG GCCGGGAGCG CAAGGAGGAC CCGGTGTCCT TCGGCGCGGG CATCATGCTG CACGCCAGGC CGGGCGACCT GGTCCGCGAG GGGCAGCCGC TGATGACCCT GCACGCCGAC GAGACGTCCC GCTTCGAGCG GGCGCTCGCC GCACTGGAGG GCGCCTACGT GATCGGCGAG ACCGCCGACC CCGGCCTGCT CCCGCTGGTG ATCGACCGCA TCACCGCCTG A
|
Protein sequence | MDAIDVIVTK RDGRELSVEQ IDWVIDAYTK GVVADEQMSS LAMAILLNGM TRREIGEWTQ AMIRSGARMD WSALPGRTTD KHSTGGVGDK ITLPLAPLVA ACGGYVPQLS GRGLGHTGGT LDKLESIPGW RASLSNQEML DVLGEAGAVI CAAGDGLAPA DKKLYALRDV TGTVESIPLI ASSIMSKKIA EGTGALVLDV KVGSGAFMKT VERARELATT MVELGTDAGV ETVALLTAMD RPLGRAVGNA LEVAESVEVL AGGGPDDVVE LTVRLAREML QAAGLSGGKD PEQALKDGSA MDVWRRMISA QGGDPDALLP KAAETLEITA PSSGVLAGLD AYGVGLAAWR LGAGRERKED PVSFGAGIML HARPGDLVRE GQPLMTLHAD ETSRFERALA ALEGAYVIGE TADPGLLPLV IDRITA
|
| |