Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0191 |
Symbol | |
ID | 4284181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 203458 |
End bp | 205017 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 638139657 |
Product | thymidine phosphorylase |
Protein accession | YP_755425 |
Protein GI | 114568745 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAAA TGATTGACAT CAGTGCAATG CCGGCAGAGT TGAGCGAGAA AAACACGCTC ACAGCTCGAA GGTTAGGGAT CGATACGCAT GAGCACGCCG TTATCTATAT GCGTGCCGAC TGTCACATTT GCCGAGCGGA AGGGTTCAAT AACCACGCGC GTATAAAGGT AAGCGGTCCG TCAAACAAAG CGATCATTGC GACCCTAAAT CTGGTTGTAA CCGATCTGTT GTCGCCGGGT CAGATCGGTC TGTCTGAAAC CGCTTGGCTA AGATTGCAAT TGAACGAGGG CGACCCAGTT CGACTTATTC ACCCTGCGCC GCTTTTGTCA TTGAGCGCTG TTAGAGCCAA GATTTTTGGC GAGCCGTTGG ACCAGAATAA TCTGGACGCA ATCGTCGGAG ACATCGCAGC AGGCCGGTTT TCTGATATTC ATCTTTCTGC ATTTCTCACC GCCAGTGCTG CCCACGAACA GAGCTTCGAG GAAATCCGTG ATCTCACACT CTCGATGGTC AATGTCGGCC AGCGTTTAGA CTGGGGACGC GCACCGATCG TGGACAAGCA TTGCGTTGGC GGGTTACCGG GCAATCGCAC GACACCAATT GTTGTCGCTA TCTGCGTGGC CGCTGGCTTG ACCATGCCCA AAACATCCTC GCGGGCGATT ACGTCACCAG CGGGCACCGC CGATACAATG GAGACTCTCG CGCCGGTAGA GCTTGATGTT TCGGCCATGC GGCGCGTCGT TGAGACGGTG GGCGGATGTA TCGTTTGGGG TGGTGCGGTT GCGTTGAGCC CCGTGGACGA CACCTTGATC CGAATAGAAC GCGCGCTCGA CATCGACAGC GATGGCCAGT TAGTCGCCTC GGTCTTGTCG AAGAAAATCG CGGCGGGCGC TACTCATTTG GTGATCGATA TGCCCGTCGG CCCGACCGCC AAAGTGCGAA CTGAAGAAGC GGCAGCAAGA CTAGAGGGGC TCTTCGCGCG CGTAGGCTCC AATCTTGGTC TCAACTTGAA GATTATGCGG ACGGATGGCT CGCAACCGGT TGGACGTGGT ATTGGGCCGG CATTGGAGGC GTGGGATGTG CTGGCCGTTC TCAACAATGA GGGACACAAC GTTCCAGACC TAACCGTTCA AGCGACTCTG CTTGCCGGCG AGCTTCTAGA AATGGGCCAG GCAGCGCCGG CAGGCCGTGG AGCAGAGCTC GCGACCGAAC TATTGGTCAA TGGCAGGGCA TGGCGTGCAT TTGAAGCAAT ATGCGAAGCC CAGGGCGGGT TTCGTGAGCC GCCGACGGCG CCTCACTATC AAGTCATTCA ATCACCGCGC GATGGTGTCG TCGGACGGAT CGATAATCGA CGACTCGCCA AGGCGGCTAA GCTGGCGGGT GCGCCAGCGT CGAAGGCCGC CGGCATCGTG TTGCACGCGA AACTGGGAAG CCAAATGGTT AAAGGACAAC CGCTTTACTC GCTGCATTCC CAGTCACTCG GTGAACTCTC TTACGCTCAC GACTACCTCG CGAGCCAACC TGAAATCATC GAGATCGAGG AAGAACAATG CCCCCGCTAG
|
Protein sequence | MTKMIDISAM PAELSEKNTL TARRLGIDTH EHAVIYMRAD CHICRAEGFN NHARIKVSGP SNKAIIATLN LVVTDLLSPG QIGLSETAWL RLQLNEGDPV RLIHPAPLLS LSAVRAKIFG EPLDQNNLDA IVGDIAAGRF SDIHLSAFLT ASAAHEQSFE EIRDLTLSMV NVGQRLDWGR APIVDKHCVG GLPGNRTTPI VVAICVAAGL TMPKTSSRAI TSPAGTADTM ETLAPVELDV SAMRRVVETV GGCIVWGGAV ALSPVDDTLI RIERALDIDS DGQLVASVLS KKIAAGATHL VIDMPVGPTA KVRTEEAAAR LEGLFARVGS NLGLNLKIMR TDGSQPVGRG IGPALEAWDV LAVLNNEGHN VPDLTVQATL LAGELLEMGQ AAPAGRGAEL ATELLVNGRA WRAFEAICEA QGGFREPPTA PHYQVIQSPR DGVVGRIDNR RLAKAAKLAG APASKAAGIV LHAKLGSQMV KGQPLYSLHS QSLGELSYAH DYLASQPEII EIEEEQCPR
|
| |