Gene Mmar10_0191 details


Gene Information

Locus tag: Mmar10_0191
Symbol:
ID: 4284181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 203458
End bp: 205017
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 57%
IMG OID: 638139657
Product: thymidine phosphorylase
Protein accession: YP_755425
Protein GI: 114568745
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02645] putative thymidine phosphorylase
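
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(203458, 205017))  # 1560
print(protein_length(1560))         # 519
```

Both values agree with the Gene Length and Protein Length fields of this record.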


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAAAA TGATTGACAT CAGTGCAATG CCGGCAGAGT TGAGCGAGAA AAACACGCTC 
ACAGCTCGAA GGTTAGGGAT CGATACGCAT GAGCACGCCG TTATCTATAT GCGTGCCGAC
TGTCACATTT GCCGAGCGGA AGGGTTCAAT AACCACGCGC GTATAAAGGT AAGCGGTCCG
TCAAACAAAG CGATCATTGC GACCCTAAAT CTGGTTGTAA CCGATCTGTT GTCGCCGGGT
CAGATCGGTC TGTCTGAAAC CGCTTGGCTA AGATTGCAAT TGAACGAGGG CGACCCAGTT
CGACTTATTC ACCCTGCGCC GCTTTTGTCA TTGAGCGCTG TTAGAGCCAA GATTTTTGGC
GAGCCGTTGG ACCAGAATAA TCTGGACGCA ATCGTCGGAG ACATCGCAGC AGGCCGGTTT
TCTGATATTC ATCTTTCTGC ATTTCTCACC GCCAGTGCTG CCCACGAACA GAGCTTCGAG
GAAATCCGTG ATCTCACACT CTCGATGGTC AATGTCGGCC AGCGTTTAGA CTGGGGACGC
GCACCGATCG TGGACAAGCA TTGCGTTGGC GGGTTACCGG GCAATCGCAC GACACCAATT
GTTGTCGCTA TCTGCGTGGC CGCTGGCTTG ACCATGCCCA AAACATCCTC GCGGGCGATT
ACGTCACCAG CGGGCACCGC CGATACAATG GAGACTCTCG CGCCGGTAGA GCTTGATGTT
TCGGCCATGC GGCGCGTCGT TGAGACGGTG GGCGGATGTA TCGTTTGGGG TGGTGCGGTT
GCGTTGAGCC CCGTGGACGA CACCTTGATC CGAATAGAAC GCGCGCTCGA CATCGACAGC
GATGGCCAGT TAGTCGCCTC GGTCTTGTCG AAGAAAATCG CGGCGGGCGC TACTCATTTG
GTGATCGATA TGCCCGTCGG CCCGACCGCC AAAGTGCGAA CTGAAGAAGC GGCAGCAAGA
CTAGAGGGGC TCTTCGCGCG CGTAGGCTCC AATCTTGGTC TCAACTTGAA GATTATGCGG
ACGGATGGCT CGCAACCGGT TGGACGTGGT ATTGGGCCGG CATTGGAGGC GTGGGATGTG
CTGGCCGTTC TCAACAATGA GGGACACAAC GTTCCAGACC TAACCGTTCA AGCGACTCTG
CTTGCCGGCG AGCTTCTAGA AATGGGCCAG GCAGCGCCGG CAGGCCGTGG AGCAGAGCTC
GCGACCGAAC TATTGGTCAA TGGCAGGGCA TGGCGTGCAT TTGAAGCAAT ATGCGAAGCC
CAGGGCGGGT TTCGTGAGCC GCCGACGGCG CCTCACTATC AAGTCATTCA ATCACCGCGC
GATGGTGTCG TCGGACGGAT CGATAATCGA CGACTCGCCA AGGCGGCTAA GCTGGCGGGT
GCGCCAGCGT CGAAGGCCGC CGGCATCGTG TTGCACGCGA AACTGGGAAG CCAAATGGTT
AAAGGACAAC CGCTTTACTC GCTGCATTCC CAGTCACTCG GTGAACTCTC TTACGCTCAC
GACTACCTCG CGAGCCAACC TGAAATCATC GAGATCGAGG AAGAACAATG CCCCCGCTAG
 
Protein sequence
MTKMIDISAM PAELSEKNTL TARRLGIDTH EHAVIYMRAD CHICRAEGFN NHARIKVSGP 
SNKAIIATLN LVVTDLLSPG QIGLSETAWL RLQLNEGDPV RLIHPAPLLS LSAVRAKIFG
EPLDQNNLDA IVGDIAAGRF SDIHLSAFLT ASAAHEQSFE EIRDLTLSMV NVGQRLDWGR
APIVDKHCVG GLPGNRTTPI VVAICVAAGL TMPKTSSRAI TSPAGTADTM ETLAPVELDV
SAMRRVVETV GGCIVWGGAV ALSPVDDTLI RIERALDIDS DGQLVASVLS KKIAAGATHL
VIDMPVGPTA KVRTEEAAAR LEGLFARVGS NLGLNLKIMR TDGSQPVGRG IGPALEAWDV
LAVLNNEGHN VPDLTVQATL LAGELLEMGQ AAPAGRGAEL ATELLVNGRA WRAFEAICEA
QGGFREPPTA PHYQVIQSPR DGVVGRIDNR RLAKAAKLAG APASKAAGIV LHAKLGSQMV
KGQPLYSLHS QSLGELSYAH DYLASQPEII EIEEEQCPR
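
The gene and protein sequences can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the method the database used. It assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. Table 11 also allows alternative start codons, which this sketch does not handle; that makes no difference here because the gene starts with ATG. The names `clean`, `gc_content` and `translate` are hypothetical helpers.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bp, 20 codons).
first_line = "ATGACGAAAA TGATTGACAT CAGTGCAATG CCGGCAGAGT TGAGCGAGAA AAACACGCTC"
print(translate(first_line))  # MTKMIDISAMPAELSEKNTL
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.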