Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1928 |
Symbol | deoA |
ID | 5137644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2058869 |
End bp | 2060296 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640533385 |
Product | thymidine phosphorylase |
Protein accession | YP_001217852 |
Protein GI | 147673836 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.939437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTGTCTGA GAGCAGCGCG AGGCTTCGCG CTGCTTAACC TCTTTACACT ACGCAGTTTC TTTGGGAACT GGGAGGCAGC TATGTCTTTA TCTCAAGCAA AGTATTTACC TCAAGAAATT ATCCGCAGAA AGCGTGATGG TGAAGTTCTC ACCAACGATG AAATCAACTT CTTCATTCAA GGTGTGGCGA ATAACACCGT CTCTGAAGGG CAAATTGCCG CGTTTGCGAT GGCGATTTTT TTCCGTGAAA TGACTATGCC TGAGCGTATT GCACTGACGT GCGCGATGCG CGATTCCGGT ATGGTGATTG ATTGGAGCCA CATGAATTTT GATGGTCCGA TTGTGGATAA ACACTCCACG GGCGGCGTGG GCGATGTGAC CTCACTGATG CTTGGCCCTA TGGTGGCAGC CTGTGGCGGT TATGTGCCGA TGATTTCTGG CCGCGGCCTT GGTCACACTG GGGGGACGCT CGACAAACTT GAAGCCATCC CCGGCTATAA CATTACCCCA ACCAATGACG TGTTTGGCAA AGTGACCAAA CAAGCCGGCG TGGCGATCAT CGGCCAAACT GGCGATCTGG CTCCGGCGGA TAAGCGTGTG TACGCAACGC GTGATATTAC CGCGACAGTG GATAACATCT CGCTGATCAC CGCTTCAATT CTTTCTAAGA AACTGGCGGC TGGCCTTGAA TCGCTCGTGA TGGATGTGAA AGTTGGCTCT GGTGCTTTCA TGCCAACTTA CGAAGCGTCT GAAGAGCTCG CCAAATCGAT CGTGGCAGTG GCCAACGGTG CAGGCACCAA TACCACGGCG ATCTTAACTG ATATGAACCA AGTGCTGGCG TCTTCAGCGG GTAACGCGGT GGAAGTGCGT GAAGCAGTAC GCTTCCTCAC CGGCGAATAC CGTAATCCGC GTCTGCTGGA AGTCACTATG GCGTCGTGTG CGGAAATGCT GGTACTTGGC AAGTTAGCCA AAGATACCGC GCAAGCGCGT GAGAAACTGC AAGCAGTGCT GGATAACGGC CAAGCCGCCG AGCGTTTTGG CAAAATGGTC GCAGGCCTCG GTGGCCCAAG CGATTTTGTT GAAAACTACG ACAAGTATCT TGCGAAAGCT GAGATTGTTC GCCCAGTTTA CGCTCAGCAA TCTGGCGTTA TTTCTGCGAT GGATACCCGT GCCATCGGCA TGGCGGTGGT CGGCATGGGC GGTGGCCGCC GCGTGGCGAC CGATCGTATC GATTACGCTG TCGGTTTTGA TCAGTTTATC CGTCTGGGTG AAATCGCCGA CAGCAACAAA CCTTTAGCAA TGATTCATGC CCGTAATGAA GAGCAGTGGC AACAAGCTGC CAATGCACTA CAAAGTGCGA TTAAAGTGGG CGGTGATTAC CTACCAACAC CGGATGTTTA CCGTCAAATT CGAGCACAAG ACGTGTAA
|
Protein sequence | MCLRAARGFA LLNLFTLRSF FGNWEAAMSL SQAKYLPQEI IRRKRDGEVL TNDEINFFIQ GVANNTVSEG QIAAFAMAIF FREMTMPERI ALTCAMRDSG MVIDWSHMNF DGPIVDKHST GGVGDVTSLM LGPMVAACGG YVPMISGRGL GHTGGTLDKL EAIPGYNITP TNDVFGKVTK QAGVAIIGQT GDLAPADKRV YATRDITATV DNISLITASI LSKKLAAGLE SLVMDVKVGS GAFMPTYEAS EELAKSIVAV ANGAGTNTTA ILTDMNQVLA SSAGNAVEVR EAVRFLTGEY RNPRLLEVTM ASCAEMLVLG KLAKDTAQAR EKLQAVLDNG QAAERFGKMV AGLGGPSDFV ENYDKYLAKA EIVRPVYAQQ SGVISAMDTR AIGMAVVGMG GGRRVATDRI DYAVGFDQFI RLGEIADSNK PLAMIHARNE EQWQQAANAL QSAIKVGGDY LPTPDVYRQI RAQDV
|
| |