Gene VC0395_A1928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1928 
SymboldeoA 
ID5137644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2058869 
End bp2060296 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content53% 
IMG OID640533385 
Productthymidine phosphorylase 
Protein accessionYP_001217852 
Protein GI147673836 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.939437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTGTCTGA GAGCAGCGCG AGGCTTCGCG CTGCTTAACC TCTTTACACT ACGCAGTTTC 
TTTGGGAACT GGGAGGCAGC TATGTCTTTA TCTCAAGCAA AGTATTTACC TCAAGAAATT
ATCCGCAGAA AGCGTGATGG TGAAGTTCTC ACCAACGATG AAATCAACTT CTTCATTCAA
GGTGTGGCGA ATAACACCGT CTCTGAAGGG CAAATTGCCG CGTTTGCGAT GGCGATTTTT
TTCCGTGAAA TGACTATGCC TGAGCGTATT GCACTGACGT GCGCGATGCG CGATTCCGGT
ATGGTGATTG ATTGGAGCCA CATGAATTTT GATGGTCCGA TTGTGGATAA ACACTCCACG
GGCGGCGTGG GCGATGTGAC CTCACTGATG CTTGGCCCTA TGGTGGCAGC CTGTGGCGGT
TATGTGCCGA TGATTTCTGG CCGCGGCCTT GGTCACACTG GGGGGACGCT CGACAAACTT
GAAGCCATCC CCGGCTATAA CATTACCCCA ACCAATGACG TGTTTGGCAA AGTGACCAAA
CAAGCCGGCG TGGCGATCAT CGGCCAAACT GGCGATCTGG CTCCGGCGGA TAAGCGTGTG
TACGCAACGC GTGATATTAC CGCGACAGTG GATAACATCT CGCTGATCAC CGCTTCAATT
CTTTCTAAGA AACTGGCGGC TGGCCTTGAA TCGCTCGTGA TGGATGTGAA AGTTGGCTCT
GGTGCTTTCA TGCCAACTTA CGAAGCGTCT GAAGAGCTCG CCAAATCGAT CGTGGCAGTG
GCCAACGGTG CAGGCACCAA TACCACGGCG ATCTTAACTG ATATGAACCA AGTGCTGGCG
TCTTCAGCGG GTAACGCGGT GGAAGTGCGT GAAGCAGTAC GCTTCCTCAC CGGCGAATAC
CGTAATCCGC GTCTGCTGGA AGTCACTATG GCGTCGTGTG CGGAAATGCT GGTACTTGGC
AAGTTAGCCA AAGATACCGC GCAAGCGCGT GAGAAACTGC AAGCAGTGCT GGATAACGGC
CAAGCCGCCG AGCGTTTTGG CAAAATGGTC GCAGGCCTCG GTGGCCCAAG CGATTTTGTT
GAAAACTACG ACAAGTATCT TGCGAAAGCT GAGATTGTTC GCCCAGTTTA CGCTCAGCAA
TCTGGCGTTA TTTCTGCGAT GGATACCCGT GCCATCGGCA TGGCGGTGGT CGGCATGGGC
GGTGGCCGCC GCGTGGCGAC CGATCGTATC GATTACGCTG TCGGTTTTGA TCAGTTTATC
CGTCTGGGTG AAATCGCCGA CAGCAACAAA CCTTTAGCAA TGATTCATGC CCGTAATGAA
GAGCAGTGGC AACAAGCTGC CAATGCACTA CAAAGTGCGA TTAAAGTGGG CGGTGATTAC
CTACCAACAC CGGATGTTTA CCGTCAAATT CGAGCACAAG ACGTGTAA
 
Protein sequence
MCLRAARGFA LLNLFTLRSF FGNWEAAMSL SQAKYLPQEI IRRKRDGEVL TNDEINFFIQ 
GVANNTVSEG QIAAFAMAIF FREMTMPERI ALTCAMRDSG MVIDWSHMNF DGPIVDKHST
GGVGDVTSLM LGPMVAACGG YVPMISGRGL GHTGGTLDKL EAIPGYNITP TNDVFGKVTK
QAGVAIIGQT GDLAPADKRV YATRDITATV DNISLITASI LSKKLAAGLE SLVMDVKVGS
GAFMPTYEAS EELAKSIVAV ANGAGTNTTA ILTDMNQVLA SSAGNAVEVR EAVRFLTGEY
RNPRLLEVTM ASCAEMLVLG KLAKDTAQAR EKLQAVLDNG QAAERFGKMV AGLGGPSDFV
ENYDKYLAKA EIVRPVYAQQ SGVISAMDTR AIGMAVVGMG GGRRVATDRI DYAVGFDQFI
RLGEIADSNK PLAMIHARNE EQWQQAANAL QSAIKVGGDY LPTPDVYRQI RAQDV