Gene Ajs_2690 details

Gene Information

Locus tag: Ajs_2690
Symbol:
ID: 4671304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax sp. JS42
Kingdom: Bacteria
Replicon accession: NC_008782
Strand:
Start bp: 2852963
End bp: 2854504
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 70%
IMG OID: 639839747
Product: thymidine phosphorylase
Protein accession: YP_986911
Protein GI: 121595015
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02645] putative thymidine phosphorylase
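
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2852963, 2854504)
print(length)                           # gene length in bp
print(expected_protein_length(length))  # protein length in aa
```

Under these assumptions the computed values agree with the listed 1542 bp and 513 aa.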


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.44946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCGC AAATGCATGA AGCCGGCCGC ACCAAGGAGG CCGGGAACAG GCTGCAGGCG 
TGGCGCACGG GCATCGACAC CTATCAGGAG CCGGTGGTCT ACATGCGGCG CGATTGCCCG
GTCTGTCGAT CGGAGGGTTT CACGACCCAG GCGCGCGTGC AATTGACGGC CGGCGGCCGA
AGCATCGTGG CGACGCTCAG TGTCGTCGAC GGTGATTGGC TGGCCGAGAA CGTCGCCGGT
CTGTCCGAGT CCGCGTGGGC ATCGCTGGGT GCGCAGCCGG GTGAGCCGGT CGCGGTCACC
CATGCGCCAC CGCTGGATTC GCTCAGCCAT GTCCGGGCCA AGGTCTACGG CAACTCCCTT
GGCGACGCCC AGTTCGGCGC GATCATCTCC GACGTCGCGG CGGGCCGTTA CTCGGACCTG
CACCTCGCCA CCTTCATCAC CGCCTGCGCG GGTGATCGCC TTGACCTGGC GGAGACGCTG
TCGCTCACGA AAGCCATGAT CGCCGTCGGC GACCGCATCG ATTGGGGTCG TCCGCTGGTC
GTGGACAAGC ACTGCGTCGG CGGCCTGCCG GGGAATCGCA CGACCCTGCT CGTGGTGCCC
ATCGTGACCG CCTGCGGCCT CATGATGCCA AAGACCTCCT CGCGCGCCAT CACCTCACCG
GCCGGGACGG CCGACACGAT GGAGGTGCTG GCACCGGTCA ACCTGGATGT GCCGAGCATG
CGACGCGTGG TCGAGCGCAC GGGCGGCTGC ATCGTGTGGG GCGGCTCGGT CCGGCTCAGC
CCCGCCGACG ACATCCTTAT CCGCGTCGAG CGGCCACTCG ATCTCGACAG CGAAGGGCAG
CTCGTGGCCT CGGTTCTGTC CAAGAAGGCC GCTGCGGGTT CGACCCATGT GCTGATCGAT
CTGCCCGTGG GCGCCACCGC GAAAGTGCGC AGCGCACACG CCGCGGCGTC GCTCGGCCGG
CGCCTGCAGG AGGTGGGTGG CGCCATAGGC CTGCAGGTGT TCTTGCGCGT CACCGACGGC
GAACAGCCGG TTGGCCGCGG GATCGGTCCC GCACTGGAAG CCCGCGACGT GCTGGCCGTC
CTGCAAGGGA CGCGCGAGGC CCCGGCCGAC CTGCGAGAGC GGGCGCTGCG GCTGGCGGCG
GACATCCTCG AAATGGGCGG CGCGGCGCCG GCCGGCGGCG GCCTGAAGCT CGCCACCGAG
GTGCTGGCCG ATGGCCGGGC CTGGGCCAAG TTCCAGGCAA TCTGCTCTGA GCAGGGCGGC
CTTCGCAGCC TCCCGATGGC GGCGCATCTG CACACCGTCG AATCGCCGGG CACGGGCCGT
GTCACCCGCA TCGACAACCG CCTGCTCGCG CGGGCGGCCA AGCTCGCCGG TGCGCCGACC
GCCCCGGCCG CCGGCATCGA TGTGCACGCG CGCCTTGGCG ATCGCGTCGA AGCCGGACAG
CCGCTGTTCA CGCTGCATGC CCAGGCACCA GGCGAACTGG CCTATGCCCT GGAGTTCGTC
CGCGCGCGCC CACCGATCTT CCAGATTTCA GAGAACGTAT GA
 
Protein sequence
MTAQMHEAGR TKEAGNRLQA WRTGIDTYQE PVVYMRRDCP VCRSEGFTTQ ARVQLTAGGR 
SIVATLSVVD GDWLAENVAG LSESAWASLG AQPGEPVAVT HAPPLDSLSH VRAKVYGNSL
GDAQFGAIIS DVAAGRYSDL HLATFITACA GDRLDLAETL SLTKAMIAVG DRIDWGRPLV
VDKHCVGGLP GNRTTLLVVP IVTACGLMMP KTSSRAITSP AGTADTMEVL APVNLDVPSM
RRVVERTGGC IVWGGSVRLS PADDILIRVE RPLDLDSEGQ LVASVLSKKA AAGSTHVLID
LPVGATAKVR SAHAAASLGR RLQEVGGAIG LQVFLRVTDG EQPVGRGIGP ALEARDVLAV
LQGTREAPAD LRERALRLAA DILEMGGAAP AGGGLKLATE VLADGRAWAK FQAICSEQGG
LRSLPMAAHL HTVESPGTGR VTRIDNRLLA RAAKLAGAPT APAAGIDVHA RLGDRVEAGQ
PLFTLHAQAP GELAYALEFV RARPPIFQIS ENV
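
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a minimal sketch using only the Python standard library. It assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-residue mapping in NCBI order (bases T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the listing in blocks of 10."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first 30 bases of the gene sequence, as listed above.
fragment = clean_sequence("ATGACAGCGC AAATGCATGA AGCCGGCCGC")
print(translate(fragment))  # → MTAQMHEAGR
```

The printed residues match the first ten residues of the listed protein sequence. Applied to the full gene sequence, `translate` would be expected to return a 513-residue string if the listing is intact, and `gc_content` can be compared against the reported 70%.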