Gene YpsIP31758_4161 details


Gene Information

Locus tag: YpsIP31758_4161
Symbol:
ID: 5388184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 4697020
End bp: 4698348
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 50%
IMG OID: 640867189
Product: AzgA family purine transporter
Protein accession: YP_001403103
Protein GI: 153948181
COG category: [R] General function prediction only
COG ID: [COG2252] Permeases
TIGRFAM ID:
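
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(4697020, 4698348)
protein = protein_length_aa(length)
print(length, protein)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length reported in the record.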


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.241199
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAT CAAACCTTGA TACCGAGCAG GGCCTGCTCG AACGTGTATT TAAACTGAAA 
CAGCATGGCA CCACAGCTCG TACTGAGTTG ATTGCGGGTA TCACGACTTT CTTGACCATG
GTCTATATCG TATTCGTAAA CCCGCAGATT CTCGGGGTTG CGGGTATGGA TGTGCAGGCG
GTGTTCGTGA CAACCTGCCT GATCGCCGCA TTTGGCAGCA TTTTTATGGG CTTATTGGCT
AACTTACCTG TGGCACTGGC ACCGGCGATG GGGCTTAACG CTTTCTTCGC TTTTGTGGTG
GTAGGGGCGA TGGGTATTTC TTGGCAGGTC GGTATGGGCG CTATTTTCTG GGGGGCAATC
GGTTTCCTTT TGCTAACCAT TTTCCGCATT CGTTACTGGA TGATAGCGAA CATCCCACTG
AGCCTGCGTG TGGGGATCAC AAGTGGTATT GGCCTGTTTA TTGCCATGAT GGGGTTGAAG
AATGCCGGTA TCGTGGTTGC AAACCCAGAT ACACTGGTGG CGGTGGGTAA TCTGACCTCT
CACAGTGTAC TGTTGGGTGC ACTGGGTTTC TTTATTATCG CAGTCTTGGC TTCTCGTAAT
ATTCACGCGG CAGTGCTGGT TTCTATTGTG GTTACCACAC TGATTGGCTG GGCGCTGGGT
GATGTGCATT ATTCGGGCAT TTTCTCCATG CCACCAAGTG TGACTTCTGT GGTTGGGCAG
GTTGATTTAG CTGGCGCGTT GAATATTGGT ATGGCGGGTA TTATTTTCTC CTTCATGCTG
GTTAACCTGT TTGATTCATC CGGCACATTG ATTGGTGTCA CGGATAAAGC CGGTTTAGCG
GATCATAAAG GCAAGTTTCC GCGCATGAAA CAAGCGCTGT ATGTGGACAG TATCAGCTCC
GTTGCCGGTG CTTTTATTGG TACTTCATCA GTGACCGCGT ATATCGAAAG TTCTTCCGGG
GTATCTGTTG GCGGCCGTAC CGGGTTAACC GCTGTTGTTG TCGGGATACT CTTCCTGCTG
GTGATATTTA TTTCTCCGTT GGCGGGTATG GTTCCTGCGT ATGCGGCCGC GGGCGCGCTG
ATTTATGTTG GTGTGTTGAT GACATCTAGC CTGGCACGGG TGAAGTGGGA TGATTTGACT
GAAGCCGTTC CAGCCTTTGT CACGGCTGTC ATGATGCCGT TCAGTTTCTC TATCACTGAA
GGGATCGCAC TGGGCTTTAT CTCTTATTGT TTGATGAAGT TAGGTACTGG CCGCTGGCGT
GAAATCAGCC CTTGCGTAGT GGTAGTGGCG CTACTGTTTA TGCTGAAAAT TGCGTTTGTT
GATCACTGA
 
Protein sequence
MSKSNLDTEQ GLLERVFKLK QHGTTARTEL IAGITTFLTM VYIVFVNPQI LGVAGMDVQA 
VFVTTCLIAA FGSIFMGLLA NLPVALAPAM GLNAFFAFVV VGAMGISWQV GMGAIFWGAI
GFLLLTIFRI RYWMIANIPL SLRVGITSGI GLFIAMMGLK NAGIVVANPD TLVAVGNLTS
HSVLLGALGF FIIAVLASRN IHAAVLVSIV VTTLIGWALG DVHYSGIFSM PPSVTSVVGQ
VDLAGALNIG MAGIIFSFML VNLFDSSGTL IGVTDKAGLA DHKGKFPRMK QALYVDSISS
VAGAFIGTSS VTAYIESSSG VSVGGRTGLT AVVVGILFLL VIFISPLAGM VPAYAAAGAL
IYVGVLMTSS LARVKWDDLT EAVPAFVTAV MMPFSFSITE GIALGFISYC LMKLGTGRWR
EISPCVVVVA LLFMLKIAFV DH
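
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a rough sketch, not the database's own method: it builds the codon table from the conventional TCAG ordering and assumes that translation table 11 assigns the same amino acids to codons as the standard code (differing only in which start codons are permitted). The `translate` helper is a hypothetical name.

```python
from itertools import product

# Amino acids for all 64 codons, enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is ignored. Trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, and its final nine bases.
print(translate("ATGAGTAAAT CAAACCTTGA TACCGAGCAG"))
print(translate("GATCACTGA"))
```

The first call covers the first ten codons of the gene and the second covers its last three, so the outputs can be compared with the start and end of the protein sequence listed above.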