Gene YpAngola_A4187 details


Gene Information

Locus tag: YpAngola_A4187
Symbol:
ID: 5802667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4478216
End bp: 4479544
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 50%
IMG OID: 641341954
Product: AzgA family purine transporter
Protein accession: YP_001608457
Protein GI: 162418663
COG category: [R] General function prediction only
COG ID: [COG2252] Permeases
TIGRFAM ID:
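
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(4478216, 4479544)
print(length_bp, protein_length(length_bp))  # prints: 1329 442
```

Both results agree with the Gene Length and Protein Length fields of this record.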


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.13379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0103225
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAAT CAAACCTTGA TACCGAGCAG GGCCTGCTCG AACGTGTATT TAAACTGAAA 
CAGCATGGCA CCACAGCTCG TACTGAGCTG ATTGCGGGTA TCACGACTTT CTTGACCATG
GTCTATATCG TATTCGTAAA CCCGCAGATT CTCGGGGTTG CGGGTATGGA TGTGCAGGCG
GTGTTCGTGA CAACCTGCCT GATCGCCGCA TTTGGCAGCA TTTTTATGGG CTTATTGGCT
AACTTACCTG TGGCACTGGC ACCGGCGATG GGGCTCAACG CTTTCTTCGC TTTTGTGGTG
GTGGGGGCGA TGGGTATTTC TTGGCAGGTC GGTATGGGCG CTATTTTCTG GGGGGCAATC
GGTTTCCTTT TGCTAACCAT TTTCCGCATT CGTTACTGGA TGATAGCGAA CATCCCACTG
AGCCTGCGTG TGGGGATCAC AAGTGGTATT GGCCTGTTTA TTGCCATGAT GGGGTTGAAG
AATGCCGGTA TCGTGGTCGC AAACCCAGAT ACACTGGTGG CGGTGGGTAA TCTGACCTCT
CACAGTGTAC TGTTGGGTGC ACTGGGTTTC TTTATTATCG CAGTCTTGGC TTCTCGTAAT
ATTCACGCGG CAGTGCTGGT TTCTATTGTG GTTACCACAC TGATTGGCTG GGCGCTGGGT
GATGTGCATT ATTCGGGCGT TTTCTCCATG CCACCAAGTG TGACTTCTGT GGTTGGGCAG
GTTGATTTAG CTGGCGCGTT GAATATTGGT ATGGCGGGTA TTATTTTCTC CTTCATGCTG
GTTAACCTGT TTGATTCATC CGGCACATTG ATTGGTGTCA CGGATAAAGC CGGTTTAGCG
GATCATAAAG GCAAGTTTCC GCGCATGAAA CAAGCGCTGT ATGTGGACAG TATCAGCTCC
GTTGCCGGTG CTTTTATTGG TACTTCATCA GTGACCGCGT ATATCGAAAG TTCTTCCGGG
GTATCTGTTG GCGGCCGTAC CGGGTTAACC GCTGTTGTTG TCGGGATACT CTTCCTGCTG
GTGATATTTA TTTCTCCGTT GGCGGGTATG GTTCCTGCGT ATGCGGCCGC GGGCGCGCTG
ATTTATGTTG GCGTGTTGAT GACATCTAGC CTGGCACGGG TGAAGTGGGA TGATTTGACT
GAAGCCGTTC CAGCCTTTGT CACGGCTGTC ATGATGCCGT TCAGTTTCTC TATCACTGAA
GGGATCGCAC TGGGCTTTAT CTCTTATTGT TTGATGAAAT TAGGTACTGG CCGCTGGCGT
GAAATCAGCC CTTGCGTAGT GGTAGTGGCG CTACTGTTTA TGCTGAAAAT TGCGTTTGTT
GATCACTGA
 
Protein sequence
MSKSNLDTEQ GLLERVFKLK QHGTTARTEL IAGITTFLTM VYIVFVNPQI LGVAGMDVQA 
VFVTTCLIAA FGSIFMGLLA NLPVALAPAM GLNAFFAFVV VGAMGISWQV GMGAIFWGAI
GFLLLTIFRI RYWMIANIPL SLRVGITSGI GLFIAMMGLK NAGIVVANPD TLVAVGNLTS
HSVLLGALGF FIIAVLASRN IHAAVLVSIV VTTLIGWALG DVHYSGVFSM PPSVTSVVGQ
VDLAGALNIG MAGIIFSFML VNLFDSSGTL IGVTDKAGLA DHKGKFPRMK QALYVDSISS
VAGAFIGTSS VTAYIESSSG VSVGGRTGLT AVVVGILFLL VIFISPLAGM VPAYAAAGAL
IYVGVLMTSS LARVKWDDLT EAVPAFVTAV MMPFSFSITE GIALGFISYC LMKLGTGRWR
EISPCVVVVA LLFMLKIAFV DH
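
The protein sequence above is the translation of the gene sequence under translation table 11. The following is a minimal Python sketch of that relationship, assuming the standard codon assignments (table 11 shares them with the standard code for elongation and differs mainly in which start codons it permits). `CODON_TABLE` and `translate` are hypothetical helper names, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the listing) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAGTAAAT CAAACCTTGA TACCGAGCAG"))  # prints: MSKSNLDTEQ
print(translate("GATCACTGA"))                         # prints: DH
```

The two fragments reproduce the first ten and last two residues of the listed protein sequence. Passing the full 1329 bp gene sequence to `translate` is the natural way to check the remaining residues.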