Gene YpAngola_A2122 details


Gene Information

Locus tag: YpAngola_A2122
Symbol:
ID: 5800592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2215294
End bp: 2216523
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 51%
IMG OID: 641340031
Product: major facilitator transporter
Protein accession: YP_001606576
Protein GI: 162420366
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
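
The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2215294
END_BP = 2216523
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1230
print(protein_length(GENE_LENGTH_BP))  # 409
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.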


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.113023
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTGGT TTAGCGATCT CAACCAGACA GAAAGAAAAA CTTTCATTGC TGCTTTTGGC 
GGTTGGGCAT TGGATGCGCT AGATTTCATG GTTTTTACAT TCGTGATTGC TACGCTGATG
GCTCTCTGGC AAATCGACGC AGGCCAGGCC GGAATGTTGA GCACCGTCAC CCTGCTTTTC
TCCGCTATTG GTGGCTGGGG TGCGGGTATC CTTGCTGATC GTTATGGCAG GGTGCGTATC
CTGCAAATAA CTATTTTATG GTTCTCCTTA TGTACGGTAC TCATCGGATT TGCTCAGAAC
TTCGAACACG TTTTCATCCT ACGCGCGCTA CAAGGTTTAG GTTTTGGTGG TGAATGGGCC
GTAGGTTCTG TATTGATGGG GGAAATCGTC CGCGCCGAAC ATCGCGGGAA AGCCGTAGGC
ACCGTGCAAA GTGGATGGGC GATTGGTTGG GGTGCAGCAG CCTTGCTTTA CACGCTAATG
TTCTCGGTAC TATCAGAGGA GTGGGCGTGG CGCGCATTAT TCTGGATTGG TGTGCTTCCC
GCCCTTTTAG TACTCTACAT ACGTAAAAAC GTCCCAGAAC CCGCGTTATT CCTGAAATCC
CGCCACGAGC AGCAAGCCAA GGATCGACGC GTTTCTTCCT TCGCCATTTT CTCCCCCGCG
CTACTCAAGA CCACCGTTCT GGCTTCTTTA CTCTGTACCG GAGTGCAAGG CGGCTACTAC
GCCATCACAA CTTGGCTGCC CACTTTTCTC AAACTCGAAC GCAATCTCTC GGTGATCGGT
ACTGGTGGGT ATCTCATGGT GATTATATTT GGCTCTTTCG TCGGCTATAT TTGTGGCGCT
TATCTCACCG ACAAACTTGG GCGACGGGCA AATCTGATTA TCTTCTCACT GCTGTCGTGT
ATAACAATTT TTGCTTATAC CCAACTTACG CTGACCAACA CGCAGATGCT CGTTCTGGGA
TTCCCACTCG GCTTTTCGGC ATCCGGGATC TTCAGCGGCA TGGGTGCCTT TCTCACTGAG
CTTTTTCCTT CGGCGGTACG TGCTACAGGA CAAGGTTTTA CCTATAGTTT TGGTCGTGCC
GTCGGTGCAC TCTTTCCGGG GCTGGTCGGC TATCTGAGTC AGTCAAGTAG TCTGGCATTT
GCCATTGGAG TCTTCGCTGG CAGTGCCTAT TGCATCGTAT TGGTAATGAG CCTATTGCTG
CCCGAGACCA AAAATAAACA GCTAGAATAA
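
The GC content field reported for this gene can be recomputed from a sequence in the blocked layout used here. This is a minimal sketch, assuming the reported figure is simply the percentage of G and C bases in the gene sequence; `gc_percent` is a hypothetical helper name, and only the first row of the sequence is used as demo input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, used only as a short demo input.
first_row = "ATGCGTTGGT TTAGCGATCT CAACCAGACA GAAAGAAAAA CTTTCATTGC TGCTTTTGGC"
print(round(gc_percent(first_row), 1))
```

Passing the full 1230 bp sequence instead of the first row gives the value to compare against the table.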
 
Protein sequence
MRWFSDLNQT ERKTFIAAFG GWALDALDFM VFTFVIATLM ALWQIDAGQA GMLSTVTLLF 
SAIGGWGAGI LADRYGRVRI LQITILWFSL CTVLIGFAQN FEHVFILRAL QGLGFGGEWA
VGSVLMGEIV RAEHRGKAVG TVQSGWAIGW GAAALLYTLM FSVLSEEWAW RALFWIGVLP
ALLVLYIRKN VPEPALFLKS RHEQQAKDRR VSSFAIFSPA LLKTTVLASL LCTGVQGGYY
AITTWLPTFL KLERNLSVIG TGGYLMVIIF GSFVGYICGA YLTDKLGRRA NLIIFSLLSC
ITIFAYTQLT LTNTQMLVLG FPLGFSASGI FSGMGAFLTE LFPSAVRATG QGFTYSFGRA
VGALFPGLVG YLSQSSSLAF AIGVFAGSAY CIVLVMSLLL PETKNKQLE
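
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a sketch rather than the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons). The `translate` function is illustrative, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments for translation
# table 11 are the same as in the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 30 bases of the gene sequence.
print(translate("ATGCGTTGGT TTAGCGATCT CAACCAGACA"))  # MRWFSDLNQT
print(translate("CCCGAGACCA AAAATAAACA GCTAGAATAA"))  # PETKNKQLE
```

The two outputs match the first ten and last nine residues of the protein sequence listed above; the final TAA codon is the stop codon and yields no residue.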