Gene YpAngola_A3250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3250 
Symbol 
ID5801728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3451924 
End bp3453084 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content54% 
IMG OID641341080 
Productmajor facilitator transporter 
Protein accessionYP_001607602 
Protein GI162420198 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00000228404 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGTTG CACTACTGGC GTTGGCGCTG TGTGCTTTTG CTATTGGTAC TACTGAATTT 
GTCATTATGG GGTTATTGCC CCAGGTAGCG GGTGATTTGC ACATATCGAT TCCAACTGCG
GGCTGGTTGA TCAGTGGTTA TGCGCTAGGT GTGGCAATTG GCGCGCCCAT CATGGCAGTG
CTGACCGCGA AATTACCGCG CAAGAAGACA CTGTTACTGT TAATGGTGAT TTTTATCATC
GGTAACCTGA TGTGTGCCTT GGCATACAGT TATGACTTCC TGATGTTCGC GCGAGTGATC
ACCGCGTTGT GTCATGGGGC CTTTTTTGGT ATCGGCGCGG TGGTCGCCGC AAATCTGGTG
GCACCAAACC GGCGGGCCTC GGCGGTGGCA CTGATGTTTA CGGGGCTGAC GCTGGCGAAT
GTATTGGGTG TCCCACTGGG GACCGCTCTA GGTCAGGCTT TTGGCTGGCG TTCGACATTT
TGGGTGGTAT CGGTCATCGG TTTGTTCTCG TTGGCAGCCC TGTATAGCAA GTTGCCCTCC
TCCAGCGAGG AAGCACCGAC TGAGCTTCGT AAGGAGATTG CCGCTTTGCG TGGCGGTGGA
ATTTGGCTCT CCTTACTGAT GACCGTATTT TTTGCCGCAG CCATGTTTGC GCTCTTTACC
TACATCGCCC CCATTTTGAC GGAGGTCACA CAGGTTTCTG AGCATGGCGT CAGTTGGACG
TTACTGCTAA TGGGGGTTGG CTTGACGCTC GGTAATATCG TCGGGGGCAG GCTAGCTGAC
TGGCGTTTAT CGGTCAGTTT AACCATGACA TTCTTGTTGA TCGCGGTATT TTCTGCCCTG
TTTAGTTGGA CCAGTTATTC ACTGTTGGCG GCGGAAGTGA CACTGTTTTT GTGGTCAGCC
GCCGCATTTT CTGCAGTGCC TGCGTTGCAA ATTAATGTCG TCGCTTATGG CAAGAAAGCC
CCTAATCTGG TGTCAACGCT GAATATTGCG GCCTTTAATG TGGGTAACGC CTTAGGGGCG
TGGGTCGGGG GGGTTGTGAT TGCCAAAGGG CTTGGTTTGA CGGCGGTGCC GCTGGCCGCC
GCGGCACTGG CGGTCATGGG GTTGTTGCTG TGTCTGTTTA CCTTTTCCCG CGCGCGTACT
ATTGGGAATA AAATGGCTTA G
 
Protein sequence
MPVALLALAL CAFAIGTTEF VIMGLLPQVA GDLHISIPTA GWLISGYALG VAIGAPIMAV 
LTAKLPRKKT LLLLMVIFII GNLMCALAYS YDFLMFARVI TALCHGAFFG IGAVVAANLV
APNRRASAVA LMFTGLTLAN VLGVPLGTAL GQAFGWRSTF WVVSVIGLFS LAALYSKLPS
SSEEAPTELR KEIAALRGGG IWLSLLMTVF FAAAMFALFT YIAPILTEVT QVSEHGVSWT
LLLMGVGLTL GNIVGGRLAD WRLSVSLTMT FLLIAVFSAL FSWTSYSLLA AEVTLFLWSA
AAFSAVPALQ INVVAYGKKA PNLVSTLNIA AFNVGNALGA WVGGVVIAKG LGLTAVPLAA
AALAVMGLLL CLFTFSRART IGNKMA