Gene YpAngola_A2574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2574 
Symbol 
ID5801045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2693717 
End bp2694919 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content50% 
IMG OID641340443 
Productinner membrane transport protein YdhC 
Protein accessionYP_001606985 
Protein GI162419019 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00710] drug resistance transporter, Bcr/CflA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.723156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGT CATCCAGTTT CATGTTCTAT CTGGCCGGCC TGAGTATGCT GGGTTATTTG 
GCCACAGACA TGTATCTCCC CGCCTTCGGT GCAATGCAAC GCGAACTGCA AACCACCGCC
GGTGCCATCA GCGCCAGTTT AAGTATTTTT CTGGCCGGTT TTGCCATAGC ACAGCTTATC
TGGGGGCCCC TTTCAGACAA ATTGGGCCGC AAGCCTGTCC TGTTAGCGGG TTTGGGGTTA
TTTGCCATCG GTTGCTTAGG GATGTTATGG GTTGAAAATG CCACTCAATT GTTGGTTCTA
CGCTTCATAC AAGCCGTGGG CGTCTGTTCA GCGGTGGTGA GCTGGCAGGC TTTAGTGGTA
GACCGCTATC GCGACGGGAA GGCCAACCGC GTGTTTGCCA CCATAATGCC GCTGGTTGCC
TTATCACCCG CATTAGCGCC GTTATTAGGT GCCTGGTTAC TGAACCACTT CAGTTGGCGG
TCAATTTTTG TAGTTCTATT AGCCATTACA CTGCTCTTAC TGATCCCGAC AATGGTGCTG
CAAGAAAGGA AAAAAGCACG CGCCGATAAC AGCGATCAAC CTAAAAACAC GGTGAGTTTC
TGGCAACTGC TCCAATCACC CACATTCAGT GGCAACGTCA TGATCTTTGC TGCCTGTAGT
GCTGGCTTCT TTGCTTGGCT AACGGGTTCA CCTTTCATTC TGGAAAATAT GGGGTACAGC
CCAAATGTTA TTGGCTTGAG TTATGTCCCA CAGACGCTGG CTTTTCTGCT TGGTGGATTT
GGTTGCCGCA GTGCACTTGC CCACATAAAA GGGAACACCC TGCTACCCTG GCTGTTAGTG
GGCTATGCTG GCAGCATGAT TGCCCTGTAC TTGATCGCCA CCTTAACAAC GCCATCATTG
CTCACGTTGC TTATCCCTTT CTGCTTAATG GCGCTTGTGA ATGGTGCCTG CTACCCTATT
ATCGTAGCGA ATGCATTAAT GCCGTTTCCA GACAATACCG GCAAAGCGGC GGCCTTGCAG
AATACCCTGC AACTCGGTTT GTGCTTTGTG GCCAGTATGT TGGTATCGGC CTATATCAGC
CAGCCGCTCT TAGCGACAGT GACAGTGATG CTATTGACGG TAGTTTTGGC GGCTTTGGGT
TACGGCATCC ATTGTTACGC GTTACGTAAA GACAAAACCA GATTGATGAC TCGGACACCA
TAA
 
Protein sequence
MKTSSSFMFY LAGLSMLGYL ATDMYLPAFG AMQRELQTTA GAISASLSIF LAGFAIAQLI 
WGPLSDKLGR KPVLLAGLGL FAIGCLGMLW VENATQLLVL RFIQAVGVCS AVVSWQALVV
DRYRDGKANR VFATIMPLVA LSPALAPLLG AWLLNHFSWR SIFVVLLAIT LLLLIPTMVL
QERKKARADN SDQPKNTVSF WQLLQSPTFS GNVMIFAACS AGFFAWLTGS PFILENMGYS
PNVIGLSYVP QTLAFLLGGF GCRSALAHIK GNTLLPWLLV GYAGSMIALY LIATLTTPSL
LTLLIPFCLM ALVNGACYPI IVANALMPFP DNTGKAAALQ NTLQLGLCFV ASMLVSAYIS
QPLLATVTVM LLTVVLAALG YGIHCYALRK DKTRLMTRTP