Gene YpAngola_A2443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2443 
Symbol 
ID5800913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2563146 
End bp2564351 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content52% 
IMG OID641340318 
Productmultidrug resistance protein MdtH 
Protein accessionYP_001606862 
Protein GI162419997 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.313716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGG TATCGCAAGC TCGAAGCTTG GGTAAGTATT TTTTATTGTT TGATAATTTA 
TTAGTTGTGT TGGGATTTTT TGTCGTATTC CCGCTCATTT CTATCCGATT TGTCGATCAA
TTAGGTTGGG CTGCGTTGGT CGTGGGCCTT GCTTTAGGCC TAAGGCAATT GGTTCAACAG
GGATTGGGGA TCTTTGGCGG GGCCATTGCC GACCGGTTCG GTGCCAAACC GATGATTGTC
ACCGGGATGC TAATGCGAGC CGCGGGTTTT GCCTTGATGG CCATGGCGGA TGAACCTTGG
ATACTTTGGC TGGCCTGTGC GCTGTCAGGG CTGGGCGGCA CCCTGTTTGA TCCACCACGC
ACAGCGTTGG TCATCAAGTT GACCCGCCCC CATGAACGAG GCCGTTTTTA TTCGCTGCTA
ATGATGCAGG ATAGCGCCGG AGCGGTAATT GGTGCGTTGA TTGGCAGTTG GCTGCTGCAA
TATGACTTCC ACTTCGTCTG TTGGACCGGT GCGGCTATTT TTGTGCTGGC AGCCGGTTGG
AACGCCTGGT TACTCCCCGC TTACCGTATC TCGACCGTTC GCGCCCCGAT GAAAGAGGGC
CTGATGCGAG TGTTGCGTGA TCGCCGTTTT GTGACCTATG TGCTGACGCT TACTGGCTAT
TATATGCTCG CCGTACAGGT CATGCTGATG CTGCCGATCG TGGTGAATGA ACTTGCTGGC
TCACCGGCCG CAGTAAAATG GATGTATGCC ATTGAAGCCG CACTTTCTCT GACGCTGCTT
TATCCGCTGG CCCGTTGGAG TGAAAAGCGT TTTAGCCTTG AGCAACGCTT AATGGCCGGC
TTACTGATCA TGACGCTCAG TTTGTTCCCC ATCGGGATGA TCACTCATCT ACAAACACTG
TTCATGTTTA TCTGTTTCTT CTATATGGGA TCAATCCTCG CTGAACCCGC CCGTGAAACA
CTGGGAGCAT CGCTGGCTGA CTCCCGCGCC CGTGGTAGCT ATATGGGCTT TAGTCGCCTT
GGATTAGCGC TGGGGGGGGC ATTAGGTTAT ACCGGCGGTG GCTGGATGTA CGATACCGGC
AAAACCCTTG ATATGCCTGA GCTACCCTGG TTCTTGCTGG GTATTATTGG ACTCATTACA
TTGGCAGGGC TTTACTGGCA ATTCAATCGA CGGCGGATTG AATCTGCCAT GTTGAGCAGC
AGCTAA
 
Protein sequence
MALVSQARSL GKYFLLFDNL LVVLGFFVVF PLISIRFVDQ LGWAALVVGL ALGLRQLVQQ 
GLGIFGGAIA DRFGAKPMIV TGMLMRAAGF ALMAMADEPW ILWLACALSG LGGTLFDPPR
TALVIKLTRP HERGRFYSLL MMQDSAGAVI GALIGSWLLQ YDFHFVCWTG AAIFVLAAGW
NAWLLPAYRI STVRAPMKEG LMRVLRDRRF VTYVLTLTGY YMLAVQVMLM LPIVVNELAG
SPAAVKWMYA IEAALSLTLL YPLARWSEKR FSLEQRLMAG LLIMTLSLFP IGMITHLQTL
FMFICFFYMG SILAEPARET LGASLADSRA RGSYMGFSRL GLALGGALGY TGGGWMYDTG
KTLDMPELPW FLLGIIGLIT LAGLYWQFNR RRIESAMLSS S