Gene YpAngola_A4006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4006 
Symbol 
ID5802485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4260089 
End bp4261249 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content53% 
IMG OID641341792 
ProductRND family efflux transporter MFP subunit 
Protein accessionYP_001608299 
Protein GI162421607 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGTA AAATTATTTT GGCCTGTTTG GTGTTCACGC TTGTCGCTTG TGATCAATCC 
TCTTCACCTT CAGCGACACC CTCGCGACAA GAGGTGGGGG TGGTGACCTT AAAAACTCAA
CCGGTCACAT TGAGTAGCGA TTTATCAGGC CGAACGGTGG CGGCAATGAC GTCTGAAGTC
AGGCCACAGG TCGATGGTAT CATCAAGAAA CGCTTGTTTA CTGAAGGATC TGAAGTCACC
GCGGGACAAG TGCTCTATCA AATAGACCCT GCTAGTTATC AGGCCGCTTA TGACACCGCA
AAAGCCGCGC TACAAAATGT TCAGGTCAGC GTAAAATCGG CCAAGCTGAA AGCCCAGCGC
TATGCGGCAT TAGCCAAAGA AAATGGTGTG TCGCAGCAAG ATGCTGATGA CGCCCAAACC
AGCTATCAGC AAGCGCTGGC CAATGTGGCC GAAAAAACTG CTGCGCTGGA AACGGCGCGT
ATCAATTTGG CCTATACCCA AGTGCGGGCA CCCATTTCAG GTCGCATCGG CATTTCCTCT
GTGACACCAG GGGCGCTGGT GACGGCTAAT CAAACCACGG CATTAGCCAC CATCCGTAAC
CTTGATCCCA TTTATGTTGA TTTAACCCAG TCCAGTGCGC AGTTGCTTGC TTTACGCAAA
CAGCAGCAGG CAGGCAATGA CACAGTAGCG AATGCGCCGG TTCAGCTAAC GCTGGAAGAT
GGCTCGGTTT ATGCCCACGA AGGTTCTCTG CAACTGACCG AGGTGGCCGT GGATGAAGCG
ACCGGTGCGG TGACCTTGCG TGCGAAATTC CCTAACCCTG AACACCAATT ATTGCCGGGC
ATGTTTGTTC GCGCTTCAGT ACGGAACGGT GTAAATAACA CCGCTATTTT GGCACCACAG
CAAGGCATCA CTCATGATGC GAAAGGGAAT GCTACCGCGT TGGTGGTTAA TCAACAGCAG
CAAGTTGAGC GGCGTGAAGT GGTAACCGAA CGCACGATTG ATAGCTACTG GTTGATTAGC
CGTGGGCTGG CTGCAGGTGA CCGCCTGATT GTTGAAGGAA CCGAAAAGGT CAGTGTTGGC
GATGACGTCA AACCGGTAGA GGTGAGCACT ACGCTGCCCG TTGTGGCCGA ACCGACTACG
CCATCAACTG GGGAGAAATA A
 
Protein sequence
MHSKIILACL VFTLVACDQS SSPSATPSRQ EVGVVTLKTQ PVTLSSDLSG RTVAAMTSEV 
RPQVDGIIKK RLFTEGSEVT AGQVLYQIDP ASYQAAYDTA KAALQNVQVS VKSAKLKAQR
YAALAKENGV SQQDADDAQT SYQQALANVA EKTAALETAR INLAYTQVRA PISGRIGISS
VTPGALVTAN QTTALATIRN LDPIYVDLTQ SSAQLLALRK QQQAGNDTVA NAPVQLTLED
GSVYAHEGSL QLTEVAVDEA TGAVTLRAKF PNPEHQLLPG MFVRASVRNG VNNTAILAPQ
QGITHDAKGN ATALVVNQQQ QVERREVVTE RTIDSYWLIS RGLAAGDRLI VEGTEKVSVG
DDVKPVEVST TLPVVAEPTT PSTGEK