Gene YpAngola_A3891 details


Gene Information

Locus tag: YpAngola_A3891
Symbol: hasE
ID: 5802369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4128322
End bp: 4129650
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 49%
IMG OID: 641341682
Product: HlyD family hemolysin secretion protein
Protein accession: YP_001608192
Protein GI: 162418329
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID: [TIGR01843] type I secretion membrane fusion protein, HlyD family
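
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4128322, 4129650)
print(length_bp)                  # 1329
print(protein_length(length_bp))  # 442
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.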


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.76201
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCCTG AATCAGTCTG TTATTCGGTT GCGAATGCCA ATCGCCAACC ACCACTCCAA 
ATAAACAGTG GGCGCTATCT CAATATAGGT GGGGGGCTGG TCGTTATCGG CTTCATCGGC
TTTTTACTTT GGGCGGGGTT AGCGCCGTTA GATAAAGGCG TTGCCGTGAC CGGGCTGCTT
GTGGTGGCAG AAAACCGCAA AGTGATCCAG CCGCTGCAAG GGGGGCGTAT TCAGCAACTG
CATGTCACTG AAGGCGACGA AATAGTGTCT GGGCAATTAT TGGTCACGCT GGATGACACG
GCGATACGCA ATCAGCGGGA TAATTTACAG CATCAGTATC TAAGCGCCCT TGCTCAGGAA
GCACGTTTAA CTGCTGAGCA AAACGATCTG GATGTGATCA CTTTCCCGCA GGCGCTGCTT
GAGCACGCAA CACAACCAGC GGTTGAACGT AATATTATTT TGCAGCAACA GCTTTTACAT
CACCGCCGCC AGGCGCACTT GAGTGAAATC GCCCGGTTAT CGACACAGCT CACTCGCCAT
CAGGCTCGGC TCGATGGGTT GCAAGCTATG CGGAGCAATC ATCAACGTCA ATCCAATTTA
TTCCAGCAAC AATTAGACAG TGTGCAGTTA TTGGCAAAGG ATGGTCATAT TGCCAAAAAT
AAATTGTTAG AAATGGAAAG CCAGTCAACC TCACTCCAGG CACGAGTAGA ACAAAGTACC
AGCGATATTG CGGAAGCACA TAAGCTTATC GATGAAACAG AACAACATGT TTTACAACGG
CGCGAACAAT ATCAAAGTGA GAATAGCGAG CAATTAGCCA AGGCACAGCA AAACACCCAA
GAACTGGTCC AACGTTTAAA TATTGCAGAA TATGAATTGA GTCATACCCG TATTTTTGCA
CCAGTCAGTG GTTCAGTTAT CGCATTGGCT CAACACACTG TGGGGGGCGT AGTAAGCTCA
GGGCAGGCAT TAATGGAAAT TGTCCCCAGT GGGCAGCCGT TATTCGTTGA GGCTCAGTTG
CCGGTGGAGC TAATTGATAA GGTTGCGGTC GGGCTACCTG TAGATCTCAA TTTTTCTGCG
TTTAATCAAA GTAATACCCC CCGGCTACAA GGCTCTGTAT GGCGCATCGG AGCCGATCGT
ATACAACCCC CACCTACTTC ACCGCCTTAT TACCCTTTAA CCGTTGCGAT TGATCTTGAC
CCGACGGAAC TGGCAATCCG TCCAGGTATG GCCGTTGATG TTTTTATACG TACCGGAGAA
AGGTCATTAC TCAGCTATTT ATTTAAGCCA TTCACTGATC GCCTGCACCT CGCGTTAGCC
GAGGAATAA
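
The GC content reported in the Gene Information section is a base-composition percentage that can, in principle, be recomputed from the gene sequence. A minimal sketch, assuming the space-separated 10-base blocks are purely a display convention; `clean_blocks` and `gc_percent` are hypothetical helper names, not part of any database tool.

```python
def clean_blocks(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGTTGCCTG AATCAGTCTG TTATTCGGTT GCGAATGCCA ATCGCCAACC ACCACTCCAA "
seq = clean_blocks(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this first line only
```

Passing all lines of the gene sequence through `clean_blocks` before calling `gc_percent` would give the whole-gene figure to compare with the reported value.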
 
Protein sequence
MLPESVCYSV ANANRQPPLQ INSGRYLNIG GGLVVIGFIG FLLWAGLAPL DKGVAVTGLL 
VVAENRKVIQ PLQGGRIQQL HVTEGDEIVS GQLLVTLDDT AIRNQRDNLQ HQYLSALAQE
ARLTAEQNDL DVITFPQALL EHATQPAVER NIILQQQLLH HRRQAHLSEI ARLSTQLTRH
QARLDGLQAM RSNHQRQSNL FQQQLDSVQL LAKDGHIAKN KLLEMESQST SLQARVEQST
SDIAEAHKLI DETEQHVLQR REQYQSENSE QLAKAQQNTQ ELVQRLNIAE YELSHTRIFA
PVSGSVIALA QHTVGGVVSS GQALMEIVPS GQPLFVEAQL PVELIDKVAV GLPVDLNFSA
FNQSNTPRLQ GSVWRIGADR IQPPPTSPPY YPLTVAIDLD PTELAIRPGM AVDVFIRTGE
RSLLSYLFKP FTDRLHLALA EE
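
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below builds the codon table from the standard NCBI ordering of bases (T, C, A, G), whose amino acid assignments are the same for tables 1 and 11. The `translate` helper is illustrative; it stops at the first stop codon and does not handle alternative start codons, which is sufficient here only because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1 up to the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGTTGCCTG AATCAGTCTG TTATTCGGTT"))  # MLPESVCYSV
```

The output matches the first ten residues of the protein sequence, and translating the final nine bases (GAGGAATAA) gives EE followed by a stop, matching its last two residues.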