Gene YpAngola_A1084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1084 
Symbol 
ID5799547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1117016 
End bp1118704 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content54% 
IMG OID641339062 
ProductShlB/FhaC/HecB family haemolysin secretion/activation protein 
Protein accessionYP_001605634 
Protein GI162418225 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGTT TTTATTATTC TATATTAGTG GCTGGCGTAT TATTGAAAAT AGTGGCTATT 
CCAGATGCCA GTTATGGCGC GGAATTAGCG CCCGTTCAAC AAAGTATTCA CCAACAGGAG
CGGCAACGGG CACTTGAAGA GCGCCTGGCT CCGCCGACGC CGGATGTGCG CTTATCTGCG
CCCTCGGCCT TTTTTAGCCG TATTATTTTT CCGCTGGAGA CACCCTGTTT TGTTATTAAT
CGGATAAAAA TCAGTGGGGC CGAGCCATTG CCCCGTTGGT TGCCGTTGCA ACGCATTGCC
GATCAAGCGC AAGGCCAGTG TCTGGGGGCC AAGGGCATTA ACCTGCTAAT GAGCCAAATG
CAGAACCGTT TGGTCGATCA CGGCTATGTG ACTACCCGAG TGCTGGCCCC ACAGCAGGAT
TTAAACAGCG GCACGCTGGC GCTAAACGTC GTGCCCGGCA AAATACGCGG TGTGGAACTG
ACGCCAGACA GTAATAGGTA TGTCACGTTA TTCAGCGCTT TTCCTGCCCG TGCCGGGACG
CTGTTGGATT TACGCGATAT CGAACAAGGC TTGGAAAATC TACAGCGTGT TCCCACGGTG
CAAGCCAATA TGGTGTTGAT CCCAGGCTCT GCCCCCGGTG AGACGGATAT CATTCTGAAC
TGGCAACAGC GAAAAATGTG GCGGCTGGCA GCCTCACTGG ATGATTCGGG TACCCGCAGC
ACTGGCCGCT ATCAAGGGGG GGCGACGTTG TTTCTGGATA ACCCGCTTTC TCTGAGTGAT
CTTTTTTATG TCTCAGCTGG TGGCGCACTG CAACGCCGTG GCGATAAAGG CACGAATAAT
CTGACCGGCC ATTACTCATT GCCATTCGGT TATTGGACCG CAGGCATGAC CGCCAGCCGT
TATGACTATT ACCAGGCCGT TGCGGGCCTG AATGGCGATA TCAACTACCG AGGTGAAAGT
GAGAACGTGG CGTTCCAACT CAGCCGGTTG TTGCACCGTA ATGCCAGCCA GAAAACCACC
TTTACCTACG ATGTGCTAAC CCGTTCGTCG AAAAACTATA TCAACGATAC CGAAGTGGAA
GTACAGCGCC GCCGCACCTC GGCCTGGCGG ATCGGGCTAC AACACCGCCA CTTTATCTCG
CAGGCGATTT TGGATGCCGG TATCAGCTAT CAGCGTGGCA CCCGCTGGTT TGGTGCCATA
CCCGCGCAGG AAGAGTATTT CGGCGAAGCC ACCGCCCTGA GCAAAATTCT GCGATTGAAT
GCGCAACTGG ATATTCCTTT TGTGGTTATG GCGCAAAACC TCCATTACAA CCTGCAATAT
CAGCGCCAAA GTACCAACAC GCCACTGACG CCGCAGGATC AGTTCTCCAT TGGTGGCCGC
TGGTCGGTGC GTGGTTTTAA TGGTGAGCGC ACGCTGATTG CCGATCGCGG CTGGTGGGTG
CGCAATGATA TCGGCTGGTA TCTGCCGCTA CCGGGGCATG AGCTGTATGT CGGTGTGGAT
TACGGCGAAG TCGGTGGTCG TAGTGGCGCT TATCTGTTAG GCCGCCATTT GGCGGGCAGT
GCGGTCGGGG TACGTGGCAA CGTACTGAAT ACCCGCTATG ACCTGTTTGC GGGGAAACCG
CTCTCTAAAC CTAATGGTTT CAAAACCGAT TCGCTGGCGG TGGGTTTTAA CCTGAATTGG
CTGTACTGA
 
Protein sequence
MSRFYYSILV AGVLLKIVAI PDASYGAELA PVQQSIHQQE RQRALEERLA PPTPDVRLSA 
PSAFFSRIIF PLETPCFVIN RIKISGAEPL PRWLPLQRIA DQAQGQCLGA KGINLLMSQM
QNRLVDHGYV TTRVLAPQQD LNSGTLALNV VPGKIRGVEL TPDSNRYVTL FSAFPARAGT
LLDLRDIEQG LENLQRVPTV QANMVLIPGS APGETDIILN WQQRKMWRLA ASLDDSGTRS
TGRYQGGATL FLDNPLSLSD LFYVSAGGAL QRRGDKGTNN LTGHYSLPFG YWTAGMTASR
YDYYQAVAGL NGDINYRGES ENVAFQLSRL LHRNASQKTT FTYDVLTRSS KNYINDTEVE
VQRRRTSAWR IGLQHRHFIS QAILDAGISY QRGTRWFGAI PAQEEYFGEA TALSKILRLN
AQLDIPFVVM AQNLHYNLQY QRQSTNTPLT PQDQFSIGGR WSVRGFNGER TLIADRGWWV
RNDIGWYLPL PGHELYVGVD YGEVGGRSGA YLLGRHLAGS AVGVRGNVLN TRYDLFAGKP
LSKPNGFKTD SLAVGFNLNW LY