Gene YpAngola_A3524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3524 
Symbol 
ID5802000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3746071 
End bp3748707 
Gene Length2637 bp 
Protein Length878 aa 
Translation table11 
GC content58% 
IMG OID641341340 
Producthemagglutination repeat-containing protein 
Protein accessionYP_001607853 
Protein GI162418354 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.20838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGTA TCCAAAAGTG TGACTGTAAT TATCTGGTCA GATTTGCAGT TTCAGATTCC 
TTTATCAGAG ACAATTCTTT TCGCAGCTTG CTTGCTATTT TTATGGTGAC GATATTTGTG
CCAAATACTG CATTTTCTCA ACTCTATGTG AACAATGCCA ATGATCCTGG CTGTTATGTC
ATTGCTGATG GGGGGTTTAC GGGTGTTACT AATAGTGATC AAAATTGTTA TACGTTGTCT
TTGAATAATA TTAATACCAA TGGGGGGCAG CTATTTGTTG GTGGAAAAGG TGGGATCGCT
GGGAAGCCTA GCTTTATCGC CACCCCTTGG ACAGGGACAT TCACGACGGC CGTTGGCACC
AGTAATGTCG CCACAGCATA CGGCTTTGTC GTTCAAAGCA ATGGTGCATT TATCAACGGT
GATACCTACG TGAAGGGGGG GTTATTTTTG AACGGTAGGA AAGCCACCAA TCTTGCCCCC
GCAACGATTT CATCGACCTC CACCGATGCG GTGGTTGGTA GCCAGCTTTA TACGGTGATC
CAAGATGGAA CCCGCTATTT CCACGCCAAC TCAGTGAACC CGCAAGACTC CGTGCCTGCG
GGTCAGGATG CTATTGCTGT CGGGCCGGCG ACGGTGGTCA ATGGTAATAA CGGGATTGGT
ATTGGTAGTA GCGCCGTTGT TGGGCCGAGT GCTGTCGGGG GCATTGCAAT TGGCCCCAAC
ACTCAGGCGA CCGGTATCGC CAGCACGGCC CTTGGTGCCG GGTCGCAAGC GCATGGATCA
CAGTCTTTGG CATTGGGGGC GGGAGCAACT GCCAGCCAGG CAAACAGTAT CGCGTTAGGG
GCGTCGTCGG TCACCACGGT CGGTGCTGAG AGCGACTACA GTGCGTACGG ACTGACGGCT
CCCCAAACGT CGGTGGGCGA GGTGGGGATG GGCACGGCAC AGGGGAATCG CAAGATCACC
GGTGTGGCAG CCGGTTCGGC TGATTATGAT GTGGTCAATG TCGCGCAATT GACCGCTGTT
GGTGACAAGG TCGAGCAGAA TACCGCCGAC ATTACCAGTT TGGGTGGCCG GGTCACCAAT
GTTGAGGGGG GGATGACCCG TATCACCAAC GGGGGCGGTA TAAAGTACTT CCACACTCAC
TCCACCGAGC CTGATTCGGT GGCCAGCGGC AGTGATTCGG TGGCGATCGG ACCGAATGCG
CAGGCGTCCG GTACCACGTC GATAGCCATG GGGGCCGGGT CGACAGCGCA GGGAGCACAG
TCTCTGGCAT TGGGGGCGGG AGCGGCTGCC AGCCAGGCAA ACAGTATTGC ATTAGGGGCG
TCGTCGGTCA CCACGGTCGG TGCTGAGAGC GACTACAGTG CGTACGGACT GACAGCTCCC
CAAACGTCGG TGGGCGAGGT GGGGGTGGGC ACGGCACAGG GGAATCGCAA GATCACCGGT
GTGGCAGCCG GTTCGGCTGA TTATGATGCG GTCAATGTCG CGCAATTGAC CGCTGTTGGT
GACAAGGTCG ATCAGAATAC CGCTGACATC ACCAGCTTAG ACGGCCGGGT CACCAATGTT
GAGGGGGAGA TGGCCAGCAT CACCAACGGG GGCGGCGTGA AATACTTCCA CACCCACTCC
ACCGAGTCTG ACTCGGTGGC CAGCGGCAGT GATTCGGTGG CGATCGGACC GAATGCGCAG
GCGTCAGGTA CGGCTTCGGT GGCCTCCGGC AAGGGTACGC TGGCCTCCGG TAACGGTGCG
GTGGCGATAG GTGATGCAGC AAGCGTCAGC GCAGAGGGCA GTGTTGCCCT GGGGCAGGGT
TCCGCTGACA ACGGGCGCGG TGCAGAGAGC TACACCGGCA AGTACTCCAC TACGGATAAC
ACCACCTCAG GTACGGTGTC GGTGGGCAAT GCGGCAACCG GAGAGACCCG GACGGTCAGC
AACGTTGCCG ACGGGCGAGA GGCCATGGAT GCAGTCAATC TGCGGCAACT CGATGGTGCA
ATGGCGGCGG TGGGTGACAC CGTATCAGGG TTGCAGAACG GCACTGACGG GATGTTCCAG
GTGAACAACA ACAGCGGTCA GGCCAAGCCT TCGGTCACCG GAACTGATGC GATGGCGGGG
GGGGGGGGGG CAGGCTCCGT GGCGTCTGGC AGCCACAGTA CCGCGATGGG TACGGGCAGC
AAGGCGACGG CGGCAAACAG CACCGCGCTG GGGGCCAACT CAGTGGCGGA TCGTGAAAAC
AGTGTCTCGG TGGGGTCAGT GGGTAATGAA CGGCAGCTCA CTAATATTGC TGTGGGGACT
CAGGGCACTG ATGCGGTGAA TCTGGATCAA CTCAACCATA GCATGTCGAA TGTCACCAAC
GACGCCAATG CTTATACAGA CCAGCGCTAT TCTGCACTTA AAGAAGATCT GAAAAAACAG
GATAGTACGT TAAGTGCGGG GATCGCCGGT GCCATGGCGA TGGCGAGCCT GACTCAACCC
TATACGCCGG GTGCCAGCAT GGCGACCATT GGTGCGGCCA GCTATCGGGG CCAGTCGGCG
CTGTCGGTGG GGGTGTCGAG TATTTCTGAC AGTGGGCGAT GGGTCAGCAA ATTGCAGGCC
TCCTCTAATA CACAAGGCGA TATGGGGGTT GGTGTCGGCG TCGGTTATCA ATGGTAA
 
Protein sequence
MKSIQKCDCN YLVRFAVSDS FIRDNSFRSL LAIFMVTIFV PNTAFSQLYV NNANDPGCYV 
IADGGFTGVT NSDQNCYTLS LNNINTNGGQ LFVGGKGGIA GKPSFIATPW TGTFTTAVGT
SNVATAYGFV VQSNGAFING DTYVKGGLFL NGRKATNLAP ATISSTSTDA VVGSQLYTVI
QDGTRYFHAN SVNPQDSVPA GQDAIAVGPA TVVNGNNGIG IGSSAVVGPS AVGGIAIGPN
TQATGIASTA LGAGSQAHGS QSLALGAGAT ASQANSIALG ASSVTTVGAE SDYSAYGLTA
PQTSVGEVGM GTAQGNRKIT GVAAGSADYD VVNVAQLTAV GDKVEQNTAD ITSLGGRVTN
VEGGMTRITN GGGIKYFHTH STEPDSVASG SDSVAIGPNA QASGTTSIAM GAGSTAQGAQ
SLALGAGAAA SQANSIALGA SSVTTVGAES DYSAYGLTAP QTSVGEVGVG TAQGNRKITG
VAAGSADYDA VNVAQLTAVG DKVDQNTADI TSLDGRVTNV EGEMASITNG GGVKYFHTHS
TESDSVASGS DSVAIGPNAQ ASGTASVASG KGTLASGNGA VAIGDAASVS AEGSVALGQG
SADNGRGAES YTGKYSTTDN TTSGTVSVGN AATGETRTVS NVADGREAMD AVNLRQLDGA
MAAVGDTVSG LQNGTDGMFQ VNNNSGQAKP SVTGTDAMAG GGGAGSVASG SHSTAMGTGS
KATAANSTAL GANSVADREN SVSVGSVGNE RQLTNIAVGT QGTDAVNLDQ LNHSMSNVTN
DANAYTDQRY SALKEDLKKQ DSTLSAGIAG AMAMASLTQP YTPGASMATI GAASYRGQSA
LSVGVSSISD SGRWVSKLQA SSNTQGDMGV GVGVGYQW