Gene YpAngola_A1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1189 
Symbol 
ID5799654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1232691 
End bp1235549 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content47% 
IMG OID641339161 
ProductRHS repeat family protein 
Protein accessionYP_001605731 
Protein GI162419579 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.667138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0000780044 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCACAA GTTTACCAAC CCAGCTTTGC GCCAACACCC CCGCCCTCAC CATCCACGAT 
AACCGGGGGT TAGCTATTCG TACACTGGCT TATAACCGCC GCGATCATAA TGAAACCGTT
GACGAACTGA TCAGCCGCAA CCGCTATAAC GCCTCCGGTC AGCTAATCGC CAGCCGTGAT
CCGCGCCTTG AGGTGGATAA TTTCCGCTAT CAATACAGCC TCAGCGGTGT TCCACTGCGC
ACCGACAGCG TCGATAGCGG CAGTACACTG CAACTGGCAG ATAGTGCTGG CCGCACGGTG
CTCACGCTCG ATGCACACCA CACCCGCCGC TGGGTGGAGT ATGAGACCGG TGAACACAGT
TTAGGCCGCC CGCTAAGTTA CCACGAGCAA GCCAAAGGCG GCCTGAAAAC GGTTACCGAC
CGCTTTTTCT ATGCCACAAA CAGCGAGCAG GATAAAAACT GCAACCTGAA CGGCCAGTGT
GTACGCCATT ACGACAGCGC TGGTTTGCAG GCACTGATTA GCCAGTCGAT TATTGGCGTA
CCACTGCAAC AACAGCGCCG TCTACTGACG AATCCCAAAG GCCCAGTTGA CTGGTTTGGC
GAAAAGGAAA ACTGGGGCGC TCGCCTGAGC GAACAGCCGT TTGTTAGCCA TAGCACCACC
GATGCTCTCG GCCAGTTACT CACGCAAACC GATGCCAAAG GCCATATCCA GCGCATGGCC
TATAACCGCG CCGGACAACT TATCGGTTCG TGGCTAACAA TAAAAAATAG CGCTGAACAG
GTGATCCTCC GTTCACTCAC CTATTCGGCT GCTGGGCAAA AACTGCGCGA AGAGAGCGGC
AACGGGGTGA TTACCGAATA CCGTTATGAA CCCCAGACTC AGCGCTTAAT CGGCATTAAA
ACCACCCGTC CGGCGAAGAA AGACCGCCCG ACCCGGTTAC AAGACCTGCG TTACGATTAT
GACCCGGTCG GGAATATTCT CGCCATCCAT AATGACGCCG AAACCACCCG CTTCTACCGT
AATCAGAAAA TCGTGCCGGA AACCACTTAC CGCTACGATG CACTGTATCA GCTTATCGAG
GCCACTGGCC GTGAAGCGGA TACCAACGGC ATACAAAACA GCCAGTTACC CGCGTTGGCG
TCACTGAACG ACAGCAACCA GTTCGTCAAC TACACCCGCA GCTACCACTA TGACCGCGCC
GGTAACCTGC TAAAAATTCA GCATACCGGT GCCAGCCAAT ACAGTACCCA TATCACGGTG
TCCGATTCGT CCAATCACGG CATTCAGCAA CAAGATGGCA TCATCGCCCG TGATATTCGC
TCCCAGTTTG ATGCGGCGGG TAATCAGCAA CAACTGCAAC CCGGTCAGCC CCTGCGCTGG
AACAGCCGCA ATCAGTTACA GCAGGTGGAA CCTGTGCCCC GCAACGATGG CATCAGTGAC
AGCGAAAGTT ATCTCTATGA TGGCAGCGGT AGGCGGGTGG CCAAAATCAG TCTCCATAAA
ACCCATAACG CCATCCAAAC CCGTTCGGTC ATTTATTTAG CGGGACTGGA ACTGCGTGGC
CAACATAATG ACAATAATCT GACAGAAAGT TTTCAGGTGA TAACCGTGGG TGCTGCGGGC
CGTGCTCAGG TACGGGTATT ACACTGGGAG AGCGGCCAAC CCGTTGATAT CGTCAATGAC
CAACTGCGTT ACAGTTTCGA TAATCACCTT GGCTCGGCGT TAATCGAATT AGACAGCGAT
GGCGATATTA TCAGCCAGGA AGAATATTAC CCATTTGGCG GTACCGCGGT GTTAGCCTCC
CGTAATACCG TGGAAGCCAA ATATAAAACC GTTCGTTATT CCGGTAAAGA GCGCGATGCC
ACCGGGCTGT ATTATTATGG TTACCGTTAT TACCAACCGT GGCTGGGCCG ATGGTTAAGC
GCCGACCCCG CAGGCACTAT AGACGGACTG AATTTATATC GGATGGTGAG GAATAACCCA
ATCAGATGGC GTGATAACAA TGGGCTATTA ACCGAAGAGC AAATTAATAT GTACGTTAAT
TTGTTTAGTA ATATTGGATT AAAAAATGAT GATGAATTAA AGAGTGAATT ATTAAAATAT
GGTTTAAGTG AAGAAGAGCA AAACCAGATA TACCTTAATA TGTTAAGACC TATGCAGTCT
GGATCATCAA GCTCATTATT CTCCTTCCCT TCTGAAAGTA GTTCAAGTTC TGGGAGTACG
CAAAGTGTTG ATTCAGGTTA TCTCAGTCCA GTAAGAAACT ATCATTTTTT TGAAGATATT
AAGTTAGCAA CAATGCACCG TCCCTATCCA AAAAAACAAG CCTCTAGTGA CACAATAACA
TATTCAGCAG AAGATTTAAC AGAAGCTAGC CCTATAAAAA TTCTCATTGG TTTGGATTTG
ACCAGTAAAA ACACCCAACC ATATAAGTCA GCGCTTGCCG AAAAAGGAAT TAAGTATATC
ACTAAAGAAA AATATGAAAT AACAGACTTT TTTGAAGAAG GAGGATTATC GACTGAACAA
ATAGATTTAA CAGTAAATAA AATATTAAAA TTACAAAAAA AGGATCTTGT AGGAATTCAT
TGTGGGGCAG GTAATGGAAG AAGTGGAGTT ATTGCATCAG CATTATCCAT TAATAAACAG
TATACAACAG ATAAAATAAA TAGTTTTGAC GTAACTCATT CATTAAGAGG GTCAATACTT
AAAGACACAC AAACATACCA AGTGGATACG GTAACCGCCA AGGCGGTTGG GATTATCAGA
GAAATAAATC CTAAAGCAGT GGAACGTAAT CAGGATGTTA TTTCCCTATA TAGATATTCT
CATTTTTTAT ATACAAGAAA ACACACTACA TCATTATAA
 
Protein sequence
MSTSLPTQLC ANTPALTIHD NRGLAIRTLA YNRRDHNETV DELISRNRYN ASGQLIASRD 
PRLEVDNFRY QYSLSGVPLR TDSVDSGSTL QLADSAGRTV LTLDAHHTRR WVEYETGEHS
LGRPLSYHEQ AKGGLKTVTD RFFYATNSEQ DKNCNLNGQC VRHYDSAGLQ ALISQSIIGV
PLQQQRRLLT NPKGPVDWFG EKENWGARLS EQPFVSHSTT DALGQLLTQT DAKGHIQRMA
YNRAGQLIGS WLTIKNSAEQ VILRSLTYSA AGQKLREESG NGVITEYRYE PQTQRLIGIK
TTRPAKKDRP TRLQDLRYDY DPVGNILAIH NDAETTRFYR NQKIVPETTY RYDALYQLIE
ATGREADTNG IQNSQLPALA SLNDSNQFVN YTRSYHYDRA GNLLKIQHTG ASQYSTHITV
SDSSNHGIQQ QDGIIARDIR SQFDAAGNQQ QLQPGQPLRW NSRNQLQQVE PVPRNDGISD
SESYLYDGSG RRVAKISLHK THNAIQTRSV IYLAGLELRG QHNDNNLTES FQVITVGAAG
RAQVRVLHWE SGQPVDIVND QLRYSFDNHL GSALIELDSD GDIISQEEYY PFGGTAVLAS
RNTVEAKYKT VRYSGKERDA TGLYYYGYRY YQPWLGRWLS ADPAGTIDGL NLYRMVRNNP
IRWRDNNGLL TEEQINMYVN LFSNIGLKND DELKSELLKY GLSEEEQNQI YLNMLRPMQS
GSSSSLFSFP SESSSSSGST QSVDSGYLSP VRNYHFFEDI KLATMHRPYP KKQASSDTIT
YSAEDLTEAS PIKILIGLDL TSKNTQPYKS ALAEKGIKYI TKEKYEITDF FEEGGLSTEQ
IDLTVNKILK LQKKDLVGIH CGAGNGRSGV IASALSINKQ YTTDKINSFD VTHSLRGSIL
KDTQTYQVDT VTAKAVGIIR EINPKAVERN QDVISLYRYS HFLYTRKHTT SL