Gene YpAngola_A1880 details


Gene Information

Locus tag: YpAngola_A1880
Symbol:
ID: 5800351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1948622
End bp: 1954600
Gene Length: 5979 bp
Protein Length: 1992 aa
Translation table: 11
GC content: 50%
IMG OID: 641339811
Product: alpha-2-macroglobulin domain-containing protein
Protein accession: YP_001606366
Protein GI: 162419079
COG category: [R] General function prediction only
COG ID: [COG2373] Large extracellular alpha-helical protein
TIGRFAM ID:
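
The coordinates and lengths above are mutually consistent, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = cds_length_bp(1948622, 1954600)
print(gene_len)                     # 5979
print(protein_length_aa(gene_len))  # 1992
```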


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0009665
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATTTGT TGAGGTTTCT GCTGATCTCC CCCTTTGCCC TTATCAAGGG ACTCTATCGG 
TTAAGCGCTT ATCTTTTACG GTTAGTGGGC CGCCTATTGC GCCCAGTCGT CGGTAACCTG
AATTGGCGCG CGCCACAATG GATGACGAAA ACCGCCAACG GGCTACACTG TGCCTTCAAC
CGTTCAGAGC AATGGGTCGC TAAGCATCCC AAAGGGATCA GTGCGGCAAT AGTGCTGTTG
ATGGCTGCCG CCAGCGCCGC CTTCTATGGC TATCACTGGT ATCTTAACCG GCCCCAACCG
ATTGAACCGG CACCTATGGT TTATCAGGAA ACCAGTATCA GAGTCTCGGC GCCAAGAACG
GTTAACTACC AGGCGCAAAA ACCAGAAGCC CAGCCGCTTA GTCTTAATTT TATGCATTCA
GCCGCACCTA TCACCGCGAT GGGTCAGGTC GTCGATCAGG GCATCTCATT AACCCCCGCG
ATAGAGGGTG AATGGAAATG GGCAACTGAA CGCACGTTGG TGTTTACTCC CAAAAAAGCC
TGGCCGATGG GCGCTAACTA CCAAATTACG ATCGATACAG AAAAACTTTT AGCACCACAG
ATTAAGCTCA ATCAGACCGA ACTTAATTTC ACAACGCCAG CTTTTGCTTA TCAATTGGAA
AAAGCGGAAT ATTATCAGGA TCCGCAAGAG GCCCAAAAGC GCAGCACTAT TTTCCATGTG
CAATTTAATG CCCCAGTTGA TGTTGCCAGC TTTGAAAAAC AGATCCTCTT GGGATTGGTC
GAAGGTAAAT CCAAGTCAGA GAAGAAACTT AATTTCTCCG TCGTTTATGA TGAGAAAAAG
CTTAATGCCT GGATACATTC GCAACCCTTG ATGCCAATGG ATAAAGGCGG TTCGGTCCAT
CTATCGATTA ATAAAGGGGT GAATGCCAGT GTCGCCGCCA CGCCTACGAC ACAGGCACAG
AATAAATGGG TATCCGTCCC TAACCTATAT AGCCTGGCGG TTAATAGTAT CAATGCCACG
TTGGTCGAGT CAGATAACAA TAATGGTGAG CGGGCCTTAA TTATTGCTAT CAGCGACGCG
GTTAAAGATA AAGAGATCAA AAATGCGGTC AAAGCCTGGT TACTGCCGCA ACATAATTTT
CAAGCGAAAG AGAGCGCCAA AACATCAACC GATTTCTATC CTTGGGATAT GGATGATATT
GACGATAATC TGCTGCAACA ATCAACGCCG CTGGCGCTGA CCCTCAATGA GGCCGAGCAA
GAGTATCAGC CAATATTCAG CTTTAAGTTT GATGCCCCTT CCTATCGCAC ACTGCTGATC
GAGGTTAACA ATAGCCTGAC ATCGGTGGGC GGTTATAAAA TGCCGGAAAA AATCTACCAA
ATAGTCAGGG TTCCCGATTA CCCTAAGACG CTGCGCTTTA TGTCACAAGG CTCGTTATTA
TCGATGCAGG GTGATAAGCA GATCAGCGTC GCCGCCCGTA ATATGACTGG CATGAAACTG
GATATTAAGC GGGTTATTCC TAGCCAGTTA CAACATATTG TGTCATTTAA AAGCAGCGAA
TATTCATCAG CTCACTTTAA CCGCCTGAGT GATGAATATT TTACTGAACA CTTCCAGTAC
CAAACCGCGC TGAATAATGA CAACCCCGGC GAGATCAATT ATCAAGGGGT CGATCTGTCC
CGTTATCTTG CAAATAATCC GAGTGCTCGG CGTGGGGTGT TCTTACTCAC CCTGTCAGCT
TGGGATCCGG AGAAAAGGGA TAATCAGCAA CACAGCGAGG AAGACTACGA CGAAGACCAG
GAATGGGTCG GCGATTCACG CTTTGTGGTG ATCACGGACT TAGGCATTAT CACCAAGCAA
TCGCAGGATA GATCCCGTGA TGTGTTTGTG CAATCCATTC ACTCGGGTCT GCCCGCCGCC
GATGCTAAAG TCTCTGTGGT GGCAAAAAAT GGTGTGGTCT TACTGAGCCA AATCACCGAT
AGCAAAGGGC ATGTCCATTT TCCTGCGCTG GACGCCTTTA AAAATGAACG CCAACCGGTC
ATGTTCCTGG TGGAAAAAGA AGGGGATGTC TCCTTCCTGC CCACCCGAGC CACCTATGAC
CGTAACCTTG ATTTCTCACG TTTTGATATT GATGGCGAAG AGACCCCGTC CGACCCACGT
ACTCTAAGCA GCTATCTGTT TTCTGACCGG GGAGTTTATC GCCCAGGCGA CCGCTTCAAT
ATTGGTCTGA TCACCCGGAC CGCCAACTGG GCTACCGCAC TCGATGGCGT CCCCCTGCGG
GCGGAGATCC GTGACCCACG AGATACCTTG ATGAGTACCC TGCCGATAAC CTTGGACAGC
AGTGGTTTCA ATGAGCTCAG CTATACGACC GGTGAAAACT CACCTACCGG TGAATGGAAC
GTCTATCTCT ATCTGGTTGG TAAGAATAAT GAAACGTCGA TGTTGCTGGG GCACACCACC
GTAAATGTTA AAGAGTTCGA GCCTGATCGC TTAAAAGTGC AACTGCAACT GACGCCAGAG
CGTCAACAAG GCTGGGTTAA ACCGCAGGAG CTGCAAGCCA ATATCAATGT ACAAAATCTA
TTCGGTACAC CAGCACAGGA GCGCCGTGTC ACCTCTAGAC TGATCTTGCG GCCAATGTAC
CCGAGTTTTG CCCCGTTCCC TGATTACCTG TTCTATGAGA ATCGCCATAA CAGCGATGGT
TTTGAGACCG AACTGGAAGA GCAAACGACC GATCTACAGG GGATGGCGAC CATTCCATTG
GATCTGAAAT CCTATGCTGA CGCCACCTAT CAACTGCAAT TGCTGTCGGA AGCCTTTGAA
GCGGGTGGAG GCCGCTCTGT GGCCGCGACT GCGCGGGTTC TGGTCTCACC TTACGACTCT
CTGGTTGGGG TGAAAGCCGA TGGCGATCTG AGTTATATCA ACCGTGATGC CGTGCGTAAG
CTGAATATTA TTGCCGTTGA CCCGAGCCTG AATAAAATTG CGCTGCCAGA CTTGAGTCTG
TCATTGATTG AGCAGAAGTA TATTTCAGTG CTAACCAAAC AGGATTCAGG CGTTTATAAA
TATCAATCAC GGCTAAAGGA GCAGTTGGTC TCAGAGCAAC CGCTACAAAT CAGCCCGACA
GGGACGGATT TCACCCTGGT GACCCAGCAG CCTGGTGATT TTATTCTGGT GGTTAAGGAC
AGTCAGGGGC AGGTTCTGAA CCGTATTAGT TATACGGTGG CGGGTAACGC AAACCTGACC
CGCTCACTGG ATCGCAACAC CGAATTAAAG CTAAAACTGA ATCAGGCCGA ATATCTGCAA
GGCGAAGAAA TTGAGATTGC GATTAATGCA CCTTATGCCG GTAGCGGTCT GATCACGATA
GAAAAAGATA AAGTGTATAG CTGGCAGTGG TTCCACAGTG ATACCACCAG CTCTGTGCAG
AGAATCCGCA TCCCACCGGC AATGGAAGGC AATGGCTATA TCAACGTACA ATTCGTGCGT
GATGTGAATT CCGATGAGAT CTTTATGAGC CCACTGAGTT ACGGTGTGAT GCCATTTAAG
ATCAGTACCA AAGCGCGTCA GGCGGCTATC GAGTTAGCGT CGCCGTCAGT CATTAAACCG
GGTGAAGTGT TACCGATTAA AGTGACCACC GATTCACCAC AGCGCGTGGT GGTGTTTGCC
GTCGATGAAG GTATTTTGCA GGTGGCACGC TATCGCCTGA AAGATCCACT GGATTACTTC
TTCCGTAAAC GTGAACTGAG TGTACAGAGT GCACAAATTC TCGATTTGAT CCTGCCGGAA
TTCAGCAAGC TGATGGCACT GACCTCCGCA CCTGGAGGCG ACGCCGGGGA AGGGCTGGAT
CTGCACCTCA ATCCGTTTAA ACGCAAACAA GACAAGCCGG TGGCTTATTG GTCTGGTATC
ACCGAAGTGA ATGGTGAAAC CACCTTCAAT TACCCGATTC CCGACTATTT CAATGGTAAA
ATTCGCGTGA TGGCCATCTC TGCGACCCCT GATCGCATTG GTAAAGTCCA GACCTCGACC
ACCGTGCGGG ATAACTTTAT TCTGACGCCG AATGTCCCCG CGATGGTAGC ACCGGGAGAT
GAATTTGATG TCACCGTGGG TGTGAGTAAC AACCTGCAAG GATTGAAGGG TAAAGCGGTT
GATATCACCG TGCGTCTGAC ACCACCGCCA CAACTGGAAG TGGTGGGTGA AGCGCAACAC
AGCCTGTCGC TGGCAGAAAA ACGTGAAACG CTTGTCAGCT TCCGCCTACG CGCCCGTTCA
GCATTGGGTG ATGCTCCACT GGTGTTTGAT GCCAGCTATG GCTCTCAATC CAGCCGCCGG
ACGGTCAGTA CCTCGGTACG CCCGGCGATG CCATTCCGAA CGCAATCGGT GATGGGCCGG
ATGGAGGGTA ACAAGCATAC TGTGACCAAT CTGCGCCAGA TGTTTGATAA TTATGCTCAA
CGTCAGGCGA CCGCTTCCCA CTCACCGTTG GTCTTAACCC AAGGTCTGGC GCGGTACCTG
GCTGATTACC CGTACTACAG TTCTGAGCAA ATTGTCAGCC GCTCGATTCC GTTGATTATG
CAAAGCAAAC ATCCTGAAAT GGACAGTGCC CTCAATCAGA ATGAGGTCCG TGATCAACTG
AAAAACATGC TACGTATCCT GAGCTCTCGG CAGAATAGCA CTGGTGCAAT CGGTTTGTGG
CACGCCTCCC CTACCCCTGA TCCGTTTGTC ACACCTTATG TCGTGCAATT TCTGCTGGAA
GCGAAATCTG CCGGTTACAG CTTGCCGAAT GACATCTTGG AGGGGGCCAA CAACGCACTG
CGTCTGTTAG CGGTTCGACC TTATGATGAC CTTTACTCTC TGCGTTTGCG GGCCTTTGCT
GTTTACCTGT TGACCTTGCA GGGGGAGATC ACCACCAATA CTCTGGCATC GGTGCAAAGT
ACGTTACAGC AACTTTATCC TGACAGTTGG CAGACTGATC TGAGTGCCAT TTATCTGGCC
TCATCATACC GTCTGCTCAA AATGGATGAC GAAGCCAATA AACTGCTGCA ACCCACCTGG
AAACAACTGG GTAAAGCCTA CAGCAAGGCC TGGTGGACGC AGAATTATTT TGATCCACTG
GTGCAAGATG CAACCCGGTT GTATCTGATC ACTCGCCATT TCCCAGAGAA AGTCTCTTCT
ATTCCGCCAC AAGCACTGGA AAATATGGTG CTGGCACTGA GGGATGAGCA TTACACGACC
TATTCATCCG CGATGAGCAT TCTGGCACTG GAAAGTTACA CCAGCCAGGT AGCCGCCCAG
CAAGATACGC CAGAAACCCT GCAAATCATC GAGATCAGTA AAAGCAAAGG GATCGACCCT
AACGTTATCT CAACGCTGAA CGGCCTGTTC GTTCAAGGTG ATTTTACCGG TGAGGCTAAA
GCGATTCAGT TTAACAACTA TGCCTCGGCA CCCGCTTGGT ATGTGGTCAA TCAATCAGGC
TATGACCTTC AGCCACCAAA AGACGCCATC TCTAATGGGC TGGAAATCAG CCGCAGCTAC
ACCGATGAGC AGGGTAAGCC GGTGACCCAA GTCACCTTAG GGCAGAAAGT TAACGTGCAC
CTAAAAATCC GGGCTAACGC TAAACAAGGT CAAAATAATC TGGCGATTGT CGATCTACTG
CCGGGCGGTT TTGAAGTGGT ACAACAAACG GCACCTGAAC CAGAGTTTTA TGATAATCAG
GATGATCAGG ATGAGGAAAC TGGCAGCGGC TGGCAGTCGC CGCTAATGGT ATCTGGCTCC
AGTTGGTACC CTGACTACAG TGATATTCGT GAAGATCGCG TGATCATTTA TGGCAGTGCC
AGTACCGACG TTAAAGAGTT TATCTACCAA ATCAAATCAA CCAATACGGG TCGCTTTGTG
GTGCCACCGG CTTACGGCGA AGCCATGTAT GATCGTAATG TACAGGCGCT GTCGGTCGGT
AAAGGGCATA TCCTTGTCGT TCCACCTGAG GCAAAATAG
 
Protein sequence
MDLLRFLLIS PFALIKGLYR LSAYLLRLVG RLLRPVVGNL NWRAPQWMTK TANGLHCAFN 
RSEQWVAKHP KGISAAIVLL MAAASAAFYG YHWYLNRPQP IEPAPMVYQE TSIRVSAPRT
VNYQAQKPEA QPLSLNFMHS AAPITAMGQV VDQGISLTPA IEGEWKWATE RTLVFTPKKA
WPMGANYQIT IDTEKLLAPQ IKLNQTELNF TTPAFAYQLE KAEYYQDPQE AQKRSTIFHV
QFNAPVDVAS FEKQILLGLV EGKSKSEKKL NFSVVYDEKK LNAWIHSQPL MPMDKGGSVH
LSINKGVNAS VAATPTTQAQ NKWVSVPNLY SLAVNSINAT LVESDNNNGE RALIIAISDA
VKDKEIKNAV KAWLLPQHNF QAKESAKTST DFYPWDMDDI DDNLLQQSTP LALTLNEAEQ
EYQPIFSFKF DAPSYRTLLI EVNNSLTSVG GYKMPEKIYQ IVRVPDYPKT LRFMSQGSLL
SMQGDKQISV AARNMTGMKL DIKRVIPSQL QHIVSFKSSE YSSAHFNRLS DEYFTEHFQY
QTALNNDNPG EINYQGVDLS RYLANNPSAR RGVFLLTLSA WDPEKRDNQQ HSEEDYDEDQ
EWVGDSRFVV ITDLGIITKQ SQDRSRDVFV QSIHSGLPAA DAKVSVVAKN GVVLLSQITD
SKGHVHFPAL DAFKNERQPV MFLVEKEGDV SFLPTRATYD RNLDFSRFDI DGEETPSDPR
TLSSYLFSDR GVYRPGDRFN IGLITRTANW ATALDGVPLR AEIRDPRDTL MSTLPITLDS
SGFNELSYTT GENSPTGEWN VYLYLVGKNN ETSMLLGHTT VNVKEFEPDR LKVQLQLTPE
RQQGWVKPQE LQANINVQNL FGTPAQERRV TSRLILRPMY PSFAPFPDYL FYENRHNSDG
FETELEEQTT DLQGMATIPL DLKSYADATY QLQLLSEAFE AGGGRSVAAT ARVLVSPYDS
LVGVKADGDL SYINRDAVRK LNIIAVDPSL NKIALPDLSL SLIEQKYISV LTKQDSGVYK
YQSRLKEQLV SEQPLQISPT GTDFTLVTQQ PGDFILVVKD SQGQVLNRIS YTVAGNANLT
RSLDRNTELK LKLNQAEYLQ GEEIEIAINA PYAGSGLITI EKDKVYSWQW FHSDTTSSVQ
RIRIPPAMEG NGYINVQFVR DVNSDEIFMS PLSYGVMPFK ISTKARQAAI ELASPSVIKP
GEVLPIKVTT DSPQRVVVFA VDEGILQVAR YRLKDPLDYF FRKRELSVQS AQILDLILPE
FSKLMALTSA PGGDAGEGLD LHLNPFKRKQ DKPVAYWSGI TEVNGETTFN YPIPDYFNGK
IRVMAISATP DRIGKVQTST TVRDNFILTP NVPAMVAPGD EFDVTVGVSN NLQGLKGKAV
DITVRLTPPP QLEVVGEAQH SLSLAEKRET LVSFRLRARS ALGDAPLVFD ASYGSQSSRR
TVSTSVRPAM PFRTQSVMGR MEGNKHTVTN LRQMFDNYAQ RQATASHSPL VLTQGLARYL
ADYPYYSSEQ IVSRSIPLIM QSKHPEMDSA LNQNEVRDQL KNMLRILSSR QNSTGAIGLW
HASPTPDPFV TPYVVQFLLE AKSAGYSLPN DILEGANNAL RLLAVRPYDD LYSLRLRAFA
VYLLTLQGEI TTNTLASVQS TLQQLYPDSW QTDLSAIYLA SSYRLLKMDD EANKLLQPTW
KQLGKAYSKA WWTQNYFDPL VQDATRLYLI TRHFPEKVSS IPPQALENMV LALRDEHYTT
YSSAMSILAL ESYTSQVAAQ QDTPETLQII EISKSKGIDP NVISTLNGLF VQGDFTGEAK
AIQFNNYASA PAWYVVNQSG YDLQPPKDAI SNGLEISRSY TDEQGKPVTQ VTLGQKVNVH
LKIRANAKQG QNNLAIVDLL PGGFEVVQQT APEPEFYDNQ DDQDEETGSG WQSPLMVSGS
SWYPDYSDIR EDRVIIYGSA STDVKEFIYQ IKSTNTGRFV VPPAYGEAMY DRNVQALSVG
KGHILVVPPE AK
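
The blocked sequences above (groups of 10 characters, 60 per line) can be cleaned and cross-checked with a short sketch. It is an illustration only, not the database's own pipeline: the helper names are hypothetical, and the codon table is written out by hand using the standard amino acid assignments, which translation table 11 shares for non-start codons.

```python
# Standard codon assignments in TCAG order (same amino acid mapping as table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGGATTTGT TGAGGTTTCT GCTGATCTCC CCCTTTGCCC TTATCAAGGG ACTCTATCGG"
print(translate(clean(first_line)))  # MDLLRFLLISPFALIKGLYR
```

The printed peptide matches the first 20 residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value to compare with the GC content listed under Gene Information.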