Gene YpAngola_A3277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3277 
Symbol 
ID5801754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3476966 
End bp3480040 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content49% 
IMG OID641341101 
Productpolysaccharide lyase family protein 8 
Protein accessionYP_001607623 
Protein GI162419870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00028263 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCACAT TATTAGGTAC TGCGGCTTAC ACACCTCACC TTTCGGCCAA TAATACAAAA 
AGTGCGAAAA CAGAGGTAAG TCAAAAAGCG CTGATGCAAT CTTCGATTAT TGATTTCGAT
GGCAATCAGC TCCCTGATTT TGTCACCGCC AGTTCAGGTT CAACCTTATC GCTGAGCTCC
ACCCGCTATA TTATGGGTCA ACAGTCGCTT AAATGGGAAT GGCAGGCAGG CAGTACCATG
ACCATTGAGC ATCCTATTAC ACTGGTCACC GATAAGGTCG CCTCTAAAAC CTGGGGACGT
AAAGCCACAC AGGTGCTCTC TTTTTGGATC TATAACGAAA CTCCGGTGGA TGATTATATG
ATCGTCGATC TGGGGCGAGG GCTCGGCCCG TCAGGCGCAC CGGATGCCGG TATTAAAGTC
AAAATGAATT TCAGTGGCTG GCGCACCGTG GGCGTGTCAC TGCAAAATGA TCTGGAAGGG
CGAGAAATCG AAGGCATTGG TTTCGATGAT AATGCAGAAG GTGAAAGCGG TGGACGTAGC
ACGATCGCCG GTGGTGCGAA AAGCGATATG GACAGCATCC GTTTTCGGGC ACCGCTAAAA
GCACCAGCGG GTATCTTCTA TATTGATCGC GTTATGTTGT CTATTGATGA TGCTCGCTAT
CAATGGTCAG ATGATCACGT CAAAACCCGT TATCAGATCC CGGAAATTAA TTTTCGCTTA
CCCGCCAAAT TACCCGCACC AACGGCCGGG GAAATAGCGG CAGCTGATGA CATTCGTGGT
GCATTGATCC GCGTCTTTAC CGAAGGTCAA GGTGGTGAGA ATGGCTTGGT TGTGGTCGAT
AATCGCGAGA AATTACGCGC GCATTTTGCA GCATTAAAAA TTCAACGTGA TGCCGATGGG
CAATTATCAG GTCGGCATAT TATTACAGAT AAACAGAAGG TGTTATATCA ACCAGAGTTT
ATGGATGATA TTGATCAGCA GCGTTTCAGC GATTATGTGC TCTTGGGTGA TTACACCACA
TTAATGTTCA ATATCAGCCG AGCCTATCAG CGCAGCACAT CCGAAGAGGT GCGTCAGGAA
CTGGCTGACA TGTATATCCT GATGACTCAA CACCTGTTGG ATCAAGGTTA TATGGATGGT
AGTGGGCTGG TGACAACCCA CCACTGGGGA TACAGCTCAC GCTGGTGGTA TATCTCAGCC
ATGTTAATGG CAAACGAACT GGATAAGCAT CAACTACGTC AGCCAGTTTC CGATGCGCTG
CTGTGGTATT CCCGTGAATT TAAAGCCAAC TTTGATATGG TTCCGGGGCC GGAAAGCTCG
GATTTGGACT ATTTCAACAC CCTATCCCGT CAACATCTGG CGTTGCTGAT GCTGGAATCT
GATCCGGCCA AACGCGTGGC ATTGTTTAAG CGTTTTGGTG AGTACATCAA TGTGGCACTT
TCACAGACCC CACCGGGGGG CTACGACGGC TTGCGGCCAG ATGGTACGGC ATGGCGTCAT
GAAGGCAACT ATCCAGGTTA CTCATTCCCT GCCTTTAAAA ATGCCGGCCA ATTAGTATAC
ATGCTACATG GCACACCGTT TGCGGTGAGT GATGAAGGGC GAGCGGCCCT GAAAAAAGCC
ATGTTATCGG CATGGGTCTA CAGTAACCCA GCAACCTCGT TGGGGCTGGC GGGCCGTCAT
CCGTTTAACT CTTCAAGTGT GACTCTGTTC CAAGACGCCT TCCGCTGGTT GGCACTCACG
GGGGATCCGA AAACCGGTGA TAAAGTGGAT AAAGCGCTGG CGGCCGCTTA TCTGCAAATT
ACCGAAACAC CAGAAAGTGA GAGTGAAGCA ATCTTTGGTG TGCGCATTGC TCCAGCAGAG
CTCCCCCAAG GAAACTGGAC ATTTAACGGC GGTGCATTCG GGATCCACCG CTTCAGCGAC
AAGATGGTGA CGCTGAAGGC GTACAACAGC AATGTCTGGT CATCAGAGAT CTATTATCGT
GATAACCGTT ATGGCCGTTA TCAGAGTCAT GGTGCAATAC AAGTACTCCC GTATGGTAAC
CAGAAAGAGA TAGGTTTTGT ACAAGATGGC TGGGACTGGA ATCGCAATCC GGGCACAACT
ACGATCCACC TTCCATTAGC TGAACTGGAT AGCCCGAATA CGCATACCCT GATGTTACGT
GGTAATCATC CATTCAGTGG TCACTCATCA CTAAGTGGCA AATATGGCAT GTTTGCCTTT
AAATTTGATG CGCCATCTAT GCCGAAGTTT GACAGCAGTT TTACCGCGCG TAAGAGTGCT
CTCGAAACTG AAAATCGGTT AGTTCTCTTG GGCAGTAACA TCAGTAACAA TACGGAAAAA
TATGCGACTG AAACCACTCT GTTCCAACAT GGCATTACAG ATAAAGCCAG TGATCTCTGG
GTGAACGGCG AACGTATTAC GGCATTGCCT TACCAGCGCG CACTGGAGGA AGGTGATTGG
TTAATTGATG GCCATGGTAA CGGCTATCTC CTGACCAAAG GGGCGAAAGG TGAAGTGCGC
CGCCAGCATC AGGTTTCTGC CAATGATAAA TCCCGTGACC CTACTGAAGG CAATTTCAGT
CTGGCCTGGC TTGATCACGG GGCGAAACCA CAGAATGCAC AATACGAATA TCTGATGGTG
CTAGAGGCAA CACCGGAAGG CATGCAACAA CTGGCCACAG ATTATCGGGC CGGTAAGAAA
GTGTATGAAG TGTTACGTCA AGATGCCTCC GCACATATTG TGCGTGATAA CGTGACGCAG
ACAACCGGTT ACGCTGCCTT TAGTTCTGTA ACACCAGATG AGGGTGTGGT GAAAAATATT
GCTCAGCCAG CCATTGTTAT GACGCAAATG CAGAGTCATG GTCAGCTTAA GATCAGCGGC
GTTACGCCAG ATCTGAATAT GACCCGCACA ACGAAAGCGA CGCCATTAGC GATTGCCGTG
ACACTCAACG GCCAGTGGCA GGCCACGGAG AATAATCCGC AGGTGAGCGT AAAACTTGCG
GGAGAGACCA CGCAACTGGT GTTTACCAGC TACTTTGGTA TGCCACAAGA GGTCACGCTC
AAGCCGATGA ACTAG
 
Protein sequence
MTTLLGTAAY TPHLSANNTK SAKTEVSQKA LMQSSIIDFD GNQLPDFVTA SSGSTLSLSS 
TRYIMGQQSL KWEWQAGSTM TIEHPITLVT DKVASKTWGR KATQVLSFWI YNETPVDDYM
IVDLGRGLGP SGAPDAGIKV KMNFSGWRTV GVSLQNDLEG REIEGIGFDD NAEGESGGRS
TIAGGAKSDM DSIRFRAPLK APAGIFYIDR VMLSIDDARY QWSDDHVKTR YQIPEINFRL
PAKLPAPTAG EIAAADDIRG ALIRVFTEGQ GGENGLVVVD NREKLRAHFA ALKIQRDADG
QLSGRHIITD KQKVLYQPEF MDDIDQQRFS DYVLLGDYTT LMFNISRAYQ RSTSEEVRQE
LADMYILMTQ HLLDQGYMDG SGLVTTHHWG YSSRWWYISA MLMANELDKH QLRQPVSDAL
LWYSREFKAN FDMVPGPESS DLDYFNTLSR QHLALLMLES DPAKRVALFK RFGEYINVAL
SQTPPGGYDG LRPDGTAWRH EGNYPGYSFP AFKNAGQLVY MLHGTPFAVS DEGRAALKKA
MLSAWVYSNP ATSLGLAGRH PFNSSSVTLF QDAFRWLALT GDPKTGDKVD KALAAAYLQI
TETPESESEA IFGVRIAPAE LPQGNWTFNG GAFGIHRFSD KMVTLKAYNS NVWSSEIYYR
DNRYGRYQSH GAIQVLPYGN QKEIGFVQDG WDWNRNPGTT TIHLPLAELD SPNTHTLMLR
GNHPFSGHSS LSGKYGMFAF KFDAPSMPKF DSSFTARKSA LETENRLVLL GSNISNNTEK
YATETTLFQH GITDKASDLW VNGERITALP YQRALEEGDW LIDGHGNGYL LTKGAKGEVR
RQHQVSANDK SRDPTEGNFS LAWLDHGAKP QNAQYEYLMV LEATPEGMQQ LATDYRAGKK
VYEVLRQDAS AHIVRDNVTQ TTGYAAFSSV TPDEGVVKNI AQPAIVMTQM QSHGQLKISG
VTPDLNMTRT TKATPLAIAV TLNGQWQATE NNPQVSVKLA GETTQLVFTS YFGMPQEVTL
KPMN