Gene YpAngola_A2566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2566 
Symbol 
ID5801037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2683567 
End bp2686509 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content55% 
IMG OID641340435 
Producthypothetical protein 
Protein accessionYP_001606977 
Protein GI162421106 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00381829 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACAT CACTATTCAG TAAAACCCCG GCGGTCACGG TCCTCGACAA CCGTGGGCTG 
TCCGTGCGCA GCATCGCGTA CCATCGCCAT CCCGACTCGC CGGATGACAC TGGTGAGCGT
ATCACCCATC ACCAGTATGA TGCCCGTGGT TTCCTCACGC AAAGCGCCGA TCCCCGCCTG
CACGACACTG GTCGGGCGAA CGTCAAGTAC CTGTCGGACC TGGTCGGAAA CGCCCTGTGT
ACCGTCAGCG CCGATGTTGG TACCACTGTA GCTCTGAACG ATGCCGCCGG TCGGCTATTT
ATGACAGTCA GTAATATTGA CACCGCTGAC GACGATCTGG AGGACAGAAG CCAGGTGGTG
AACCGTACCT GGCAGTATGA GGGTGCCTCA CTGCCGGGAC GCCTGCTGAG CATTACAGAG
CAGGTCACTG GTGAAGCCGC CCGCGTCACC GAGCGTTTTA CGTATGCCGC CAACACGGAC
GAGAAAAAAG CCCTGAACCT GGTCGGACAG TGCGTCAGCC ACTACGACAC AGCAGGTCTG
AGGCAGATGG ACAGTATTGC CCTGACTGGT GTGCCACTCT CCGTCACCCG CCGCTTGCTG
AAGGATGCGG ACAACCCGGA TACCGTGGCA AACTGGCAGG GTGAGGACAG CTCCGCCTGG
AATGACCAGT TGGCAGCAGA CAGGCTAACC ACTCTGACGA CTGCGGACGC TATCGGCGCG
GTGCTAACCA CCACCGATAC AAAAGGCAAC GTGCAGCGGG TGGTGTACAA CGTGGCAGGC
CTGCTGTCGG GCAGTTGGCT GAGGGTGAAA GGCGGCGCGG AGCAGGTTAT CGTTCAATCG
TTGACGTACT CGGCAGCCGG GCAGAAGCTG CGCGAGGAGC ACGGTAACGG CGTGGTGACC
ACGTACACAT ACGAGCCGGA GACGCAACGC CTGACCGGCA TCCGGACGGA GCGGTCAGCC
GGACATGCAT CCGGAACGAA GGTGCTTCAG GACTTGCGCT ATGAGTATGA TCCCGTGGGC
AACGTGCAGA GCATCAGAAA CGACGCAGAA GAAACCCGCT TCTGGCGCAA CCAGAAAGTG
GTGCCGGAGA ATACGTATGT CTACGACAGC TTGTACCAGT TGGTCAGCGC CACCGGGCGC
GTGATGGTGA ACGCCGGACA GCAGGGCCGC AGCCTGCCTT CCGCCACCCT TCCTATAGAG
AGTTCCGCAT ATACAAACTA CACCTATACC TACGATACCG CGGGTAACCT GACGCAGATC
CGACACACCC CGGCAACCGG CAGTGGTCAC ACAACGGATA TCACGGTCAG CGACCGCAGC
AACCAGGGCG TGCTGAGCAC ACTGACCACA AATCCGGCAG AGGTTGACGC TCTGTTCACG
GCAGGCGGCC AGCAGAAACA GCTACAGCCA GGACAGCATC TTATCTGGAC AGCGCGTAAC
GAGCTGCTGA AGGTGACGCC GGTGGTACGG GACGGAGACA GCGATGACAG GGAAAGCTAC
CGTTATGACG GAAACAGCCA GCGCATCCTG AAGGTCAGCG TGCAGAAAAC GGGAGGCAGC
ACGCAGACGC AGCGGGTTAT GTACCTGCCG AGGCTGGAAC TGCGTAGCAC AGCCAGCGGC
GTGACAGAAA CGGAAAGCCT GCAGATTATC ACTGTCGGTG AGGCGGGCCG GGCGCAGGTG
CAGGTGCTGC ATTGGGAAAA GGGCAAGCCG GACGCTATCG ATAATGACCA ACTGCGCTAC
AGCTATGACA ACTTGATCGG CAGCAGTACG CTTGAGGTGG ATGGGGATGG CAATGTTATC
AGCATGGAGG AATACTACCC GTACGGCGGC ACGGCGGTGT GGACAGCGCA CAGCCAGACG
GAAGCAGACT ACAAAACGAT CAGATACTCA GGGAAGGAAC GCGATGCGAC GGGGTTGTAC
TACTATGGCT GGCGTTATTA CCAACCTTGG GTGGGGCGCT GGCTCTCCGC AGATCCGGTG
GGAACAGTGG ACGGACTGAA CCTGTATTTG ATGGTAGGAA ATAACCCAAC ATCCTTCCAT
GACAGTAATG GCTTAATACG TGAAGGACAG AGTGCAAGGA AATTAGTGGG GGAAGCCTTT
GTGTATCCTT TACATATGTC GGTGTTTGAA CGCATATCCA TTGAAGAGAA TATGGCAATG
AGCGTAAGGG AAGCGGGTAT TTATACTATT TTAGCACTGG GTGAAGGTGC AGCAGCAAAA
GGCCATAATA TTCTTGAGAA AACAATTAAA CCCGGATCCC TGAAGGCTGT CTATGAGAAT
AAGGCCGGAG CTGCTCTTGA ACTGGCAAAA AATAGTGGTT TTATTGGCCG GGTTGGCCGG
TGGAATGCGT CTGGTGTGCA GGGGGTTTAT GCGTACAACA GACCAAGCGG GGAGGATTTG
GTTTATCCTG CCAGCCTGCA GGATACTTCT GATAATGAAT TAGTGAATGC ATGGATAAAA
CATAAGATAA TCACGCCTTA TACTGGGGAT TATGACATGC ACGATATTAT TAAATTCAAT
CGTGGAAAAG GGTATGTGCC CACCGCGGAA AGCGCTGAGG AAACAGGAGT AAAAGACCTA
ATTAATAAAG GCGTTGCAGA AGTCGATCCC GCCCGGCCTT TTGAGTATAC AGCGATGAAT
GTCATTCGCC ATGGGCCTCA GGTAAACTTT GTTCCTTACA TGTGGGAATA TGAACACGAT
AAAGTTGTTA GCGATAATGG CTATCTGGGG GTAGTTGCGC GTCCAGGTCC ATTCCCGATA
GCAATGGTAC ATCAGGGGCA ATGGACTGTT TTTGACGACA GTAAAGAGCT GTTTAACTTT
TACAAATCGA GTAATACTCC GCTACCAGAA CACTGGCAAC AGAATTTTAT TGCAAGAGGC
CCTGGTATAG TTGCAACTCC GCGGCATGCT GACGTTCTTG ATAAACGACG AATCATGCAT
TAA
 
Protein sequence
MNTSLFSKTP AVTVLDNRGL SVRSIAYHRH PDSPDDTGER ITHHQYDARG FLTQSADPRL 
HDTGRANVKY LSDLVGNALC TVSADVGTTV ALNDAAGRLF MTVSNIDTAD DDLEDRSQVV
NRTWQYEGAS LPGRLLSITE QVTGEAARVT ERFTYAANTD EKKALNLVGQ CVSHYDTAGL
RQMDSIALTG VPLSVTRRLL KDADNPDTVA NWQGEDSSAW NDQLAADRLT TLTTADAIGA
VLTTTDTKGN VQRVVYNVAG LLSGSWLRVK GGAEQVIVQS LTYSAAGQKL REEHGNGVVT
TYTYEPETQR LTGIRTERSA GHASGTKVLQ DLRYEYDPVG NVQSIRNDAE ETRFWRNQKV
VPENTYVYDS LYQLVSATGR VMVNAGQQGR SLPSATLPIE SSAYTNYTYT YDTAGNLTQI
RHTPATGSGH TTDITVSDRS NQGVLSTLTT NPAEVDALFT AGGQQKQLQP GQHLIWTARN
ELLKVTPVVR DGDSDDRESY RYDGNSQRIL KVSVQKTGGS TQTQRVMYLP RLELRSTASG
VTETESLQII TVGEAGRAQV QVLHWEKGKP DAIDNDQLRY SYDNLIGSST LEVDGDGNVI
SMEEYYPYGG TAVWTAHSQT EADYKTIRYS GKERDATGLY YYGWRYYQPW VGRWLSADPV
GTVDGLNLYL MVGNNPTSFH DSNGLIREGQ SARKLVGEAF VYPLHMSVFE RISIEENMAM
SVREAGIYTI LALGEGAAAK GHNILEKTIK PGSLKAVYEN KAGAALELAK NSGFIGRVGR
WNASGVQGVY AYNRPSGEDL VYPASLQDTS DNELVNAWIK HKIITPYTGD YDMHDIIKFN
RGKGYVPTAE SAEETGVKDL INKGVAEVDP ARPFEYTAMN VIRHGPQVNF VPYMWEYEHD
KVVSDNGYLG VVARPGPFPI AMVHQGQWTV FDDSKELFNF YKSSNTPLPE HWQQNFIARG
PGIVATPRHA DVLDKRRIMH