Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A2566 |
Symbol | |
ID | 5801037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 2683567 |
End bp | 2686509 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641340435 |
Product | hypothetical protein |
Protein accession | YP_001606977 |
Protein GI | 162421106 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00381829 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACAT CACTATTCAG TAAAACCCCG GCGGTCACGG TCCTCGACAA CCGTGGGCTG TCCGTGCGCA GCATCGCGTA CCATCGCCAT CCCGACTCGC CGGATGACAC TGGTGAGCGT ATCACCCATC ACCAGTATGA TGCCCGTGGT TTCCTCACGC AAAGCGCCGA TCCCCGCCTG CACGACACTG GTCGGGCGAA CGTCAAGTAC CTGTCGGACC TGGTCGGAAA CGCCCTGTGT ACCGTCAGCG CCGATGTTGG TACCACTGTA GCTCTGAACG ATGCCGCCGG TCGGCTATTT ATGACAGTCA GTAATATTGA CACCGCTGAC GACGATCTGG AGGACAGAAG CCAGGTGGTG AACCGTACCT GGCAGTATGA GGGTGCCTCA CTGCCGGGAC GCCTGCTGAG CATTACAGAG CAGGTCACTG GTGAAGCCGC CCGCGTCACC GAGCGTTTTA CGTATGCCGC CAACACGGAC GAGAAAAAAG CCCTGAACCT GGTCGGACAG TGCGTCAGCC ACTACGACAC AGCAGGTCTG AGGCAGATGG ACAGTATTGC CCTGACTGGT GTGCCACTCT CCGTCACCCG CCGCTTGCTG AAGGATGCGG ACAACCCGGA TACCGTGGCA AACTGGCAGG GTGAGGACAG CTCCGCCTGG AATGACCAGT TGGCAGCAGA CAGGCTAACC ACTCTGACGA CTGCGGACGC TATCGGCGCG GTGCTAACCA CCACCGATAC AAAAGGCAAC GTGCAGCGGG TGGTGTACAA CGTGGCAGGC CTGCTGTCGG GCAGTTGGCT GAGGGTGAAA GGCGGCGCGG AGCAGGTTAT CGTTCAATCG TTGACGTACT CGGCAGCCGG GCAGAAGCTG CGCGAGGAGC ACGGTAACGG CGTGGTGACC ACGTACACAT ACGAGCCGGA GACGCAACGC CTGACCGGCA TCCGGACGGA GCGGTCAGCC GGACATGCAT CCGGAACGAA GGTGCTTCAG GACTTGCGCT ATGAGTATGA TCCCGTGGGC AACGTGCAGA GCATCAGAAA CGACGCAGAA GAAACCCGCT TCTGGCGCAA CCAGAAAGTG GTGCCGGAGA ATACGTATGT CTACGACAGC TTGTACCAGT TGGTCAGCGC CACCGGGCGC GTGATGGTGA ACGCCGGACA GCAGGGCCGC AGCCTGCCTT CCGCCACCCT TCCTATAGAG AGTTCCGCAT ATACAAACTA CACCTATACC TACGATACCG CGGGTAACCT GACGCAGATC CGACACACCC CGGCAACCGG CAGTGGTCAC ACAACGGATA TCACGGTCAG CGACCGCAGC AACCAGGGCG TGCTGAGCAC ACTGACCACA AATCCGGCAG AGGTTGACGC TCTGTTCACG GCAGGCGGCC AGCAGAAACA GCTACAGCCA GGACAGCATC TTATCTGGAC AGCGCGTAAC GAGCTGCTGA AGGTGACGCC GGTGGTACGG GACGGAGACA GCGATGACAG GGAAAGCTAC CGTTATGACG GAAACAGCCA GCGCATCCTG AAGGTCAGCG TGCAGAAAAC GGGAGGCAGC ACGCAGACGC AGCGGGTTAT GTACCTGCCG AGGCTGGAAC TGCGTAGCAC AGCCAGCGGC GTGACAGAAA CGGAAAGCCT GCAGATTATC ACTGTCGGTG AGGCGGGCCG GGCGCAGGTG CAGGTGCTGC ATTGGGAAAA GGGCAAGCCG GACGCTATCG ATAATGACCA ACTGCGCTAC AGCTATGACA ACTTGATCGG CAGCAGTACG CTTGAGGTGG ATGGGGATGG CAATGTTATC AGCATGGAGG AATACTACCC GTACGGCGGC ACGGCGGTGT GGACAGCGCA CAGCCAGACG GAAGCAGACT ACAAAACGAT CAGATACTCA GGGAAGGAAC GCGATGCGAC GGGGTTGTAC TACTATGGCT GGCGTTATTA CCAACCTTGG GTGGGGCGCT GGCTCTCCGC AGATCCGGTG GGAACAGTGG ACGGACTGAA CCTGTATTTG ATGGTAGGAA ATAACCCAAC ATCCTTCCAT GACAGTAATG GCTTAATACG TGAAGGACAG AGTGCAAGGA AATTAGTGGG GGAAGCCTTT GTGTATCCTT TACATATGTC GGTGTTTGAA CGCATATCCA TTGAAGAGAA TATGGCAATG AGCGTAAGGG AAGCGGGTAT TTATACTATT TTAGCACTGG GTGAAGGTGC AGCAGCAAAA GGCCATAATA TTCTTGAGAA AACAATTAAA CCCGGATCCC TGAAGGCTGT CTATGAGAAT AAGGCCGGAG CTGCTCTTGA ACTGGCAAAA AATAGTGGTT TTATTGGCCG GGTTGGCCGG TGGAATGCGT CTGGTGTGCA GGGGGTTTAT GCGTACAACA GACCAAGCGG GGAGGATTTG GTTTATCCTG CCAGCCTGCA GGATACTTCT GATAATGAAT TAGTGAATGC ATGGATAAAA CATAAGATAA TCACGCCTTA TACTGGGGAT TATGACATGC ACGATATTAT TAAATTCAAT CGTGGAAAAG GGTATGTGCC CACCGCGGAA AGCGCTGAGG AAACAGGAGT AAAAGACCTA ATTAATAAAG GCGTTGCAGA AGTCGATCCC GCCCGGCCTT TTGAGTATAC AGCGATGAAT GTCATTCGCC ATGGGCCTCA GGTAAACTTT GTTCCTTACA TGTGGGAATA TGAACACGAT AAAGTTGTTA GCGATAATGG CTATCTGGGG GTAGTTGCGC GTCCAGGTCC ATTCCCGATA GCAATGGTAC ATCAGGGGCA ATGGACTGTT TTTGACGACA GTAAAGAGCT GTTTAACTTT TACAAATCGA GTAATACTCC GCTACCAGAA CACTGGCAAC AGAATTTTAT TGCAAGAGGC CCTGGTATAG TTGCAACTCC GCGGCATGCT GACGTTCTTG ATAAACGACG AATCATGCAT TAA
|
Protein sequence | MNTSLFSKTP AVTVLDNRGL SVRSIAYHRH PDSPDDTGER ITHHQYDARG FLTQSADPRL HDTGRANVKY LSDLVGNALC TVSADVGTTV ALNDAAGRLF MTVSNIDTAD DDLEDRSQVV NRTWQYEGAS LPGRLLSITE QVTGEAARVT ERFTYAANTD EKKALNLVGQ CVSHYDTAGL RQMDSIALTG VPLSVTRRLL KDADNPDTVA NWQGEDSSAW NDQLAADRLT TLTTADAIGA VLTTTDTKGN VQRVVYNVAG LLSGSWLRVK GGAEQVIVQS LTYSAAGQKL REEHGNGVVT TYTYEPETQR LTGIRTERSA GHASGTKVLQ DLRYEYDPVG NVQSIRNDAE ETRFWRNQKV VPENTYVYDS LYQLVSATGR VMVNAGQQGR SLPSATLPIE SSAYTNYTYT YDTAGNLTQI RHTPATGSGH TTDITVSDRS NQGVLSTLTT NPAEVDALFT AGGQQKQLQP GQHLIWTARN ELLKVTPVVR DGDSDDRESY RYDGNSQRIL KVSVQKTGGS TQTQRVMYLP RLELRSTASG VTETESLQII TVGEAGRAQV QVLHWEKGKP DAIDNDQLRY SYDNLIGSST LEVDGDGNVI SMEEYYPYGG TAVWTAHSQT EADYKTIRYS GKERDATGLY YYGWRYYQPW VGRWLSADPV GTVDGLNLYL MVGNNPTSFH DSNGLIREGQ SARKLVGEAF VYPLHMSVFE RISIEENMAM SVREAGIYTI LALGEGAAAK GHNILEKTIK PGSLKAVYEN KAGAALELAK NSGFIGRVGR WNASGVQGVY AYNRPSGEDL VYPASLQDTS DNELVNAWIK HKIITPYTGD YDMHDIIKFN RGKGYVPTAE SAEETGVKDL INKGVAEVDP ARPFEYTAMN VIRHGPQVNF VPYMWEYEHD KVVSDNGYLG VVARPGPFPI AMVHQGQWTV FDDSKELFNF YKSSNTPLPE HWQQNFIARG PGIVATPRHA DVLDKRRIMH
|
| |