Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A2958 |
Symbol | |
ID | 5801430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 3118410 |
End bp | 3120629 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641340802 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001607332 |
Protein GI | 162421891 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0161296 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCA TTCGCCCGCA ACAGCTCGTG GTCTTGAAAA GCAGTTACCA GATAGGCCAT GAAAGCCATA TGGGGATCAG CGTCGTTGCT GGCTGTTATC TCTCTAAACC TGAGCATATG GTGACGGAGT CACAGATTTG GCAGGCATGG AAAGCGGCAC CGCTCTCTTT CCGCATGTTG GACAGTGCTG AGCCAAAACC CTTTGCTGAG TTTTTGCTGG CCGGTCATGC CGGTATTGGT GAAGAGGTTA CCTCGCTGAG TGCAGAGGTT AGTGTTGGCT CACTGACCCG GCGCTGGTGT ATTGAGGGCG AAAGCAACAA AACGGGTCTG GTCATTAAAC CTTTCTTACG TATGTCGATG GACCATACGC AAAGTTGGGG GGGCAAAGGC TGTAAGGAGA ATCCACTGGG ACGTGGGTAT AACGATGAGC GCAAACCGAC GATTATGTCT TTAGGCCTTG ACGGCTCTGC TATCGTTCGC TCACCGCTGG CGTCTCCGTC ACCTGTGCCT CATGACTTCC AACTGCGTAA AGTGCATATC AATGAAGTCG CCTCGACCAT GACCGATCCT CAATATCTGG AAACATTTTA TCCCGGTTTG CCACCGCAAA TTGATCGTCG CTATTTCCAA ATGGCTCCGC CAGGGCAGTG GCTGAAGAAA AGTGCATGGC CTGATAGCGT ACCGTTCAAA CTCATCGGTT TTCGCCCGGA CAATGAGGAG ATCAGCGGTG CGTTTCCTGC GGTCAGTGCA AGGGCGTTTG TTTGGGATAA CCCTTCAGCG CCCCCCAGTG AGGTGACCTT ACTGCGGAAA ACACTGTGGC TGTTACCGGA TAACGATATG GGGCTAATGG TGTTTACCGG CAGCGTGCCA CTGACTCACC TTTTTGATGA GCCTATCGAT ACGTTGCTGG TGGGATTGGA TGACTCCCAT TCGCTACGTG AGTTGGAATA TTACCAACAG GTCTATAAAA GTCGCAGCGT TGAAGGTGCT GCGAGTTTTG AATTCCTCAA AGATCCGGAA CTGATGCCAG AGGGGATGCC GCTGAACGTC ATCCGGGATT TGGCGGATCA CCCAGACTCG CTGCGTTATA GCGCTTCCGC CATGTCTGAA GCGGAGTCCG AACGTTTCTA TCAGGATGTT CAGGATGCTA TCGATCGGCA GGAACAGCAG AAGAGTGAAG AACAAGAGAC GCTGGGTGAT TTGAATGTCC CCGCAGCCGG CAAAGAGGAA GCGGGAACCC AATGGTTGGA AAGCAAAGAA GATACGGCAA CCAACGTCAC ATTTTTAGGG ACTGACTTCT CTGGAATGAC CTTGGACAAC AAGCAATTCC GCTATTGCAT GTTTACCGGT TGCCATTTTG ACAAGGCGAC ATTTAAAGAC TGCACCTTCG AGCATTGCCA GTTTACGCAA AGTGATTTTG AAAACTCCCG TTGGAACAAT GTGCATTTAA GTGGCTGTTT ATTCAAACAG GCAGAGTGGC AAAAAGCCGC CTTTACCCAC TGTAAATGGG AGAAATCCAC CTTTGAGTAT GGGGTGTTTA AACACGCTCA GTTTACCGAC AATGCGTTAG ATAACTGCCT GATTAACCAT AGTGATTTCA GCCTTGGCAC GTTTGATCAT TGTACGCTGA ATGGCTGTTT CTTCTCCGAA ACACATTGTG ATCAAACACA ATTTAATCAG GTCATCATCA CGTCGTGCAT ATTCGAAAAA TGCGACGGCC CGAAGGCTTG CTTTACCGAA AGCACGATAG AGAAAACCTC GTTTATTAGC AGCAGTTGGG TGGGGGGGCG CTTGAGTCAT TGCTATCTCA ATAGCTTGAC CACGGGCCTG AATACCAATC TCTCTGAGTC GCATTTTGAG CAGTGCAGCC TGAATAAAAT GGGCTTCCTC AAGGTCAATT TACAATCCAG TACCTTTATT AATTGCTCGA TGTTGGAGAG TTGCTGCGAT AAGGCTGATT TCTCTCAGGC GACGCTGATT GCCTGTGATA TGACCGCGGT ACGGTTAAAA GATGCCAACT TAGTCCATAG CCACTGGCAG AACACCAGCT TACAGCAAAG CATGTTTTAC AACGCTGACT TACGTGATGC CACTTTCCAG CGTTGCAATC TGGCGGGCGC TAATCTGGCG ATGATCAGCC AAAACATGGA CACCCGATTT GAACATTGTT TGACGGAAAA GACGCACTGG ATCCCGCGTC GTTACACCGT CCCGGCATAA
|
Protein sequence | MRIIRPQQLV VLKSSYQIGH ESHMGISVVA GCYLSKPEHM VTESQIWQAW KAAPLSFRML DSAEPKPFAE FLLAGHAGIG EEVTSLSAEV SVGSLTRRWC IEGESNKTGL VIKPFLRMSM DHTQSWGGKG CKENPLGRGY NDERKPTIMS LGLDGSAIVR SPLASPSPVP HDFQLRKVHI NEVASTMTDP QYLETFYPGL PPQIDRRYFQ MAPPGQWLKK SAWPDSVPFK LIGFRPDNEE ISGAFPAVSA RAFVWDNPSA PPSEVTLLRK TLWLLPDNDM GLMVFTGSVP LTHLFDEPID TLLVGLDDSH SLRELEYYQQ VYKSRSVEGA ASFEFLKDPE LMPEGMPLNV IRDLADHPDS LRYSASAMSE AESERFYQDV QDAIDRQEQQ KSEEQETLGD LNVPAAGKEE AGTQWLESKE DTATNVTFLG TDFSGMTLDN KQFRYCMFTG CHFDKATFKD CTFEHCQFTQ SDFENSRWNN VHLSGCLFKQ AEWQKAAFTH CKWEKSTFEY GVFKHAQFTD NALDNCLINH SDFSLGTFDH CTLNGCFFSE THCDQTQFNQ VIITSCIFEK CDGPKACFTE STIEKTSFIS SSWVGGRLSH CYLNSLTTGL NTNLSESHFE QCSLNKMGFL KVNLQSSTFI NCSMLESCCD KADFSQATLI ACDMTAVRLK DANLVHSHWQ NTSLQQSMFY NADLRDATFQ RCNLAGANLA MISQNMDTRF EHCLTEKTHW IPRRYTVPA
|
| |