Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A2952 |
Symbol | |
ID | 5801424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 3112747 |
End bp | 3114354 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641340798 |
Product | hypothetical protein |
Protein accession | YP_001607328 |
Protein GI | 162418688 |
COG category | [S] Function unknown |
COG ID | [COG3455] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03349] type IV / VI secretion system protein, DotU family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.314306 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAT TTGAACGCCA GATCCGTGCA GCCATTTCCG CAGCACGCAA TGGCGCAAAA CATGCGGAAC AGTCACTGAC TACACCAATG TGGCAAGCCA AAAGCACCGT AGCCTCATTG GGTGGGATTG TCCCTAGAAG TGGCTCTTCG TCAACGTCAC AGGCGGAGAA CTATAAGGAA GGTCTCGCGG ACCAGGCTGC CTCGGGCAAC AACATGGCGC GCACGAGTGC GCCACCGGTC ACTTTGTATC AGCAACAGCC AAATGCGAAT GACAGCTATC CAAACGGGAA TAACAACAAT CCAAACGGGG ATAACAACAA TCCAAACGGG AGTAACAACA ATATAGCGAG AGTACAGCGT ATGCCGCATG GCATTTCCAG GGGCTTATAT GAGCGCCCTG GGATGTTATT GGGTGCCTGG GATAACGCCT ATATTGCTGC GGCTATGCCT TTGCTGCTGG TGGAAAATAT TCGTAGCTGG CCGACGCGTA ACGCCGCAGA GGTCAGGCCA CCGATTGTGC GGGAATTACA ATATTTCCAG CAACATTTGC AGAAAAAGAA CTACCCGCAA GAAGACATTA ACCACCTGTC TTACCTGCTA TGTACCTATA TCGATGGCAT TTTTAACGGG CTGCAAACCC CAGACTCCTA CAACCAAAGT CTGTTAGTGG AGTTTCACCG TGATGCCTGG GGGGGTGAGG ACTGCTTCGA ACATCTGCGG GTCTATATGA ACTCGCCGAA ACAGTACCGG GAAGTTCTGG AATTCTATGA TCTGATTATG TGCCTTGGTT TTGACGGTAA ATACCAGATG ATAGAGCATG GTGCGGTTCT GCTGATGGAT TTACGCAGCC GTCTCCACAC GCAACTCTAC GGTCAGGACG CCACACAATC TTTGGCTATC GCGCAAGCGG TCAAAGGTTC TCCGCGTCGC CAATATATCA AGGCGCTGAA AATCTTCACC TATGGTTTCG CACTGTGCCT TTGTGCTTAC GGCGTCACGG CGTGGTATCT GCACCAGCAA TCCCAACAGA TCCGCAGCAA CATTCTGACG TGGGTACTGC CTGAACCGCG GAAAATCAAC ATCATGGAGA CCTTGCCGAA TCCGCTATCC AACATCCTGA ATGAAGGGTG GCTGGAGGTC AGGAAAGATC CGCGTGGATG GCTATTAATC TTCACCTCCG ACGGCGCGTT CCGCACGGGT GAAGCGACCC TCTCGGAAGA GTTTATCAAC AAGAAGAATA TCGAACGTCT TGGGCTGGCA TTAGCCCCAT GGCCGGGAGA TATCGAGGTT ATTGGTCATA CGGATAACAA ACCGTTCCGT AGCACTTCCG GTAACAACAA CCTCAAACTT TCCGCGGCCA GAGCATCGGT GGTGGCAGAT AAACTGCGGG AATCCACTCA AATCAACGAA ACCCATCAGC GAGAAATAAG TGCCATCGGA CGGGGGGAGA GCGATCCTTT AGCTGACAAT GCAACGGAAG AAGGGCGCAA GCGTAACCGG CGTGTGGATA TCCTATGGAA AATTGGTCAG CGCGATGCCG ATAAGGCCAT GAAGCAATTC CTGGAGAACC CAACACCAGA AGTTCAAGGA ACGAATACCC AACAATAG
|
Protein sequence | MNEFERQIRA AISAARNGAK HAEQSLTTPM WQAKSTVASL GGIVPRSGSS STSQAENYKE GLADQAASGN NMARTSAPPV TLYQQQPNAN DSYPNGNNNN PNGDNNNPNG SNNNIARVQR MPHGISRGLY ERPGMLLGAW DNAYIAAAMP LLLVENIRSW PTRNAAEVRP PIVRELQYFQ QHLQKKNYPQ EDINHLSYLL CTYIDGIFNG LQTPDSYNQS LLVEFHRDAW GGEDCFEHLR VYMNSPKQYR EVLEFYDLIM CLGFDGKYQM IEHGAVLLMD LRSRLHTQLY GQDATQSLAI AQAVKGSPRR QYIKALKIFT YGFALCLCAY GVTAWYLHQQ SQQIRSNILT WVLPEPRKIN IMETLPNPLS NILNEGWLEV RKDPRGWLLI FTSDGAFRTG EATLSEEFIN KKNIERLGLA LAPWPGDIEV IGHTDNKPFR STSGNNNLKL SAARASVVAD KLRESTQINE THQREISAIG RGESDPLADN ATEEGRKRNR RVDILWKIGQ RDADKAMKQF LENPTPEVQG TNTQQ
|
| |