Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A1626 |
Symbol | pip |
ID | 5800097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 1678456 |
End bp | 1679406 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641339572 |
Product | proline iminopeptidase |
Protein accession | YP_001606129 |
Protein GI | 162419193 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAAT TACGTGGACT TTATCCTGCA TATGAACCTT ACGACAGCGG TTTATTAGAC ACCGGGGACG GGCATCAAAT TTATTGGGAG CTCTGCGGCA ATCCGAAGGG CAAGCCCGCG ATCTTTATTC ACGGGGGGCC AGGGGGCGGG ATTGCACCTT ATCATCGGCA GCTATTCAAC CCTGCAAAAT ATAATGTGAT GTTATTTGAT CAACGTGGCT GTGGGCGCTC TAAACCCCAT GCCAGCTTGG ATAACAACAC GACTTGGCAT CTGGTGGAGG ATATTGAACG TCTGCGCAAG ATGGCCGGGA TTGAACAATG GCTGGTGTTT GGCGGTTCTT GGGGATCGAC TCTGGCATTG GCTTATGGCG AAACACACCC TGAACGTGTC AGTGAGATGG TTCTGCGTGG GATCTTCACT TTACGCAGGA AAGAACTGCA TTGGTACTAT CAAGAGGGGG CCTCGCGCTT TTTCCCCGAG AAATGGCAGC GGGTACTGTC AATTTTATCC CCAGAAGAGC AGGGCGATGT GATAGCGGCT TATCGTAAAC GGCTGACATC ACCTGATCGG GCAATACAGC TAGAGGCCGC TAAAATATGG AGTTTGTGGG AAGGCGAAAC AGTGACCTTA TTACCAACTA AAAGCTCGGC TTCCTTTGGT GAAGAGCATT TTGCACTGGC GTTTGCCCGG ATTGAAAATC ACTATTTCAC GCATCTTGGC TTCTTGGACA GTGATAACCA GTTGTTAGAC AATGTGACAC GTATACGGCA TATCCCAGCT GTAATTATTC ATGGTCGATA TGATATGGCG TGTCAGCTAC AGAACGCCTG GGATTTAGCA CAGGCTTGGC CTGAAGCTGA GCTCTATATC GTTGAAGGTG CCGGGCACTC CTTTGATGAG CCAGGGATAC TGCATCAACT TATTCTAGCT ACTGATAAAT TTGCTCACTG A
|
Protein sequence | MEQLRGLYPA YEPYDSGLLD TGDGHQIYWE LCGNPKGKPA IFIHGGPGGG IAPYHRQLFN PAKYNVMLFD QRGCGRSKPH ASLDNNTTWH LVEDIERLRK MAGIEQWLVF GGSWGSTLAL AYGETHPERV SEMVLRGIFT LRRKELHWYY QEGASRFFPE KWQRVLSILS PEEQGDVIAA YRKRLTSPDR AIQLEAAKIW SLWEGETVTL LPTKSSASFG EEHFALAFAR IENHYFTHLG FLDSDNQLLD NVTRIRHIPA VIIHGRYDMA CQLQNAWDLA QAWPEAELYI VEGAGHSFDE PGILHQLILA TDKFAH
|
| |