Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_2058 |
Symbol | pip |
ID | 7386872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 1688809 |
End bp | 1689783 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643651281 |
Product | proline iminopeptidase |
Protein accession | YP_002549476 |
Protein GI | 222148519 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.689253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAC CAGATCTTCA TCCGCCCTTG CCACCTTACC GAACCGGCCA TCTGCCGGTC ACCGACGGCC ACCAGATCTA TTTCGAATGC AGCGGCAACC GTAAGGGCAT CCCGGCCCTT ATTCTGCATG GCGGCCCAGG CTCCGGGCTC TCTGAAACAA CACGCCGGTT CTTCGACCCC GCGCACTATC ACATCATCCA GTTCGACCAG CGCCACTGCG GCCGCAGTCT TCCCTTTGCT GGCGATCCGG TGGTCGATCT CAGCACCAAC ACACTGCCGC ATCTGCTTGA GGATATCGAA GCCTTACGCC TCCATCTCGG CATTGAGCGC TGGTTGGTGA TGGGCGGATC ATGGGGATCA ACGCTAGCGC TCGCCTATGC CCAAGCGCAT CGGACACGGG TTCTCGGCCT GCTGCTGACC ATGGTCGTCA CGACAAGTGC TGCCGAAATC GAATGGATCA CGCGCGGCGT CGGACAGTTT TTTCCGGCAG AACATGCACG CTTTCTCGAC CATCTTCCAA GGGATCAGCA AGACGGCGAT CTCACTACCG CCTATCACCG GCTGCTGATC AATCCTGACA AAGAGATTCA TGAAAAGGCC GCAAGTGCCT GGTGTGCATG GGAATCTGCC ACTCTCTCCA TAAAGCCGGG CTATGTCCCA CATGTCAGGT GGTCGGACGC CCGATTTCGG CTGTGCTTTT CCCGGCTCGT CACCCATTAC TGGTCGCATC GTGCCTGGCT AGCCGATGGC GAAATCCTTC GGCGTATCAA ACTATTAGAG GGTCTTCCTG CGATATTGAT TCATGGCCGA CTCGATTTCG GCAGCCCACT CAAGACCGCC TATGACCTGC ACCTTGCATG GGCTGGCAGT CGGTTGATTA TCGTCGAGGA TGGCGAACAC AATATCAGTG CGCCAGGTAT GGCGGCCAAC GTCATCAGGT CGCTTCAACA GCTTGCCCAC GAATTGAAAC CTTGA
|
Protein sequence | MTGPDLHPPL PPYRTGHLPV TDGHQIYFEC SGNRKGIPAL ILHGGPGSGL SETTRRFFDP AHYHIIQFDQ RHCGRSLPFA GDPVVDLSTN TLPHLLEDIE ALRLHLGIER WLVMGGSWGS TLALAYAQAH RTRVLGLLLT MVVTTSAAEI EWITRGVGQF FPAEHARFLD HLPRDQQDGD LTTAYHRLLI NPDKEIHEKA ASAWCAWESA TLSIKPGYVP HVRWSDARFR LCFSRLVTHY WSHRAWLADG EILRRIKLLE GLPAILIHGR LDFGSPLKTA YDLHLAWAGS RLIIVEDGEH NISAPGMAAN VIRSLQQLAH ELKP
|
| |