Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_1479 |
Symbol | pip |
ID | 7386449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 1241822 |
End bp | 1242787 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643650857 |
Product | proline iminopeptidase |
Protein accession | YP_002549062 |
Protein GI | 222148105 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.293243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAGT TGCGCACCCT CTACCCTGAG ATCGAACCTT TCGAGACCGG CTTTCTCGAT GTCGGCGATG GCCACGTCAT CCATTGGGAG CGGGTCGGCA CCAGGGGTGC CAAGCCTGCG GTGTTTCTGC ACGGCGGACC GGGTGGCGGC ATCAATCCCA ACCAGAGGCG GGTGTTCGAC CCCGCTCTTT ATGATGTAGT CCTGTTCGAT CAACGCGGCT GTGGAAAATC CACGCCGCAT GCCCATCTGG ACGCCAACAC CACCTGGCAT CTGGTCGCCG ATATCGAGCG CTTGCGTGAC ATGATCGGCG TCGAAAAATG GCTGGTATTT GGCGGCTCCT GGGGTTCGAC GCTGGCGCTG GCCTATGCGC AGACCTATCC CGAGCGGGTC AGCGAACTGG TGCTGCGCGG CATCTACACG CTGACCAAGG CGGAGCTGGA CTGGTATTAT CAGTTCGGAG TCTCCGAGAT GTATCCTGAT CGTTGGGAGC ATTTCATCGC GCCCATTCCG CTTGAAGAGC GCCATGACAT GATTTCCGCC TATCATCGCC GTCTGACCGG AGAGGACAAG GAAGTGCAGC TCGCCTGCGC CCGCGCCTGG AGCCAGTGGG AAGGCGCGAC GATTTCGTTG ATCCCCAATT TGCAACAGAT CGAAAATTTC GGCGAGGACC ATTACGCCAT CGCCTTTGCC CGTCTGGAAA ACCATTTTTT CATGAACCGG ATCTGGATGG AAGACGGTCA ATTGCTGCGC GATGCCCATA AGCTCAAAGG CATTCCGGGC GTGATCGTGC ATGGGCGCTA TGACATGCCC TGCCCGCTGC GCTATGCATG GGAATTGTCC AAGCTCTGGC CGGATGCCGA TTTGCACATT GTCGAGGCTG CGGGCCATGC CATGAGCGAA CCGGGTATTC TCGACCAATT GATCCGTGCC ACTGACCGTT TTGCCGGAAA AATCCAAAAC ACATAA
|
Protein sequence | MTELRTLYPE IEPFETGFLD VGDGHVIHWE RVGTRGAKPA VFLHGGPGGG INPNQRRVFD PALYDVVLFD QRGCGKSTPH AHLDANTTWH LVADIERLRD MIGVEKWLVF GGSWGSTLAL AYAQTYPERV SELVLRGIYT LTKAELDWYY QFGVSEMYPD RWEHFIAPIP LEERHDMISA YHRRLTGEDK EVQLACARAW SQWEGATISL IPNLQQIENF GEDHYAIAFA RLENHFFMNR IWMEDGQLLR DAHKLKGIPG VIVHGRYDMP CPLRYAWELS KLWPDADLHI VEAAGHAMSE PGILDQLIRA TDRFAGKIQN T
|
| |