Gene Avi_2058 details


Gene Information

Locus tag: Avi_2058
Symbol: pip
ID: 7386872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 1688809
End bp: 1689783
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 58%
IMG OID: 643651281
Product: proline iminopeptidase
Protein accession: YP_002549476
Protein GI: 222148519
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
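
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes one terminal stop codon. The constant and function names are hypothetical and not part of the record.

```python
# Values copied from the record; names are illustrative, not database fields.
START_BP = 1688809
END_BP = 1689783
GENE_LENGTH_BP = 975
PROTEIN_LENGTH_AA = 324


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons agree with the listed Gene Length and Protein Length.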


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.689253
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGAC CAGATCTTCA TCCGCCCTTG CCACCTTACC GAACCGGCCA TCTGCCGGTC 
ACCGACGGCC ACCAGATCTA TTTCGAATGC AGCGGCAACC GTAAGGGCAT CCCGGCCCTT
ATTCTGCATG GCGGCCCAGG CTCCGGGCTC TCTGAAACAA CACGCCGGTT CTTCGACCCC
GCGCACTATC ACATCATCCA GTTCGACCAG CGCCACTGCG GCCGCAGTCT TCCCTTTGCT
GGCGATCCGG TGGTCGATCT CAGCACCAAC ACACTGCCGC ATCTGCTTGA GGATATCGAA
GCCTTACGCC TCCATCTCGG CATTGAGCGC TGGTTGGTGA TGGGCGGATC ATGGGGATCA
ACGCTAGCGC TCGCCTATGC CCAAGCGCAT CGGACACGGG TTCTCGGCCT GCTGCTGACC
ATGGTCGTCA CGACAAGTGC TGCCGAAATC GAATGGATCA CGCGCGGCGT CGGACAGTTT
TTTCCGGCAG AACATGCACG CTTTCTCGAC CATCTTCCAA GGGATCAGCA AGACGGCGAT
CTCACTACCG CCTATCACCG GCTGCTGATC AATCCTGACA AAGAGATTCA TGAAAAGGCC
GCAAGTGCCT GGTGTGCATG GGAATCTGCC ACTCTCTCCA TAAAGCCGGG CTATGTCCCA
CATGTCAGGT GGTCGGACGC CCGATTTCGG CTGTGCTTTT CCCGGCTCGT CACCCATTAC
TGGTCGCATC GTGCCTGGCT AGCCGATGGC GAAATCCTTC GGCGTATCAA ACTATTAGAG
GGTCTTCCTG CGATATTGAT TCATGGCCGA CTCGATTTCG GCAGCCCACT CAAGACCGCC
TATGACCTGC ACCTTGCATG GGCTGGCAGT CGGTTGATTA TCGTCGAGGA TGGCGAACAC
AATATCAGTG CGCCAGGTAT GGCGGCCAAC GTCATCAGGT CGCTTCAACA GCTTGCCCAC
GAATTGAAAC CTTGA
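
A GC content figure such as the one listed in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base grouped layout shown above; `gc_fraction` is a hypothetical helper name, and how the database rounds its percentage is not stated in the record.

```python
def gc_fraction(sequence):
    """Fraction of G and C bases in a DNA string.

    Whitespace (spaces and line breaks from the grouped layout) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence, used only as a short example input.
    first_line = ("ATGACCGGAC CAGATCTTCA TCCGCCCTTG "
                  "CCACCTTACC GAACCGGCCA TCTGCCGGTC")
    print(f"{gc_fraction(first_line):.2%}")
```

Passing the full 975 bp sequence instead of the first line gives the value to compare against the GC content field.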
 
Protein sequence
MTGPDLHPPL PPYRTGHLPV TDGHQIYFEC SGNRKGIPAL ILHGGPGSGL SETTRRFFDP 
AHYHIIQFDQ RHCGRSLPFA GDPVVDLSTN TLPHLLEDIE ALRLHLGIER WLVMGGSWGS
TLALAYAQAH RTRVLGLLLT MVVTTSAAEI EWITRGVGQF FPAEHARFLD HLPRDQQDGD
LTTAYHRLLI NPDKEIHEKA ASAWCAWESA TLSIKPGYVP HVRWSDARFR LCFSRLVTHY
WSHRAWLADG EILRRIKLLE GLPAILIHGR LDFGSPLKTA YDLHLAWAGS RLIIVEDGEH
NISAPGMAAN VIRSLQQLAH ELKP
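
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own method: it builds the codon table by hand, relying on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace from the grouped layout is ignored; a trailing partial
    codon, if any, is skipped.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence and the final 15 bases.
    print(translate("ATGACCGGAC CAGATCTTCA TCCGCCCTTG"))
    print(translate("GAATTGAAAC CTTGA"))
```

The two example inputs correspond to the first ten residues (MTGPDLHPPL) and the last four residues (ELKP) of the listed protein sequence.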