Gene Avi_4158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_4158 
Symboldcp 
ID7386934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3506323 
End bp3508407 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content59% 
IMG OID643652852 
Productpeptidyl-dipeptidase 
Protein accessionYP_002551025 
Protein GI222150068 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.964957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATTTC CAGACCGTAA TCTTGGCCCC AACGGTGCCT TTGCCACCGT CACCGAATGG 
AATGGGCCGC ATGGCCTGCC GAATTTCACC GCCATCGGCG ATGAGGATTT TGTCCCGGCT
TTCGACATGG CGCTGGCTGA ACACGACGCT GATATCGACA CCATCGCCCA TTATCCGTCG
GAGCCGACAT TCGACAACAC CATCGTTGCG CTGGAGATAG CCGGGGACGG CCTGTCGCGG
GTCTCGGCGC TGTTCTGGAA CAAGGCCGGT GCCGATACCA ATCAGGTTAT CCAGGCGCTG
GAGCGCGAGA TCGCGCCGAA AATGTCGCGC CACTATTCAA AGATCAGCAT GAATGCCGCA
CTGTTTGCCC GCGTCGATGC CTTATGGGAA AAGCGCGACA GCCTGGGCCT GACGCTGGAG
CAGACCCGGG TGCTGGAGCG GCACTGGAAA GGCTTCGTCA AGGCTGGTGC CAAGCTCGCC
AAGCCCGAAC AGGAGCGGTT GGCGGCGATC AATGAGCGGC TGGCCAGCCT CGGTGCCAAT
TTCGGCCAGA ATGTGCTGGG CGATGAGACC GATTGGGCAT TGCCGCTGAC CAGCGATGAC
GAGCTGGCGG GCATTCCCGA TTTTCTGAAG GATGCGATGG CCTCTGCCGC GCAAGCCCGT
GGCAAGGGGG AATCCTATGC CGTGACGCTG TCGCGCTCGG TCATCGTCCC CTTCCTGACC
TTTTCCGAGC GGCGGGACCT GCGCGAAACA GCCTTCAAGG CCTGGGTGGC GCGTGGTGAA
AACGGTGGTG AACGTGACAA CCGCGCCATC GTTACCGAAA CCCTGGCGCT TCGGGCGGAA
AAGGCCAAGC TATTGGGCTA CAAGAATTTT GCCGCCCTGA AGCTCGACAA TACCATGGCC
AAGACCCCGG AAGCGGTCAA CGGCCTGCTG ATGCAGGTCT GGGAACGCGC CGTCGCCCAA
GCCGCCATCG AAGAGCAGGA ATTGGCGGAG TTGATTGCCA AGGACGGTAA GAATCACGCG
GTTGCGCCCT GGGATTGGCG TTTTTATGCC GAGAAGCTGC GCTCCGAGCG GTTCAATTTT
TCGGAAGCTG AACTGAAGCC TTATCTGCAA CTGGAAAAAA TCATCGAAGC CTGCTTTGCC
GTGGCGCAAA AGCTGTTCGG CATCACTGCC GTGCCGCTGA AGGACGTGAA GGGCTATCAC
CCCGATGTGC GGGTATTTGA AATCCGCGAG GCGGATGGGA CAGTGAAGGC GCTGTTCCTT
GGCGATTATT TCGCCCGGTC CTCGAAGCGC TCCGGTGCTT GGATGAGCTC CTTCCAGTCG
CAGCACAAGC TGCCGCTGAA GAACGGTGCG CAGGGCGAAT TGCCGATCAT TTACAATGTC
TGCAATTTCG CCAAGCCTGC CGAAGGCAAG CCAGCGCTGC TGTCGCTGGA CGATGCCCGC
ACGCTATTTC ATGAATTCGG TCATGCCCTG CATGGGATGC TGTCTGATGT CACTTACCCG
TCAGTATCGG GCACGGCGGT GTCGCGTGAC TTTGTCGAAC TGCCCTCGCA GCTCTATGAA
CATTGGCTGA CGGTGCCGGA TATCCTGAAA ACCTATGCCG TGCATTACCA GACCGGTGAG
GCCATGCCAC AGGCCTTGCT CGATAAGGTT CTGGCAGCGC AAACCTTCAA TGCCGGGTTC
GATACGGTCG AATTCACCTC TTCGGCGCTG GTCGATATGG CGTTTCACAC CCGGGAGGAT
CGAGTGGCCG ATCCGATGGC GGTGCAGGCC GAGATTCTCC AAAATATCGG CATGCCGTCC
TCCATCGTCA TGCGCCATGC CACACCGCAT TTCCAACATG TGTTTTCCGG CGATGGCTAT
TCGGCTGGCT ATTATTCCTA CATGTGGTCG GAAGTGCTGG ATGCCGATGC CTTCGAGGCT
TTCGAGGAAA CCGGCAATGC CTTCGACCCT GATATGGCAG AGCGCCTGAA GGACAATATC
TACGCCATTG GCGGTGCAGT GGACCCGGAA GAAACCTACA AGGCTTTCCG TGGCCGGTTG
CCGAGCCCGG AAGCGATGTT GAAGAAGCGC GGGCTTGCGG CATAA
 
Protein sequence
MTFPDRNLGP NGAFATVTEW NGPHGLPNFT AIGDEDFVPA FDMALAEHDA DIDTIAHYPS 
EPTFDNTIVA LEIAGDGLSR VSALFWNKAG ADTNQVIQAL EREIAPKMSR HYSKISMNAA
LFARVDALWE KRDSLGLTLE QTRVLERHWK GFVKAGAKLA KPEQERLAAI NERLASLGAN
FGQNVLGDET DWALPLTSDD ELAGIPDFLK DAMASAAQAR GKGESYAVTL SRSVIVPFLT
FSERRDLRET AFKAWVARGE NGGERDNRAI VTETLALRAE KAKLLGYKNF AALKLDNTMA
KTPEAVNGLL MQVWERAVAQ AAIEEQELAE LIAKDGKNHA VAPWDWRFYA EKLRSERFNF
SEAELKPYLQ LEKIIEACFA VAQKLFGITA VPLKDVKGYH PDVRVFEIRE ADGTVKALFL
GDYFARSSKR SGAWMSSFQS QHKLPLKNGA QGELPIIYNV CNFAKPAEGK PALLSLDDAR
TLFHEFGHAL HGMLSDVTYP SVSGTAVSRD FVELPSQLYE HWLTVPDILK TYAVHYQTGE
AMPQALLDKV LAAQTFNAGF DTVEFTSSAL VDMAFHTRED RVADPMAVQA EILQNIGMPS
SIVMRHATPH FQHVFSGDGY SAGYYSYMWS EVLDADAFEA FEETGNAFDP DMAERLKDNI
YAIGGAVDPE ETYKAFRGRL PSPEAMLKKR GLAA