Gene Avi_3889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_3889 
Symbol 
ID7388569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3248969 
End bp3250021 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content61% 
IMG OID643652636 
Productdipeptidase 
Protein accessionYP_002550817 
Protein GI222149860 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.741978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTGG TTTTCGACGG GCATAATGAT GTCCTCCTGC GTTTGTGGAG GGCTCATGCA 
GCGGGCGTCG ATCCGGTGCG GCAATTCATC AATGGCACAC GGGAAGGTCA TATCGATGCG
CCACGCGCCC GCCGGGGCGG GCTTGGTGGC GGTCTCTGCG CGATTTATAT TCCCTCCGAT
GGCGAGTTCG TGCTGACCGA GCCGGATGAC AAGGGCCACT ACAATACCCC TGTTGATAAG
CCCCTGGCGC GTGCATCCTC ACTCGACATC GCCTTGCAGA TGGCCGCGAT TGCGCTGCGG
GTCGAGCGGG CGGGGGGCTG GCGGCTGTGC CGCTCGACAT CGGATATTCG CGCTGCAATG
GCCGAGGGCG TCTTTGCCGC CGTGCTGCAT ATGGAAGGCT GCGAGGCCAT CGATGCTGAT
CTGGCCGCCC TTGAGGTGTT TTACCAGGCG GGCCTGCGCA CGCTCGGCCC GGTCTGGAGC
CGCCCGAATA TTTTCGGGCA TGGTGTTCCC TTCGCCTTTC CAATGTCGCC GGATACCGGG
CCGGGTCTGA CGACACTCGG TTTTGAGCTT GTGAAAGCCT GCGACCGGCT GGGCATTGCC
CTCGACCTTG CCCATATCAC CGAAAAGGGC TTCTGGGACG TGGCGAAAAC CTCCGACAAA
CCGCTGATCG CCAGCCATTC CAATGCGCAC GCGCTGACAC CAGTGGCCCG CAACCTGACG
GATCGGCAGA TGGACGCGAT CCGCGAGCGC AAGGGCATCG CCGGTTTGAA TTACGCCGTG
ACCATGCTGC GCTCCGATGC CCGCGATTTT GCCGAGACCC CGCTGTCAGA TATGGTACGC
CATATCGACT ATATGGTGGA ACGCATGGGT ATCGATTGCG TCGGCCTCGG CTCCGATTTC
GACGGTTGCA CGGTGCCCGG TGCAATCGGT GATGCCAGTG GGAACCAGAG GTTGCTTGAA
GCGTTGCAAT CGGCTGGATA CGGTGATGCA GATATTGCTA AGATTGCCCG TGAAAACTGG
CTGCGGGTGC TGGGGACGAC GTGGGGCGAG TAA
 
Protein sequence
MQLVFDGHND VLLRLWRAHA AGVDPVRQFI NGTREGHIDA PRARRGGLGG GLCAIYIPSD 
GEFVLTEPDD KGHYNTPVDK PLARASSLDI ALQMAAIALR VERAGGWRLC RSTSDIRAAM
AEGVFAAVLH MEGCEAIDAD LAALEVFYQA GLRTLGPVWS RPNIFGHGVP FAFPMSPDTG
PGLTTLGFEL VKACDRLGIA LDLAHITEKG FWDVAKTSDK PLIASHSNAH ALTPVARNLT
DRQMDAIRER KGIAGLNYAV TMLRSDARDF AETPLSDMVR HIDYMVERMG IDCVGLGSDF
DGCTVPGAIG DASGNQRLLE ALQSAGYGDA DIAKIARENW LRVLGTTWGE