Gene Avi_3174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_3174 
SymbolpepQ 
ID7388117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp2629505 
End bp2630659 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content60% 
IMG OID643652101 
Productproline dipeptidase 
Protein accessionYP_002550285 
Protein GI222149328 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATTGC ATTTTTCAAC TGCCGAATAC GCCGCCCGGC TGGACCGCCT GACTGACAGG 
ATGCGCGAAC AGAAGCTGGA TGCCATGCTA CTGTTTGCCC AGGAAAGCAT GTATTGGCTG
ACCGGCTATG ACACGTTTGG CTATTGCTTC TTCCAGACAC TGGTGGTGAA GGCCGATGGC
TCGATGACGC TTCTGACCCG TTCGGCGGAT CTGCGCCAGG CCCGCCAGAC CTCAACCATC
GATAATATCC TGATCTGGGT TGACCGTACC AACGCCGATC CGACCAGCGA CCTGAAGGAT
CTGCTCAACG ATCTCGACCT GCTCGGCTGC CGTCTTGGCA TCGAATACGA CACCCATGGC
ATGACCGGAC GGGTCGCTCG GCTGCTGGAC AATCAATTGC TGAGCTTCGG TGAATTGATC
GACGCCTCGA TGCTGGTCAG CGAATTGCGG CTGATCAAGA GCCCAGAGGA AATCGCTTAT
GTCGAAAAGG CCGCCAGCCT TGCCGATGAC GCGCTGGATG CCGCCCTCCC GCTGATTTCA
GCCGGGGGCG ATGAAGCTGC CATTCTCGCT GCCATGCAGG GTGCGGTCTT TGCAGGTGGC
GGTGATTATC CGGCTAATGA ATTCATTATC GGCTCCGGCC AGGATGCGCT GCTGTGCCGC
TACAAGGCAG GCCGCCGGAC GCTGTCGGCC AATGACCAGC TGACACTGGA ATGGGCCGGT
GCTTCGGCCC ATTACCATGC CGCGATGATG CGCACCGTGC TGGTCGGCGA ACCATCGCCT
CGCCACCGCG AGCTTTATGC CGCCTGCCGG GAGGCGATCC AGGAAATCGA AACCGTGCTG
CGGCCCGGCC ATACATTCGG CGACGTGTTC GAGACCCATG CCAGAGTGCT GGACGAACGA
GGCCTGACCC GCCACCGGCT AAATGCCTGC GGTTATTCAC TGGGCGCCCG CTTCTCCCCG
TCGTGGATGG AGCACCAGAT GTTCCATATC GGCAATCCGC AGGAGATCCT GCCCAATATG
TCGCTGTTCA TCCACATGAT CATCATGGAT TCCGAACGCG AGACCGCGAT GACGCTCGGC
CACACTTATC TCACCACCGA AGGCGCGCCA CGCGCGCTGT CGCGTCATCC GCTGGATCTG
ATCGTCAAGG CGTGA
 
Protein sequence
MTLHFSTAEY AARLDRLTDR MREQKLDAML LFAQESMYWL TGYDTFGYCF FQTLVVKADG 
SMTLLTRSAD LRQARQTSTI DNILIWVDRT NADPTSDLKD LLNDLDLLGC RLGIEYDTHG
MTGRVARLLD NQLLSFGELI DASMLVSELR LIKSPEEIAY VEKAASLADD ALDAALPLIS
AGGDEAAILA AMQGAVFAGG GDYPANEFII GSGQDALLCR YKAGRRTLSA NDQLTLEWAG
ASAHYHAAMM RTVLVGEPSP RHRELYAACR EAIQEIETVL RPGHTFGDVF ETHARVLDER
GLTRHRLNAC GYSLGARFSP SWMEHQMFHI GNPQEILPNM SLFIHMIIMD SERETAMTLG
HTYLTTEGAP RALSRHPLDL IVKA