Gene Avi_3702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_3702 
SymbolpepF 
ID7388176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3066232 
End bp3068091 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content60% 
IMG OID643652491 
Productoligoendopeptidase F 
Protein accessionYP_002550673 
Protein GI222149716 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTCA GCCCCGTTAA CCAGACCCGC CTTCCGGCAG CCGAATCCGC CAGCGTACCA 
GCGCACGGCC AAGGCCTCGG CACGCTGCCC GAATGGCAGC TCACCGATCT TTATCCGGCC
CCCTCCTCGG ATCTCTTCAA GGCCGACCTC GCCAAGGCCA GCGAAATGAG CCTGGCCTTC
GAGACCAAAT GGAAGGGGCG GCTGGAAGAT GCCGCCGCCA AGGACGCGGA TCAGGGCCTG
GGAGCCGCGC TGAAGGAATT CGAAGAACTC GAAGACCTAC TCGGCAAGAT CGGCTCCTAT
GCCGGTCTCT ATTATTATAG TGAGATGACC AAGCCGGAAA ACGGCAAATT CTTCGGCGAT
GTACAAGCCA GGCTGACGGA GCTTGCCGCC CATCTGCTGT TTTTCACGCT GGAGCTGAAT
CGGCTGGACG ATGCGGTGAT CGATGCTGCC ATTGCCCGCG ATCCGGCCAC CGCCCATTAC
AAGCCCTGGC TGATTGATCT CAGGCAAGAC AAGCCCTACC AGCTGGATGA CAAGCTCGAG
CAATTGTTTC TGGAAAAATC CCAGACCGGT TCGGCGGCCT TCAACCGGTT GTTCGACGAG
ACCCTGGCAA GCCTGCGGTT TGAGATTGAC GGCGAACAGC TGACGCTGGA ACCGGTTCTG
ACCATGTTGC AGGAGGCCGA TCCGGCCTTG CGCGAAAAGG CGGCCATGGC GCTGTCGAAG
ACCTTCAAGG ACAATATCCG GATTTTCGTG CTGGTCACCA ATACCTTGGC CAAGGACAAG
GAAATTTCCG ATCGCTGGCG CGGCTTTGCC GACATTGCCG ATAGCCGCCA CCTGTCCAAC
CGGGTGGAGC GTCCGGTGGT CGATGCGCTG GCGGCTGCGG TGCGCGATGC CTATCCGCGC
CTGTCGCACC GCTATTACAA GATGAAAGCC AAGTGGCTGG GCATGGAGCA GATGAATTTC
TGGGACCGCA ACGCCCCTCT GCCCGACAGT ATCGACCGGA TCATTCCCTG GGACGAAGCC
CGCCAGACCG TGCTGTCGGC CTATGGCGGC TTTGCGCCTG ATATGGCCGA AATCGCTGGT
CGCTTTTTTG ATGGCGGCTG GATCGATGCG CCCGCCCGCC CTGGCAAGGC GCCGGGCGCC
TTTGCCCATC CGACCGTGCC GTCTGCCCAT CCCTATGTTT TGGTCAATTA CCTCGGCAAG
CCGCGCGACG TGATGACGCT GGCCCATGAA CTGGGCCACG GCGTGCATCA GGTTCTCGCT
GGCGAACAGG GCGCGCTGAT GTGCCAGACG CCGCTGACGC TGGCCGAGAC CGCCTCGGTA
TTCGGCGAAA TGCTGACCTT CCGGGCGCTT CTGGAAAAGG CCACGGATGC GCGTGAGCGC
AAGGCCATGC TGGCCCAGAA AGTCGAGGAC ATGATCAACA CGGTCGTGCG CCAGATCGCT
TTCTACGAAT TCGAGCGCAA GCTGCACACC GCCCGCAAGG AGGGCGAGTT GACGGCGGAA
AAGATTGGCG AACTGTGGCT ATCGGTGCAG GAAGAGAGCC TTGGACCGGC CATCAAAGTG
TCCGAGGGCT ATGAGACCTG GTGGGCCTAT ATCCCCCATT TCATCCATTC GCCTTTCTAT
GTCTATGCCT ATGCCTTCGG CGATTGCCTG GTCAATTCGC TCTATGCCGT CTACCAGAAT
GCCGAACAGG GCTTCCAGCA GAAGTATTTC GACCTGTTGA AGGCTGGCGG CAGCAAGCAT
CATTCCGAAC TTCTCGCACC GTTCGGTCTG GATGCCACCG ACCCGTCCTT CTGGGCCAAG
GGTCTATCGA TGATCGAAGG GTTGATCGAC GAGCTGGAAG CGCTTGACGC CAAAGCCTGA
 
Protein sequence
MPFSPVNQTR LPAAESASVP AHGQGLGTLP EWQLTDLYPA PSSDLFKADL AKASEMSLAF 
ETKWKGRLED AAAKDADQGL GAALKEFEEL EDLLGKIGSY AGLYYYSEMT KPENGKFFGD
VQARLTELAA HLLFFTLELN RLDDAVIDAA IARDPATAHY KPWLIDLRQD KPYQLDDKLE
QLFLEKSQTG SAAFNRLFDE TLASLRFEID GEQLTLEPVL TMLQEADPAL REKAAMALSK
TFKDNIRIFV LVTNTLAKDK EISDRWRGFA DIADSRHLSN RVERPVVDAL AAAVRDAYPR
LSHRYYKMKA KWLGMEQMNF WDRNAPLPDS IDRIIPWDEA RQTVLSAYGG FAPDMAEIAG
RFFDGGWIDA PARPGKAPGA FAHPTVPSAH PYVLVNYLGK PRDVMTLAHE LGHGVHQVLA
GEQGALMCQT PLTLAETASV FGEMLTFRAL LEKATDARER KAMLAQKVED MINTVVRQIA
FYEFERKLHT ARKEGELTAE KIGELWLSVQ EESLGPAIKV SEGYETWWAY IPHFIHSPFY
VYAYAFGDCL VNSLYAVYQN AEQGFQQKYF DLLKAGGSKH HSELLAPFGL DATDPSFWAK
GLSMIEGLID ELEALDAKA