Gene Information |
Locus tag | Avi_3174 |
Symbol | pepQ |
ID | 7388117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 2629505 |
End bp | 2630659 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643652101 |
Product | proline dipeptidase |
Protein accession | YP_002550285 |
Protein GI | 222149328 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
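The length fields above are consistent with the coordinates, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values copied from the record above (Avi_3174, replicon NC_011989).
length_bp = gene_length(2629505, 2630659)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's coordinates this reproduces the listed gene length of 1155 bp and protein length of 384 aa.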
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTGC ATTTTTCAAC TGCCGAATAC GCCGCCCGGC TGGACCGCCT GACTGACAGG ATGCGCGAAC AGAAGCTGGA TGCCATGCTA CTGTTTGCCC AGGAAAGCAT GTATTGGCTG ACCGGCTATG ACACGTTTGG CTATTGCTTC TTCCAGACAC TGGTGGTGAA GGCCGATGGC TCGATGACGC TTCTGACCCG TTCGGCGGAT CTGCGCCAGG CCCGCCAGAC CTCAACCATC GATAATATCC TGATCTGGGT TGACCGTACC AACGCCGATC CGACCAGCGA CCTGAAGGAT CTGCTCAACG ATCTCGACCT GCTCGGCTGC CGTCTTGGCA TCGAATACGA CACCCATGGC ATGACCGGAC GGGTCGCTCG GCTGCTGGAC AATCAATTGC TGAGCTTCGG TGAATTGATC GACGCCTCGA TGCTGGTCAG CGAATTGCGG CTGATCAAGA GCCCAGAGGA AATCGCTTAT GTCGAAAAGG CCGCCAGCCT TGCCGATGAC GCGCTGGATG CCGCCCTCCC GCTGATTTCA GCCGGGGGCG ATGAAGCTGC CATTCTCGCT GCCATGCAGG GTGCGGTCTT TGCAGGTGGC GGTGATTATC CGGCTAATGA ATTCATTATC GGCTCCGGCC AGGATGCGCT GCTGTGCCGC TACAAGGCAG GCCGCCGGAC GCTGTCGGCC AATGACCAGC TGACACTGGA ATGGGCCGGT GCTTCGGCCC ATTACCATGC CGCGATGATG CGCACCGTGC TGGTCGGCGA ACCATCGCCT CGCCACCGCG AGCTTTATGC CGCCTGCCGG GAGGCGATCC AGGAAATCGA AACCGTGCTG CGGCCCGGCC ATACATTCGG CGACGTGTTC GAGACCCATG CCAGAGTGCT GGACGAACGA GGCCTGACCC GCCACCGGCT AAATGCCTGC GGTTATTCAC TGGGCGCCCG CTTCTCCCCG TCGTGGATGG AGCACCAGAT GTTCCATATC GGCAATCCGC AGGAGATCCT GCCCAATATG TCGCTGTTCA TCCACATGAT CATCATGGAT TCCGAACGCG AGACCGCGAT GACGCTCGGC CACACTTATC TCACCACCGA AGGCGCGCCA CGCGCGCTGT CGCGTCATCC GCTGGATCTG ATCGTCAAGG CGTGA
|
Protein sequence | MTLHFSTAEY AARLDRLTDR MREQKLDAML LFAQESMYWL TGYDTFGYCF FQTLVVKADG SMTLLTRSAD LRQARQTSTI DNILIWVDRT NADPTSDLKD LLNDLDLLGC RLGIEYDTHG MTGRVARLLD NQLLSFGELI DASMLVSELR LIKSPEEIAY VEKAASLADD ALDAALPLIS AGGDEAAILA AMQGAVFAGG GDYPANEFII GSGQDALLCR YKAGRRTLSA NDQLTLEWAG ASAHYHAAMM RTVLVGEPSP RHRELYAACR EAIQEIETVL RPGHTFGDVF ETHARVLDER GLTRHRLNAC GYSLGARFSP SWMEHQMFHI GNPQEILPNM SLFIHMIIMD SERETAMTLG HTYLTTEGAP RALSRHPLDL IVKA