Gene Information |
Locus tag | Avi_5916 |
Symbol | |
ID | 7381007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 924721 |
End bp | 925755 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643649429 |
Product | dipeptidase |
Protein accession | YP_002547666 |
Protein GI | 222106875 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
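The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(924721, 925755)
print(length)                     # 1035
print(protein_length_aa(length))  # 344
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.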
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAGA CCGTTCCCGT TTTCGATGGC CATAATGATG TGCTGCTGCG GCTGCGCCGT TCTGGCCGTC CCCATCCCTA CCAGGATTTT TTGCAGGGCG GCGAGGCCGG GCATATTGAT TTGCCCAAGG CGCGCCAAGG GGGACTGGCC GGTGGACTTT GTGCCGTATT CATTCCCTCG CCAAGCTTCA AACCTGATGA CAACGGGGAT TTCCAGGCCC CGGCCCAGCC GGAAGCGCTG AATGAGACGT TGGCCATGGC CCGGCTGCTG TTCGAAATCG AGGCGAATTC GGCTGGTCGC GTCAAGGTCT GCCGCAGTGC CGCCGATATA CGGCATTCTC TCGCCAATGA AATCTTTGCG GCGGTTTTTC ACATCGAAGG CGTTGAGGCC ATCGCTGCCG ATCTCGATGC CCTTTATGTG CTGCATCAGG CGGGGCTCCG CTCGCTCGGT CCTGTCTGGA GCCGGCCGAA TATCTTTGCC CATGGCGTTC CCTTCCGCTT TCCATCCTCC TCGGATATCG GCCCCGGTCT GACGGATGCT GGCAAGGACC TGATCCGCGC CTGCAATGAG TTGAAGATCA TGGTCGATCT CTCTCATATG AATGAACAGG GCTTCTGGGA TATCGCCGGT CTTTCCAATG CACCACTTGT CGCTTCCCAT TCCAACGCCC ACGCGCTTTG CCCTCATAGC CGCAACCTCA CCGACAGGCA GTTGGATGCG ATCCGTGACA GTGGTGGTTT GGTGGGTATC AATTTCGGGG TGATTTTCCT GCGCGAAGAC GGTCAGCGCA ATCTCGATAC ACCGCTGGAT GTGCTCATCG ACCATATCGA TTATATCGTG ACGCGCATCG GCATCGACCA TGTCGCACTC GGCTCAGACT TCGATGGCAC AACCGTTCCC GCCGCCCTCA AGGACGCAAC TGGTCTTCCC TTGATCGTCC AGGGGCTGGA AGCGCGGGGC TATGATGCGA CCTCGATTGC CAAGATCTGC CATGGCAATT GGATTTCGGT GCTGGAGCGA ACCTGGGGTG CTTAG
|
Protein sequence | MTQTVPVFDG HNDVLLRLRR SGRPHPYQDF LQGGEAGHID LPKARQGGLA GGLCAVFIPS PSFKPDDNGD FQAPAQPEAL NETLAMARLL FEIEANSAGR VKVCRSAADI RHSLANEIFA AVFHIEGVEA IAADLDALYV LHQAGLRSLG PVWSRPNIFA HGVPFRFPSS SDIGPGLTDA GKDLIRACNE LKIMVDLSHM NEQGFWDIAG LSNAPLVASH SNAHALCPHS RNLTDRQLDA IRDSGGLVGI NFGVIFLRED GQRNLDTPLD VLIDHIDYIV TRIGIDHVAL GSDFDGTTVP AALKDATGLP LIVQGLEARG YDATSIAKIC HGNWISVLER TWGA
|
| |
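The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the pipeline that produced the record. It assumes the sequence is copied with its space-separated blocks of ten bases, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGACACAGA CCGTTCCCGT TTTCGATGGC"
print(translate(prefix))  # MTQTVPVFDG
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison against the Protein sequence and GC content fields of the record.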