Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3382 |
Symbol | |
ID | 4075281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 397569 |
End bp | 398549 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638004890 |
Product | peptidase M19, renal dipeptidase |
Protein accession | YP_611616 |
Protein GI | 99078358 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.290781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCG ATGGTCTGCA ATACGCAAAC TGGTCGGAGA AGATCTTTCG CCAGCTTCGC GAAGGGGGCG TGGACGCGAT CCACGTCACC ATCGCCTATC ACGAGAACTT TCGTGAAACG GTTCTGAACT TTGAAAAATG GAATCGATGG TTTGAGCAAT ACCCGGACCT GATCATGAAG GGCCAGTGGG CGCAGGACAT CGACATCGCA CGCGAGACGG GCAGAACGGC CGTGTTTTTT GGGTTCCAGA ATCCCTCGCC GATCGAGGAT GACATCGGCC TGGTCGAGAT CCTTCATAGT CTCGGCGCCC GCTTCATGCA GCTGACCTAT AACAACCAGT CGCTACTGGC GACGGGCTGT TACGAGGCCG AAGACATGGG CCTGACCCGG ATGGGCAAAC AGGTTGTAAA GGAAATGAAC CGCGTCGGCC TCGTGATCGA CATGAGCCAT TCGTCGGATC GCTCCACCAT TGAGGCGGCG GAATATTCCA CACGCCCCAT CGCGATCACC CATGCCAATC CGCACGCATG GTCTCCTGCC CTGCGCAACA AGAAAGACGC GGTGATCCGC GCGGTTACCG AAAACGGCGG CATGTTCGGT TTTTCGGTCT ATCCGCACCA CCTGAGGGAC AAATCCGACT GCACGCTGGA GAGTTTCTGC GAAATGATCG CGCGCACTGC TGACACCTAT GGGGTGGAGC ATCTCGGAAT CGGCACTGAC CTTTGCCAGG ACCAGCCCGA CAGTGTCGTG GAATGGATGC GCGTCGGGCG TTGGACGAAA GAAATCGACT ACGGCGAAGG GTCCAAGTCC GCGCCCGGTT TCCCACGGAT GCCCAGCTGG TTCGAGGACA ACAGAGACTT TGAGAACATC GAGCAGGGGC TTCGGTCCGT TGGCATGACG ACCGGTGAAG TCGCCGCCAT CATGGGCGGC AACTGGTACC GCTTTTTTGC AGAAAGCTTT GGGCCCAAGG CGGGAGGCTA A
|
Protein sequence | MRIDGLQYAN WSEKIFRQLR EGGVDAIHVT IAYHENFRET VLNFEKWNRW FEQYPDLIMK GQWAQDIDIA RETGRTAVFF GFQNPSPIED DIGLVEILHS LGARFMQLTY NNQSLLATGC YEAEDMGLTR MGKQVVKEMN RVGLVIDMSH SSDRSTIEAA EYSTRPIAIT HANPHAWSPA LRNKKDAVIR AVTENGGMFG FSVYPHHLRD KSDCTLESFC EMIARTADTY GVEHLGIGTD LCQDQPDSVV EWMRVGRWTK EIDYGEGSKS APGFPRMPSW FEDNRDFENI EQGLRSVGMT TGEVAAIMGG NWYRFFAESF GPKAGG
|
| |