Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2973 |
Symbol | dcp |
ID | 5710824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3139524 |
End bp | 3141545 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641268899 |
Product | peptidyl-dipeptidase |
Protein accession | YP_001534307 |
Protein GI | 159045513 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACC CGCTTCTCGA CACCTGGACC CCGCCCTACG GCCTGCCGCC CTTCGACCGG ATCGAGGATG CCCATTTCGC GCCGGGGCTG GAGGCGGCGC TGACCGAGGC GCGGGCCGAG ATCGCGGCGA TCGCCGGGTC GGCCGAGGCC CCGACATTCG ACAACACGAT CCGCCCGCTG GAGGCCGGGG CGCGCAAGCT GGGGCAGGTG GTGCGGGTGT TCTACCACGT CGCCGCCACG GACAGCACGC CCGCGCGCGA GGCGCTGCAG AAGGACTTCA GCGCCAAGCT GAGCGCCTAT AATTCCGAAG TGATCTCCAA CGCGGCGCTC TTTGCCCGGA TCGCGGCGGT CTGGGAGGGG CGCGAGGCCC TCGGGGCCGA AGAGGCGCGG GTTGCGGAGC TGTATTACAA AGACTTCACC CGGGCGGGCG CGGCCCTCAC CGGGGCCGAC AAGGACCGGA TGACGGAGAT CAAGGCGCGG CTGGCGATGC TGGGGACGGA GTTTACCCAG AACCTGCTGG CGGATGAGCG CGATTGGGTG ATGCCGCTGG CCGACGCCGA TCTGGAGGGG CTGCCGGAGT TCGTCGTCGC CACGGCGCGC GCGGCGGCCG AGGAGCGCGG GATGGAGGGG CATGTCGTGA CCCTGTCGCG GTCTCTGATC GTGCCGTTTT TGCAATTCAG CCCGCGCCGG GATTTGCGCG AGAAGGCCTA TGAGGCCTGG GTCGCGCGGG GCGAGCATGA TGGCCCCACG GACAATCGCG GCATCGCGGC AGAGGTTCTG GCCCTGCGGG AGGAGCGCGC CAAACTGCTC GGCTACGACA GCTTCGCGGA CTACAAGCTG GAGCCGGAGA TGGCCAAGAC CCCGGCGGCG GTGCGCGATC TGCTGATGGC GGTCTGGGCC CCGGCCAAGG CGGCGGCGGA GGCCGATGCC GAGGTGCTGA CCGCGATGAT GCAGGAGGAT GGGGTCAACG GGCCGTTGGA GGCCTGGGAC TGGCGCTATT ACTCCGAAAA GCGGCGGCAG GCGGAGCATG ACCTGGATGA GGCGGAGCTG AAGCCGTACC TGCAACTCGA CAAGATGATC GAGGCGGCGT TCGATTGTGC CGCGCGCCTG TTCGGGCTGT CCTTTGCCCC CATCGATGCA CCGCTGGCCC ACCCGGACGC ACGCGCCTGG GAGATCCGGC GCGGCGAGCG GCTGATGGCG GTGTTCGTGG GGGATTACTT CGCACGGGCG GGCAAGCGGT CGGGGGCCTG GTGCGGGTCG CTGCGCGCGC AGCACAAGCT CGACGGGGAT ACCCGCGCGA TTGTCACCAA TGTGTGCAAC TTCGCCAAGC CGGCCAAGGG GCAGCCCGCG CTGCTGTCGT TCGACGATGC GCGGACGCTG TTTCATGAGT TCGGCCATGC GCTGCATCAT ATCCTGTCGG ACGTGACCTA TCCGATGATC TCGGGCACCT CGGTGGCGCG GGACTTCGTC GAACTGCCGA GCCAGCTTTA CGAGCATTGG CTGGAGGTGC CCGAGGTGCT GCGCGCCTTC GCGGTCCATG CGGAGACCGG CGCGCCGATG CCCGCCGACC TGCTGGCGCG GATGCTGGCC GCGGCAACCT ATGACATGGG GTTCCAGACG GTGGAATATG TGGCCTCGGC CCTGGTCGAC CTGGATTTCC ACGAGGGCGC GGCCCCCGCG GACCCAATGG CGCGGCAGGC GGAGGTGCTG GCCAAGCTGG GCATGCCCCA CGCGATCCGG ATGCGCCACG CAACGCCCCA TTTCGCCCAT GTGTTCGCGG GCGACGGCTA TTCTTCGGGG TATTACAGCT ACATGTGGTC CGAGGTGATG GATGCCGATG CCTTCGCGGC CTTCGAGGAG GCCGGCAGCG CCTTCGACCC CGACACCGCC GCCAAGCTGG AGGCGCATAT CCTGTCGGCG GGCGGATCGG CGGAGGCAGA CGGGCTATAC CGCGCGTTCC GCGGCCGGAT GCCGGGGGTC GAGGCCCTGC TCAAGGGACG GGGGCTGGAC AAGGCCGCGT GA
|
Protein sequence | MTNPLLDTWT PPYGLPPFDR IEDAHFAPGL EAALTEARAE IAAIAGSAEA PTFDNTIRPL EAGARKLGQV VRVFYHVAAT DSTPAREALQ KDFSAKLSAY NSEVISNAAL FARIAAVWEG REALGAEEAR VAELYYKDFT RAGAALTGAD KDRMTEIKAR LAMLGTEFTQ NLLADERDWV MPLADADLEG LPEFVVATAR AAAEERGMEG HVVTLSRSLI VPFLQFSPRR DLREKAYEAW VARGEHDGPT DNRGIAAEVL ALREERAKLL GYDSFADYKL EPEMAKTPAA VRDLLMAVWA PAKAAAEADA EVLTAMMQED GVNGPLEAWD WRYYSEKRRQ AEHDLDEAEL KPYLQLDKMI EAAFDCAARL FGLSFAPIDA PLAHPDARAW EIRRGERLMA VFVGDYFARA GKRSGAWCGS LRAQHKLDGD TRAIVTNVCN FAKPAKGQPA LLSFDDARTL FHEFGHALHH ILSDVTYPMI SGTSVARDFV ELPSQLYEHW LEVPEVLRAF AVHAETGAPM PADLLARMLA AATYDMGFQT VEYVASALVD LDFHEGAAPA DPMARQAEVL AKLGMPHAIR MRHATPHFAH VFAGDGYSSG YYSYMWSEVM DADAFAAFEE AGSAFDPDTA AKLEAHILSA GGSAEADGLY RAFRGRMPGV EALLKGRGLD KAA
|
| |