Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2721 |
Symbol | |
ID | 5713620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2882651 |
End bp | 2884507 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641268646 |
Product | putative metallopeptidase |
Protein accession | YP_001534055 |
Protein GI | 159045261 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0993196 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTTT CTGCCCGCAA CGCTAGATTG TACCGGCAAG AAACAACCAA AGCACCCATG TTCCAATCTT TTTCCGCCAC CACCACCCCC GATCAGGGGC CACCCCGTCT TGCGGCCCTG CGCGCCGAAA TGGCCGCGGA GGAGCTCGCG GGTTTTCTCG TGCCGCGCGC CGACGCGCAT CAGGGAGAAT ACGTGGCCCC GCGCGACGAC AGGCTTGCGT GGCTCACGGG GTTCACCGGC TCCGCGGGGT TCTGCATCGC CCTGGCCGGG ACCGCAGGGA TCTTCATCGA CGGGCGCTAC ACCCTGCAGG TCCGCGCGCA GGTCGACAAC GGGGCATTCA CCCCGGTTCC CTGGCCCAAG ACGCAACCGG GCCCCTGGCT GCGCGAAGCG CTTCCCACCG GGGTGATCGG GTTCGACCCC TGGCTGCATA CCAATGCCGA GATCGCGCGG CTGGAGGCGA GCCTCGGCGA CGCGCTGTCC TTGCGCCGCA CGGACAACCT GATCGACAGG ATCTGGCCGG ATCAGCCCGC GCCACCCCAA GGCGCGGTCA TCGTCCATCC CGATAGCCTG GCCGGTCGCA GCAGCGCCGA GAAGCGCAGG TCCCTCGCGC AGCACCTGAC CGAGTCCGGG GCAAAATCCG TGGTCCTCAC CCTGCCCGAC AGCCTGTGCT GGCTGCTCAA CATCCGCGGC GCGGATATCC CACGCAACCC GGTGGTCCAT GCCTTCGCGG TCCTACATGA CGATGCAAGC TGCGATCTCT TCATCGATCC GGCCAAGCTC GATGACGATC TGCGCGCCCA TCTCGGGCCC GAGATCCGCT GCCACCCGCC GCACGACCTG GCCGCAGCCC TCGGCGCGCT GGCCGGTCCG GTCCAGGTCG ACCCGAACAC CGCGCCTGTC GCGATCTTCG ACCTGATGGC CGCCCAGGAC ACCCCGGTGA TCGAGGCCGA CGACCCCTGC ATCCTGCCCA AGGCCTGCAA GACCGCGGCC GAGATCGCGG GCACCACCGA GGCGCACCTG CGCGACGGGG CAGCGGTTGT CGAGTTCCTC ACCTGGTTCT CGGGTCAAAA CCCCGCGGAG CTGACCGAAA TCGACGTGGT CATGGCGCTC GAAGCCGCCC GGCAGGCCAC GGGCGCCCTG CGCGACATCA GTTTCGAGAC GATCTGCGGC ACTGGCCCGA ACGGCGCCAT CGTCCATTAC CGCGTGACCG AAGGCACCAA CCGGCGGATC ACCCCCGGCG ATCTGCTGCT GATCGACAGC GGTGGCCAGT ATGCGGACGG GACGACCGAC ATCACCCGCA CGCTGGCCAC AGGCACCCCG CCGGAGGGCG CCAGAGCCGC CTTCACACGG GTCCTGCAGG GCATGATCGC CATCAGCCGC GCGCGCTGGC CCAAGGGGTT GGCAGGTCGC GACCTGGACG CGCTGGCCCG CGCCCCGCTG TGGATGGCCG GGCAAGATTA CGACCACGGC ACCGGGCACG GTGTGGGCAC CTATCTGTGC GTCCACGAAG GCCCGCAGCG GCTCAGCCGG ATCAGCGAAG TGCCCCTCGA GTCGGGCATG ATCCTCAGCA ACGAGCCCGG CTATTATCGC GAAGGCGCCT TCGGCATCCG GCTAGAGAAC CTCGTCGTCG TCACGCAGGC CGACCCGCCC GAGGGCGGCG ATCCGCAACG CGAGATGTTG CGCTTCGACA CCCTGACTTA CGTCCCGCTC GAGACCGCCC TCATCGACAC CGCGATGCTG TCGCAGGCCG AGATCGACTG GATCGACACC TATCACGCGG AAACCCGCCA GCGCCTCCGG GACCGGCTGA CGCCCGAGGC GCGTCGCTGG CTGGACAGGG CAACGCGCCC GCTGTGA
|
Protein sequence | MAVSARNARL YRQETTKAPM FQSFSATTTP DQGPPRLAAL RAEMAAEELA GFLVPRADAH QGEYVAPRDD RLAWLTGFTG SAGFCIALAG TAGIFIDGRY TLQVRAQVDN GAFTPVPWPK TQPGPWLREA LPTGVIGFDP WLHTNAEIAR LEASLGDALS LRRTDNLIDR IWPDQPAPPQ GAVIVHPDSL AGRSSAEKRR SLAQHLTESG AKSVVLTLPD SLCWLLNIRG ADIPRNPVVH AFAVLHDDAS CDLFIDPAKL DDDLRAHLGP EIRCHPPHDL AAALGALAGP VQVDPNTAPV AIFDLMAAQD TPVIEADDPC ILPKACKTAA EIAGTTEAHL RDGAAVVEFL TWFSGQNPAE LTEIDVVMAL EAARQATGAL RDISFETICG TGPNGAIVHY RVTEGTNRRI TPGDLLLIDS GGQYADGTTD ITRTLATGTP PEGARAAFTR VLQGMIAISR ARWPKGLAGR DLDALARAPL WMAGQDYDHG TGHGVGTYLC VHEGPQRLSR ISEVPLESGM ILSNEPGYYR EGAFGIRLEN LVVVTQADPP EGGDPQREML RFDTLTYVPL ETALIDTAML SQAEIDWIDT YHAETRQRLR DRLTPEARRW LDRATRPL
|
| |