Gene Information |
Locus tag | Dshi_3415 |
Symbol | gcp |
ID | 5712473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3593049 |
End bp | 3594119 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641269344 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001534749 |
Protein GI | 159045955 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
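The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it reproduces the 1071 bp gene length listed in the record) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(3593049, 3594119)
print(length, protein_length_aa(length))  # prints "1071 356"
```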
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCCCCT CTCTGACCGT TTTGGGTATC GAAAGCAGCT GCGATGACAC CGCGGCGGCG GTCTTGCGCG GACCCGAGGT GCTGTCTTCG GTCGTTTATG GCCAGACGGC GCTGCATGCG GCTTTCGGTG GCGTCGTGCC CGAGTTGGCC GCACGCGCAC ATGTCGAGAA GCTGGATATC GCCGTCGCGG CGGCCCTGTC GGAGGCGGTT CTGGCGCTCG ATCAGATCGA TGTGATCGCG GTGACCGCGG GGCCGGGGCT GATCGGTGGG GTACTGTCGG GCGTCATGTT GGCCAAGGGA CTTTCAGCGG CGTCGGGCGT GCCGCTCATC GGGGTGAACC ACCTCGCGGG TCATGCGCTG ACGCCGCGAT TCACCGATGG GCTGGCGTTT CCCTACCTGA TGCTGCTGGT CTCTGGGGGT CATTGCCAGT TTCTGCGCGT GGAAGGGCCA GAGGCATTCC ATCGCTTGGG CGGCACCATT GATGACGCGC CGGGGGAGGC TTTCGACAAG ACTGCCAAGC TCCTCGGCCT GCCACAACCG GGGGGGCCTG CCGTCGAGGC GGAGGCCCGG GCGGGCGATC CCGCGCGTTT TGTCTTCCCA CGGCCACTGC TGGACCGGGC TGGGTGCGAC ATGTCCTTTT CCGGGTTGAA GACGGCCCTT CTGCGGGCTC GGGACGGTCT GGTGTCGGCG GGCGGCGGCC TGACAGCGCA GGATCGGGCC GATCTCTGCG CCGGGTTCCA GGCGGCGATC TGTGACGTAC TGGTGGAAAA ATCGCGACGC GCCCTGACCC AGTCCGAAGG CGTGACCGGC TTCGCGGTGG CTGGCGGCGT GGCGGCAAAT GAGCAGGTTC GGTCCGGCTT GGCCCGGTTG GCTGCGGAAC TGGATGCTCC GTTTGTCGCA CCGCCGCTGC GGTATTGCAC CGATAATGCG GCGATGATCG CCTGGGCTGG GCAGGAGGCG TTCTCTGCTG GTGCGCGCTC TGGTCTGGAT CTGTCGGCGC GCCCGCGTTG GCCGCTGGAT AACAGCCAGC CTGCGCTCCT GGGTTCAGGC AAGAAGGGCG CCAAGGCGTG A
Protein sequence | MTPSLTVLGI ESSCDDTAAA VLRGPEVLSS VVYGQTALHA AFGGVVPELA ARAHVEKLDI AVAAALSEAV LALDQIDVIA VTAGPGLIGG VLSGVMLAKG LSAASGVPLI GVNHLAGHAL TPRFTDGLAF PYLMLLVSGG HCQFLRVEGP EAFHRLGGTI DDAPGEAFDK TAKLLGLPQP GGPAVEAEAR AGDPARFVFP RPLLDRAGCD MSFSGLKTAL LRARDGLVSA GGGLTAQDRA DLCAGFQAAI CDVLVEKSRR ALTQSEGVTG FAVAGGVAAN EQVRSGLARL AAELDAPFVA PPLRYCTDNA AMIAWAGQEA FSAGARSGLD LSARPRWPLD NSQPALLGSG KKGAKA
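The gene sequence as listed starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, the following minimal sketch shows how the space-separated blocks could be translated and how a GC fraction could be computed. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code for non-initiating codons.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGACCCCCT CTCTGACCGT TTTGGGTATC"))  # prints "MTPSLTVLGI"
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions would allow the complete translation and the GC content field to be compared in the same way.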