Gene Dshi_3415 details

Gene Information

Locus tag: Dshi_3415
Symbol: gcp
ID: 5712473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3593049
End bp: 3594119
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 67%
IMG OID: 641269344
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_001534749
Protein GI: 159045955
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
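
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS: codon count minus one stop codon (assumed)."""
    return gene_len_bp // 3 - 1


length = gene_length_bp(3593049, 3594119)
print(length, protein_length_aa(length))  # 1071 356
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields of the record.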


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCCT CTCTGACCGT TTTGGGTATC GAAAGCAGCT GCGATGACAC CGCGGCGGCG 
GTCTTGCGCG GACCCGAGGT GCTGTCTTCG GTCGTTTATG GCCAGACGGC GCTGCATGCG
GCTTTCGGTG GCGTCGTGCC CGAGTTGGCC GCACGCGCAC ATGTCGAGAA GCTGGATATC
GCCGTCGCGG CGGCCCTGTC GGAGGCGGTT CTGGCGCTCG ATCAGATCGA TGTGATCGCG
GTGACCGCGG GGCCGGGGCT GATCGGTGGG GTACTGTCGG GCGTCATGTT GGCCAAGGGA
CTTTCAGCGG CGTCGGGCGT GCCGCTCATC GGGGTGAACC ACCTCGCGGG TCATGCGCTG
ACGCCGCGAT TCACCGATGG GCTGGCGTTT CCCTACCTGA TGCTGCTGGT CTCTGGGGGT
CATTGCCAGT TTCTGCGCGT GGAAGGGCCA GAGGCATTCC ATCGCTTGGG CGGCACCATT
GATGACGCGC CGGGGGAGGC TTTCGACAAG ACTGCCAAGC TCCTCGGCCT GCCACAACCG
GGGGGGCCTG CCGTCGAGGC GGAGGCCCGG GCGGGCGATC CCGCGCGTTT TGTCTTCCCA
CGGCCACTGC TGGACCGGGC TGGGTGCGAC ATGTCCTTTT CCGGGTTGAA GACGGCCCTT
CTGCGGGCTC GGGACGGTCT GGTGTCGGCG GGCGGCGGCC TGACAGCGCA GGATCGGGCC
GATCTCTGCG CCGGGTTCCA GGCGGCGATC TGTGACGTAC TGGTGGAAAA ATCGCGACGC
GCCCTGACCC AGTCCGAAGG CGTGACCGGC TTCGCGGTGG CTGGCGGCGT GGCGGCAAAT
GAGCAGGTTC GGTCCGGCTT GGCCCGGTTG GCTGCGGAAC TGGATGCTCC GTTTGTCGCA
CCGCCGCTGC GGTATTGCAC CGATAATGCG GCGATGATCG CCTGGGCTGG GCAGGAGGCG
TTCTCTGCTG GTGCGCGCTC TGGTCTGGAT CTGTCGGCGC GCCCGCGTTG GCCGCTGGAT
AACAGCCAGC CTGCGCTCCT GGGTTCAGGC AAGAAGGGCG CCAAGGCGTG A
 
Protein sequence
MTPSLTVLGI ESSCDDTAAA VLRGPEVLSS VVYGQTALHA AFGGVVPELA ARAHVEKLDI 
AVAAALSEAV LALDQIDVIA VTAGPGLIGG VLSGVMLAKG LSAASGVPLI GVNHLAGHAL
TPRFTDGLAF PYLMLLVSGG HCQFLRVEGP EAFHRLGGTI DDAPGEAFDK TAKLLGLPQP
GGPAVEAEAR AGDPARFVFP RPLLDRAGCD MSFSGLKTAL LRARDGLVSA GGGLTAQDRA
DLCAGFQAAI CDVLVEKSRR ALTQSEGVTG FAVAGGVAAN EQVRSGLARL AAELDAPFVA
PPLRYCTDNA AMIAWAGQEA FSAGARSGLD LSARPRWPLD NSQPALLGSG KKGAKA
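
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments (the two tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
# Assumption: table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, in the 10-base blocks used by the record.
first_block = "ATGACCCCCT CTCTGACCGT TTTGGGTATC GAAAGCAGCT GCGATGACAC CGCGGCGGCG"
print(translate(first_block))  # MTPSLTVLGIESSCDDTAAA
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the whole protein, provided the sequence contains only the letters A, C, G, and T.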