Gene Sare_4241 details

Gene Information

Locus tag: Sare_4241
Symbol:
ID: 5708091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4814232
End bp: 4815296
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 71%
IMG OID: 641273660
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_001539013
Protein GI: 159039760
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
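
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 4814232
end_bp = 4815296
gene_length_bp = 1065
protein_length_aa = 354

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons = span // 3

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = codons - 1

coords_consistent = (
    span == gene_length_bp
    and span % 3 == 0
    and expected_protein_length == protein_length_aa
)

print(span, codons, expected_protein_length, coords_consistent)
# prints: 1065 355 354 True
```

Under those assumptions the reported gene length (1065 bp) and protein length (354 aa) agree with the coordinates.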


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0825162
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0152473
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACG AACCCCTGGT CCTGGGAATC GAGACGTCCT GCGACGAGAC CGGGGTCGGT 
GTCGTCCAGG GCCACACCCT GCTCGCCGAC GCGTTGGCCA GCAGCGTCGA GCAGCATGCC
CGGTTCGGCG GTGTGGTGCC CGAGGTGGCC AGCCGGGCAC ACCTGGAGGC GATCGTGCCG
ACCATGGACC GGGCGTTGGC GGAGGCGGGG GTGACGCTCG CCGATGTCGA CGCCATCGCG
GTAACCTCCG GCCCCGGGCT GGCCGGCGCG CTGTTGGTCG GCGTCGCCGC GGCCAAGGGG
TACGCGGTCG CCGCCGAGAA GCCGGTCTAC GGCGTGAACC ATCTCGCGGC GCATGTCGCC
GTGGACACCC TGGAACACGG GCCGCTGCCG GAACCGGCGA TTGCCCTGCT GGTCTCGGGC
GGGCACTCGT CGCTGCTACT GGTCGACGAC CTGGCCCACG GTGTCACCCC GCTCGGCGCC
ACGATCGACG ACGCGGCCGG CGAGGCGTTC GACAAGGTCG CCCGGCTGCT CGGGCTGCCC
TTCCCGGGCG GCCCGTACAT TGACCGGGAG GCTCGGGCCG GTGACCGGGC GGCCATCGCG
TTTCCACGCG GGCTGACCGC GGCAAAGGAC CAGGCGGCGC ACCGCTACGA CTTCTCCTTC
TCCGGGTTGA AGACCGCGGT GGCGCGTTGG GTGGAGAGTC GGCAGCGGGC CGGTGAGGTG
GTGCCGGTTG CCGATGTCGC CGCTTCCTTC CAGGAGGCGG TTTGTGACGT ACTGGTCGGG
AAGGCACTGG ACGCCTGCCG GTCGAGTGGG ATACAGACCC TCGTGATCGG CGGCGGAGTG
GCGGCCAACT CGCGGCTGCG GGCGATGGCC GAGCAGCGCG CGGCGACGTA CGACGTCCAG
GTACGAACAC CCCGACCGAC GTTGTGTACG GACAACGGCG CGATGGTCGC CGCACTCGGC
TCGCACCTGG TCGCCGCCGG TGTCGCGCCG AGCAGCCTGG ACCTACCCGC CGATTCGGCG
ATGCCACTGA CGACGGTCAG TGTTACAGGG GAGGAGCGGA CATGA
 
Protein sequence
MADEPLVLGI ETSCDETGVG VVQGHTLLAD ALASSVEQHA RFGGVVPEVA SRAHLEAIVP 
TMDRALAEAG VTLADVDAIA VTSGPGLAGA LLVGVAAAKG YAVAAEKPVY GVNHLAAHVA
VDTLEHGPLP EPAIALLVSG GHSSLLLVDD LAHGVTPLGA TIDDAAGEAF DKVARLLGLP
FPGGPYIDRE ARAGDRAAIA FPRGLTAAKD QAAHRYDFSF SGLKTAVARW VESRQRAGEV
VPVADVAASF QEAVCDVLVG KALDACRSSG IQTLVIGGGV AANSRLRAMA EQRAATYDVQ
VRTPRPTLCT DNGAMVAALG SHLVAAGVAP SSLDLPADSA MPLTTVSVTG EERT
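
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method: it uses only the Python standard library, builds the codon table from the amino-acid string that NCBI lists for translation table 11, and reads sequences in the 10-base blocks shown above. Table 11 also allows alternative start codons, which this sketch ignores because the gene begins with ATG. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon order used by NCBI: first base varies slowest, third base fastest, over T, C, A, G.
BASES = "TCAG"
# Amino-acid assignments for translation table 11 ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence and its final 15 bases.
first_blocks = "ATGGCTGACG AACCCCTGGT CCTGGGAATC"
last_codons = "GAGGAGCGGA CATGA"

print(translate(first_blocks))  # prints: MADEPLVLGI
print(translate(last_codons))   # prints: EERT
```

The two fragments translate to the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.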