Gene Sare_3947 details


Gene Information

Locus tag: Sare_3947
Symbol:
ID: 5708218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4489542
End bp: 4490888
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 73%
IMG OID: 641273372
Product: peptidase M16 domain-containing protein
Protein accession: YP_001538728
Protein GI: 159039475
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
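
The start and end coordinates, the gene length, and the protein length in this record agree with one another, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4489542, 4490888)
print(length, protein_length(length))  # prints: 1347 448
```

The 1347 bp span corresponds to 449 codons, one of which is the stop codon, leaving the 448 residues listed above.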


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.530134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.18472
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTGA TCAGCACCCA GCCCGGTCCG GGTACCGCCC GCCCGTACCG GTTTCCGCAG 
GTGGTCCGGC GTTCCGTCAA GGGTGGGCAG GTGGTGGCCG CGCACCTGCC GGGGCAGTCG
TTGGCCGTGG CGCTCCTGCT GCTCGACGCC GGTGCCGGCC GGGAACCGCG TGGGCGTGAA
GGGCTGTGCG CGGTGCTCGC CAAGGCCCTG GAGGAGGGCA CGGCGCAGCG GGACGCGACG
GCGTACGCGC TGGCCATCGA GGCGCTCGGC ACGGAGCTGG TGACCGGCCT GGACTGGGAC
TCGTTCCAGG TGAGCGTGCA GGTCCCGGTG GATCGGTTGC CCGCCGCGGT GGAGCTGTTG
GCCGAAGCGG TGCGTACCCC CCGGCTGGCG CCGGACGACG TGCGGCGGGT CCGCGACGAC
GAGGCGACCG CCCAACGGAT GGACTGGGCG AATCCGGGTC CGCGGGCGGA TGCGGCGCTG
CGGGCCGACC TGTACGGCGC CGAGAACCGC TGGGGCCGAC CGTTGTACGG CGATCCGGAC
ACGGTGGCCG GGCTGGACAT CGAGGATGTT CGAGTTTTCC ACTCGGAGTG GTTCCTTCGG
CCGGGCACCC TGATCGTCGC CGGGGACCTG GACCGGCTCG ACCTCGACGC GCTCGGCGCG
GCGGCGTTCG CCGGCACCGG TGGCGGCCCG GTGGACCGGG GCGACCCGAT TCCGGTCACG
CCACGCCAGG GGCGCCGAAT CGTCCTGGTG GACCGGCCGG GTTCGGTGCA GTCGACGCTG
CGGCTCGGGC ATCCGTCACC GCACCGCGCG CACCCCGATC ACGTCCCGAT GACGCTTGCT
GGTGCCGTTC TCGGCGGTGC CTTCACGTCC CGGCTCAACC ATCTGATCCG CGAGGTGCGC
GGCTACACGT ACGGGATCCG GGGCGACTTC GTGTCCTCCC GCCGGTTCGG GCGGTTCGCG
GTCAGCTCCG GCGTACAGAC CGCGGTCACC GCGCCCGCGC TGGTCGAGGC GGTTGGCGAG
ATCACACGTA CCCAGCAGAC CGGGGTGACC GGGGAGGAGC TGGCGGTGGC GCGCTCATGG
CAAGCCGGCC AGCTCTCGGT CGAGTTGCAG ACGCCACGGG CGATCGCCGC GGCGCTGACC
ACGCTGGTAG TCCACGACCT ACCGGACGAC TACTACGCCC GGCTGCGGGA GTCACTGCTC
GCCGCCGAGG TCGGCGAGGT CTCGGCTGCC GCCGCCGCGC ACCTGCACCC CGAGTCGCTG
ACCCTGGTGA TCGAGGGTGA CGCTGCCCTG ATCCGGGCCG AGCTGGCGGC GACCGGCCTG
GGTGAGGTCC TCACCAGCAC CCGCTGA
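
The gene length and GC content reported for this record can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown above in space-separated blocks of ten bases. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste every line to check the full gene.
first_line = "ATGACGCTGA TCAGCACCCA GCCCGGTCCG GGTACCGCCC GCCCGTACCG GTTTCCGCAG"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.0f}%")
```

Running the same two functions over all lines of the gene sequence gives the values to compare against the gene length and GC content fields of the record.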
 
Protein sequence
MTLISTQPGP GTARPYRFPQ VVRRSVKGGQ VVAAHLPGQS LAVALLLLDA GAGREPRGRE 
GLCAVLAKAL EEGTAQRDAT AYALAIEALG TELVTGLDWD SFQVSVQVPV DRLPAAVELL
AEAVRTPRLA PDDVRRVRDD EATAQRMDWA NPGPRADAAL RADLYGAENR WGRPLYGDPD
TVAGLDIEDV RVFHSEWFLR PGTLIVAGDL DRLDLDALGA AAFAGTGGGP VDRGDPIPVT
PRQGRRIVLV DRPGSVQSTL RLGHPSPHRA HPDHVPMTLA GAVLGGAFTS RLNHLIREVR
GYTYGIRGDF VSSRRFGRFA VSSGVQTAVT APALVEAVGE ITRTQQTGVT GEELAVARSW
QAGQLSVELQ TPRAIAAALT TLVVHDLPDD YYARLRESLL AAEVGEVSAA AAAHLHPESL
TLVIEGDAAL IRAELAATGL GEVLTSTR
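
The protein sequence follows from the gene sequence by codon-wise translation with translation table 11. The sketch below is one way to reproduce it in plain Python, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. It does not handle alternative start codons or ambiguous bases (an `N` would raise a `KeyError`), and the function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order, third base varying fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACGCTGA TCAGCACCCA GCCCGGTCCG"))  # prints: MTLISTQPGP
```

The output matches the first block of the protein sequence. Applied to the last 27 bases of the gene, the same function yields `GEVLTSTR` and stops at the terminal `TGA` codon.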