Gene Sare_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2189 
Symbol 
ID5706245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2519440 
End bp2520819 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content72% 
IMG OID641271671 
Producthypothetical protein 
Protein accessionYP_001537042 
Protein GI159037789 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0666639 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGGGGC CGAGACCGAC GGATGCCGTG GAGACCGCCC TGTCCGCCAG CCGTTCCGAT 
GCCTGCGTCG TGATCGTGGA GCACGGCGCG GAGCGACACC TACGGTGGGC AGACAGCGAA
CTGATCGGTA CCGGCGACGT CGACACGTGT CAGGTCTCGG TGGTGTCGGT TGTGGGCGAC
CGGGTAGGGG CGGTCACCGT GAGCGGGCGG GCTGACCGAG CGGAGATCGT CGCGGTGGTC
CGTGCGGCGG AGGCGGCGGC CCGACAGGCG CCGCCGGCCG AGGACGCACG GCCCCTGCCC
GCCTCCGGTG CTGAATCGCC GCATTGGGAC GCTGCGCTTC CGGCCCCGTC GTCGGCGGTG
CTTGCCGGGC TCGTGGACCA GCTCACCGAG GGGTTCGCCC GGGCCCGGCG CAACGGCCAG
GCGTTGTACG GGTATGCCGA GCACCGCCGC CGGACCACCT TCCTCGGCAC CTCCACCGGG
GTCCGGCTGC GGCACGACGA TCGGGCCGGG TACCTGGAGC TGACCGCGGG GGACCGGGAT
GGCACTCCGG CCTGGACGAA CGCCGTGACC ACCGAATTCG ACGACGTGTC GGTGCGCGAG
CTCCAGGATG AGCTGGACCG GCGACTGCGC TGGGGTCGCC GGAGCATCGA GTTGCCGCCG
GGGCGCTACG AGTGCCTGTT GCCGCCGTCG GCCGTCGCTG ATCTGATGAA CTACGCGTAC
ACCACGGCCG GTGCCCGTGC GGCGGCGCAG GGCCGATCGG TGTACAGCCG GCCCGGCGGC
CGGACCCGGG TCGGGGAAGT GCTCTCCGAT GTGCCACTGA CCTTGCGCAG CGACCCGGCG
GCCGATCGGC TGCGCTGCCC CCCGTTCCTG GTGACGTCGT CGTCGACCGG AACCCGGTCG
GTGTTCGACA ATGGCCTCCC GCTTGGTCCC ACCTCCTGGT GGGAGCGAGG GCGGCTACGG
TCGCTGGTGC ACACCCGCGC GAGTGCCGAG GAACTGGGTG CGCCACTGAC CCCGATGGTG
GACAACCTCG TCCTCGACGG ACCGCCCGGC GGTGGCGACA CCGCGGAGTT GATCGCTCGC
ACCCGGCGTG GTCTGCTCCT CACCAGTCTG TGGTACATCC GCGAGGTTGA TCTCGCCACG
ATGGCCCTGA CCGGGCTGAC CCGGGATGGT GTGTTCCTGG TGGAGGAGGG GGAGGTCGTC
GGGGCGGTGC ACAACTTCCG GTTCAACGAC AGCCCACTGG CCATGGTCGG CCGGGTCGTC
GAGGTGGGGC GCACCCTGCC CACCCGGGCT CGGGACTGGG GGGACGCGGT GGGCCCCACC
GCCATGCCGA TGCTGCGGGT GCGGGACGTC CGGCTGACCG CCGTGACGCG TGCCCGCTGA
 
Protein sequence
MRGPRPTDAV ETALSASRSD ACVVIVEHGA ERHLRWADSE LIGTGDVDTC QVSVVSVVGD 
RVGAVTVSGR ADRAEIVAVV RAAEAAARQA PPAEDARPLP ASGAESPHWD AALPAPSSAV
LAGLVDQLTE GFARARRNGQ ALYGYAEHRR RTTFLGTSTG VRLRHDDRAG YLELTAGDRD
GTPAWTNAVT TEFDDVSVRE LQDELDRRLR WGRRSIELPP GRYECLLPPS AVADLMNYAY
TTAGARAAAQ GRSVYSRPGG RTRVGEVLSD VPLTLRSDPA ADRLRCPPFL VTSSSTGTRS
VFDNGLPLGP TSWWERGRLR SLVHTRASAE ELGAPLTPMV DNLVLDGPPG GGDTAELIAR
TRRGLLLTSL WYIREVDLAT MALTGLTRDG VFLVEEGEVV GAVHNFRFND SPLAMVGRVV
EVGRTLPTRA RDWGDAVGPT AMPMLRVRDV RLTAVTRAR