Gene Sare_0165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0165 
Symbol 
ID5706355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp178824 
End bp179954 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content69% 
IMG OID641269691 
Productradical SAM domain-containing protein 
Protein accessionYP_001535091 
Protein GI159035838 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.263951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000169446 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGTAC GGGGGCAGTC GAGTCGGATG CGCGGCCTGG CCGCCGTTCC GGCGTATGTG 
GTCATGCAGC CCACCACGCT CTGCAACCTC GACTGCGTGT ACTGCTACCT CCCGTTGAGG
GCGGCCGACC GGCGGATGCC GGTTGCGGTT GCGGAGGCGG TGGCGGCATC GGTCAACCCG
TGGGCGCGGG CCGGGCGGTT CTCCGTGGTG TGGCACGGCG GCGAGCCGCT CGCTGCGGGG
AGAGAGCTAC TCGCCGCGCT GATCGCCCCG TTCGGGCCGG AGGTCGAGCA TCACGTCCAG
ACCAATGCGG CGCTGATCGA TGACGCCTGG TGTCGGTTCT TCGCGGAGCA CCAGATCCGG
GTGAGTGTCA GCGTGGACGG GCCGCGGGAG CACAACGGGG GCCGGGTCAC CCGAGGCGGA
CGTCCCGCGT ATGACCGGAT CGTGCAGGGA GTCGCGGCGT TGCGGCGGCA CGGCCTACCG
TTTTCGGCGC TGGCTGTGGT GGGGCACCCC AAGCCAGGTC TCGCCCGTGA ACTCTATGAC
TTCTTCCTCG ACCTCGGCCC GGACGTGCTG GGTGTGAACA TCGAGGAGAC CGAGGGAGTC
AACACCCGGG CCAACCGTCA CGACGCGGCC GCGGTGACCG CCTTCTGGGC GGAGCTGGTG
GCGGCCTGGC GCCGGAATCC CCGCATCCAT CTGCGTGAGG TCGAGTGGTC CCTGCGGTAC
GCCGCCGCGG CGCTGGACGG TGTCGAGGGT GAGGTGCTGC CCCACCAGCT GGATCCGATC
CCCACGGTCG GTCACGACGG TTCGGTGACC GTGCTCTCGC CCGAGCTGGC CGGCTTCACG
AACCCCCGCT ACGGCGACTT CAGTAGCGGC AACGTGCTGG TCACCCCGTT GGCGGAGATT
CTGGCCGAGG CCACACAGAC ACCCTGGGTG GGGGAGTTTC TCACCGGGGT GGAGGCATGC
CGGTCGTCAT GTCCCTACTT CGGCTTCTGC GGCGGCGGCC ACGCGGCCAA TCGCTACTTC
GAGCAGGGAC GGTTTGACGG CACCGAGACC GAGCACTGCC GCAACAGCAA GATCCGCCTA
CTGGAGGGAG TGTTGGAGCA TGCCCGAGGA CACCGGTCAC CGGCAGTCTG A
 
Protein sequence
MAVRGQSSRM RGLAAVPAYV VMQPTTLCNL DCVYCYLPLR AADRRMPVAV AEAVAASVNP 
WARAGRFSVV WHGGEPLAAG RELLAALIAP FGPEVEHHVQ TNAALIDDAW CRFFAEHQIR
VSVSVDGPRE HNGGRVTRGG RPAYDRIVQG VAALRRHGLP FSALAVVGHP KPGLARELYD
FFLDLGPDVL GVNIEETEGV NTRANRHDAA AVTAFWAELV AAWRRNPRIH LREVEWSLRY
AAAALDGVEG EVLPHQLDPI PTVGHDGSVT VLSPELAGFT NPRYGDFSSG NVLVTPLAEI
LAEATQTPWV GEFLTGVEAC RSSCPYFGFC GGGHAANRYF EQGRFDGTET EHCRNSKIRL
LEGVLEHARG HRSPAV