Gene Sare_3554 details


Gene Information

Locus tag: Sare_3554
Symbol:
ID: 5705047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4101064
End bp: 4103250
Gene Length: 2187 bp
Protein Length: 728 aa
Translation table: 11
GC content: 71%
IMG OID: 641272981
Product: protein of unknown function DUF893 YccS/YhfK
Protein accession: YP_001538347
Protein GI: 159039094
COG category: [S] Function unknown
COG ID: [COG1289] Predicted membrane protein
TIGRFAM ID:
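
The start, end, gene length, and protein length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a CDS that ends with a stop codon; both are assumptions about this record's conventions, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends with a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon is not translated
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(4101064, 4103250))  # (2187, 728)
```

Under those assumptions the result agrees with the listed gene length of 2187 bp and protein length of 728 aa.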


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0186862
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0461677
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGTGGTCGC GACGCGGTCG TGCCCTGGAC TGGCTCCAGC AGCGGGATCC GGACTACTGT 
GTCACCCGCC GTGCGACACG GTTGACCCTC GTCGCCTGCC TCCTCTTCTA TGGCTGTCGT
TACGGCCTGG GCAGCACGCT CATGGCGACG TACGCGCTGT TCGGCACCAT CGCCGCCGGC
GTGTTCGCCC AGGTACCGGG AGGGCCGGCC CAACGGGCAC GGACGCTGCT CGCCGCCCTG
CCGGTCATCT GGATTCTCAT CGCCGCCGGG ACTGTGCTGG CCGCGAGTAC CTGGGCCGCC
ACGATCGGCA TGCTCGTCAT CGGGTTCGCC GTCGCGTTCG CCGGTGTCGG CGGACCTCGC
CTGATCGGCC TTGCCAGCGC ATTTCAGATC TCCTACATCC TGGCCAGCTT TCCCCCGTAC
CAACCCGACA CGTTGCCGCA ACGGCTGGCC GGGATAACGC TCGCCATCGT TTCGGTGGCC
GTGGCCGAGG TGACGCTCTG GCCCGGCCCC GCACCGGTCA CCACCGCGCA ACGCCTGGGC
TGGGCCGCGC GGAGCGTCGC CTCGTTCGTC TGCGCGCTCG CGGACATGCT GGCCGGGCAG
CCAGCCGCTG CTGCGGAGGC ACGGCGACGC CAGCAGAGCG CGGCCGACGC CGTCACGCAG
ACCCAGATGT GGCGTCTGCC GCCGACCGAG CGGGCGACGT CGGCGAGCCG GCGGGACCAC
GCGCTGCGGG ACGCGGCCAC CGACCTGCAC CAGACGTTCC GGCACGCCGA GTGGCTGCTG
CGGAGTGCCG GAACGACCAC CGACGTGGAC GCGGCAAGGT CGCTGCGGCG ATGCGCGTCA
TCTCTGGGCC GCTCCGGCGA CGCCCTGTTG GGAACGGAGC CGGCGTCCCT CGACGCGGTC
GTGACCATGG AGCACCAGCA GCCGGACTCC TGGCGATCAG CCACGACCAG TCAGCTACGT
GTGGCCGCGA TCACCCGGTC GCTGGCACAG CACACGGACT TCGTCGCGAC TGCCGTCAGG
ATCGCGGGGG GCCACACCGA CGCCACCCAC CAACCGGCCG GACCGCACAG TTTCTGGTAT
CTCCGGCGAC GTGCCCCGTC GTTGTACTGG CAGCGGCTAC GGGTGCACCT CACCCCGCGC
TCCGTGTTCT TCCAGCGGGC CCTTCGACTC GCCGTCGCTC TGGCCGCCGC CCGGCTGGTC
GCCGGTGCGC TGGATCTGGA GCACGGCTTC TGGGTGCTCC TGGCCACCCT CACCCTCCTG
CGGACCTCCG CCGCAGACAC GTGGGCCTCG TTCCGCCTGG CCCTGGCCGG CACACTCGTC
GGCGCCGCCG CCGGCGCACT GTTACTGCTG GTCTCTCCCC AGCACGGCAC GTACGCGGCG
ATCCTGCCGC TGACGATGCT GTTGGCCCTG GGTGTGGGTC CGTTGCTCGG GCTCGCGTGG
GGGCAGGCGT TTCTGACGCT ACTGCTGATC GTCGCCTTCG CTCAGCTCAC CCCGACCGAC
TGGCAACTGG CCGGTGTCCG CCTGCTGGAC GTCCTGACCG GCGCGGCGAT CGGCGTCGTC
ACCGGCATCC TCATGTGGCC CAAGGGCGGC GGCGGCGAAC TCCGCCGCTA CACGTCGGCC
TACCTCGGCG CGGGTGCCCA CGCGATCGAG GAAACCGTGA TGAGACTGGC GGGACGGGAC
ACTCGCCAGC ATGCGGTCGA GGCGGCGCAC CGGGCACAGG TCCTCACCGA CGCGTCGTTC
TGCCAGTACC ATCTGGAGCG CCTCGATCCG CGCCCCACAG ACGTGGACTG GGAGGCGGCC
CTGGCCGCGG GGCATCGCAT CGTGCGCGGC GCGGAGCGAC TCGCCACCGA TGACCGGGCC
GGATCGCTGG GGCCACGGTG GCCCGAACCC ACGGCCCACC TCGTGCGCCG GGCCCAGCGG
CTGCGCTCCG ACTACGCCGA CCTGGCCAGC CGTTTGCCGC AGGGTCACCT CCGCGATGAG
GCTCCCATCG CGCAAGCCAC GATGGGTGTC GTCGAGCAGG TCCACGAGAT CATTCAGGGC
GGCGAACGGC GGGCCGACGT CCTGAACCTG GTCGAGGTCG ACCACTGGCT GGCCGACCTC
GGTCGAAACC TCAACCGAAT TCCCGCATCC GCACGGCAGG GCGAGGGGCG TGGTGCGCCT
GGACCGTCGA CGCCGTCGGG TGGCTAA
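
The gene sequence above is printed in space-separated blocks of ten bases. As a minimal sketch, the layout can be stripped and the GC content recomputed as follows; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not state how its own GC figure was computed.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence in this record: six blocks of ten bases.
first_line = "GTGTGGTCGC GACGCGGTCG TGCCCTGGAC TGGCTCCAGC AGCGGGATCC GGACTACTGT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.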
 
Protein sequence
MWSRRGRALD WLQQRDPDYC VTRRATRLTL VACLLFYGCR YGLGSTLMAT YALFGTIAAG 
VFAQVPGGPA QRARTLLAAL PVIWILIAAG TVLAASTWAA TIGMLVIGFA VAFAGVGGPR
LIGLASAFQI SYILASFPPY QPDTLPQRLA GITLAIVSVA VAEVTLWPGP APVTTAQRLG
WAARSVASFV CALADMLAGQ PAAAAEARRR QQSAADAVTQ TQMWRLPPTE RATSASRRDH
ALRDAATDLH QTFRHAEWLL RSAGTTTDVD AARSLRRCAS SLGRSGDALL GTEPASLDAV
VTMEHQQPDS WRSATTSQLR VAAITRSLAQ HTDFVATAVR IAGGHTDATH QPAGPHSFWY
LRRRAPSLYW QRLRVHLTPR SVFFQRALRL AVALAAARLV AGALDLEHGF WVLLATLTLL
RTSAADTWAS FRLALAGTLV GAAAGALLLL VSPQHGTYAA ILPLTMLLAL GVGPLLGLAW
GQAFLTLLLI VAFAQLTPTD WQLAGVRLLD VLTGAAIGVV TGILMWPKGG GGELRRYTSA
YLGAGAHAIE ETVMRLAGRD TRQHAVEAAH RAQVLTDASF CQYHLERLDP RPTDVDWEAA
LAAGHRIVRG AERLATDDRA GSLGPRWPEP TAHLVRRAQR LRSDYADLAS RLPQGHLRDE
APIAQATMGV VEQVHEIIQG GERRADVLNL VEVDHWLADL GRNLNRIPAS ARQGEGRGAP
GPSTPSGG