Gene Sare_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1669 
Symbol 
ID5703439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1920656 
End bp1922524 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content67% 
IMG OID641271173 
Productsecreted protein 
Protein accessionYP_001536548 
Protein GI159037295 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.180028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.687449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCT CCCACCCACG CCGACACACA CGGCGGAGCC TGGCGCTCGG TCTCGCGCTG 
GCAACGGCCG TCACCGCCAC GGCCCCCAGC ACCGCCACCG CCGGCCCACC CCGGTCCGGC
ACCCCCGACT TCGGCCCCAA CGTGACGATC TTCAGCCCGG ACACGCCCAT CGACGAGATC
CAGGCAACCA TGGACGCACG GCACGCCGCA CAGGTGGACG CCGAGATGGG CACCGACCGA
CACGCATATC TCTTCCTACC CGGCGACTAC GGCAACACCG AACAGCCGTT GCAGGTGAAG
CTCGGCTACT ACACCCAGGT GTCCGGCCTC GGTGCCACCC CCACCGACGT CCGGGTCCAC
GGAAAGATCG AAGTCTACAA CCGCTGCCTC GACGACGGCA CCAGCAACTG CCTCGCACTG
GTCAACTTCT GGCGGACACT GTCCAACCTG GCGCTGACCA TCGACTCCGC CGACCAGGAC
GACTGTCGGT CCTCAGCGAA CTTCTGGGCG GTGTCGCAGG CCGTATCGAT GCGCCGCCTG
GACGTCAGCG GCGGCACCCT GTCGCTGATG GACTACTGCA CCGCCGGCCC GCACTACGCC
AGCGGCGGAT TCATCGCCGA CTCACGACTA CCCGACGTCG TCAACGGCTC ACAACAGCAG
TGGCTGACCC GCAACAGCGA GATCGGCAGC TGGTCCAACG CCGTGTGGAA CCAGGTCTTC
GCGGGTGTCA TCGGTGCGCC GGACGACGCC GGCTTTCCCG ACCCGCCATA CACCACCCTC
GGCACCACGC CACTGAGCCG GGAAAAGCCG TACCTGTTCG TTGACGATCG GGGCCGTTAC
CAGGTGCGAG TGCCGGCTGC TCGCCGTGAC ACCCGGGGCA TCTCCTGGGA TGCGGGGCAC
GCGCCCGGCC GAAGCATCGC GATCCGCGAC TTCTACATCG CCCGTCCCGG TGATTCCGTA
CGTACCATCA ACCAGGAGTT GGCTCGGGGC AAGCATCTCC TGCTCACCCC CGGCCGGTAC
GACATCGCCC AGAGCATCAG GATCCGCCGG CCGGACACGG TCGTCCTCGG CCTGGGACAC
GCCACGCTGA CCGCCGTGGA CGGTGCGATG CCGCTCGACA TCGCCGGCGT TCCCGGTGTC
GTGGTAGCCG GGGTGACGGT CGACGCCGGG CTCCAGGAGT CGCCGGTGCT GCTCCGGGTC
GGCGAACGAC ACGGACGCCA CCACAGCACC CCGCGGAACC CGATCACGCT GTCCGACGTG
TACTTCCGGG TCGGCGGGCC GCACATCGGT CGGACCCACA CCGCGCTCGA AATCAACAGT
GACCACGTGC TGATCGACCA CACCTGGGTG TGGCGAGGCG ACCACGGCGT CGAGGACTTC
ACCGACGGGG TCAAGGGTGA CACCGATCGC TGGCACACCA ACACCGGCCG GTACGGTGCA
ATCATCAACG GCGACCGGGT CACCGCCACC GGTCTGTTCG TCGAGCACTT CCAACGCCAC
AACACGGTGT GGAACGGTGA ACACGGCACC ACGATCCTCT ACCAGAACGA ACTGCCCTAC
GACCCGCCCA CGCAGGCCGA CTGGATGAAG GGCGACGTCG AGGGCTGGGC CGGCTACAAG
GTCGGCGACC GGGTACGGCA CCACACGCTG TACGGCGGCG GGGTGTACGT GTACAACCGG
AACAACCCGT CGATTCATAC TGAGAACGGC TTCGAGGTGC CGGACCGCCC CGGGGTACGG
CTTCATCACG TGATGACCGT GAACCTGAAC GCCGGCACGA TCGACCACGT GGTCAACGGG
ATCGGTGCGG CGGCCGACAC CACGCGCGTC GGTGCGCCGG TCTACCTCAC CGAGTATCCG
ATCGATTGA
 
Protein sequence
MTTSHPRRHT RRSLALGLAL ATAVTATAPS TATAGPPRSG TPDFGPNVTI FSPDTPIDEI 
QATMDARHAA QVDAEMGTDR HAYLFLPGDY GNTEQPLQVK LGYYTQVSGL GATPTDVRVH
GKIEVYNRCL DDGTSNCLAL VNFWRTLSNL ALTIDSADQD DCRSSANFWA VSQAVSMRRL
DVSGGTLSLM DYCTAGPHYA SGGFIADSRL PDVVNGSQQQ WLTRNSEIGS WSNAVWNQVF
AGVIGAPDDA GFPDPPYTTL GTTPLSREKP YLFVDDRGRY QVRVPAARRD TRGISWDAGH
APGRSIAIRD FYIARPGDSV RTINQELARG KHLLLTPGRY DIAQSIRIRR PDTVVLGLGH
ATLTAVDGAM PLDIAGVPGV VVAGVTVDAG LQESPVLLRV GERHGRHHST PRNPITLSDV
YFRVGGPHIG RTHTALEINS DHVLIDHTWV WRGDHGVEDF TDGVKGDTDR WHTNTGRYGA
IINGDRVTAT GLFVEHFQRH NTVWNGEHGT TILYQNELPY DPPTQADWMK GDVEGWAGYK
VGDRVRHHTL YGGGVYVYNR NNPSIHTENG FEVPDRPGVR LHHVMTVNLN AGTIDHVVNG
IGAAADTTRV GAPVYLTEYP ID