Gene Sare_4672 details

Gene Information

Locus tag: Sare_4672
Symbol:
ID: 5704851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5292116
End bp: 5293492
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 71%
IMG OID: 641274070
Product: SCP-like extracellular
Protein accession: YP_001539416
Protein GI: 159040163
COG category: [S] Function unknown
COG ID: [COG2340] Uncharacterized protein with SCP/PR1 domains
TIGRFAM ID:
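
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5292116
end_bp = 5293492
gene_length_bp = 1377
protein_length_aa = 458

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)  # 1377 459 0 458
```

Under these assumptions the span equals the reported gene length, divides evenly into codons, and yields the reported protein length once the stop codon is excluded.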


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.988125
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.181044
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTATGGCT GGAGCGAATC GATGGAGCCG GATGACAACC GCCGGCGATC CGAACCAACG 
GCCGACGGCT GGGACCGGCC CGCGGACGGC TGGGACGGCC CTACCGACGG CTGGGATCGG
CCTGCCGACG GCTGGGACCG GCGGCAGCAG AACGCGCACT GGCGGGCCGA GTCCGAGACC
CCGCAGGGCT GGGGAACCGC GGGCGGCTCG TACCACGACC AGCCCTCGCA AAGTGGGGAG
TCCGTCCCCA CCGGGCAGCA GCGTGGGTCC ACCGACGGCT GGCAGCAGGA ACTATCGGGC
GGCTGGCGCC GCCGGCAGCC CACGGACTGG CGGAACGAGT CCTCCGGAGG CTGGCGTCAC
GCGGCTGGTG CCACCGGCGG ACCCGAGGTC ACCGGCGGTT GGCGGCCCGA CGAGACGACC
GCCAGGCCGA CCAGCGGGCA CGCAGACGAC CTGACGAGCA CGCGGCCGAT CGCTGGCGCC
GCGCCGGCCG ACGCTACACC CGCCAGCAAG GGCCATCGGG CCGGCCACCG GAACCGGCGG
CCCGTGCTCA TCGGGGCCGC GGCGGCTGCC ACGCTGGTCG TGAGCCTCGG AGTCGGCACC
GCCGCACTCT CCGACGGTGG CGACACGATC CCCACCTCGG CCCACAAGGA CATGGCGGCC
ACGAGTCCGA CGTCGTACGA GAGGGGCATG ACGTCGTCCT CGACCAACAC CGCGTGGGGC
AACGGGTCAC GGTCGCGGTC AGATTCCGCG CACCGAAAGT CAGCCTCGAA GGCGTCGCGC
CACTCCGCCA AGTCCCGCTC AGACCAGGAA CGCAAGAGGG CCAGGCACCG CCCCAAGCCG
GCACACACCA CCACCGCGCC CAGTCCCACT CAGGTCCCCA CGACCGCTCC TAGTCCCACT
CAGGTCCCCA CGACCGCACC CAGTCCCACT CAGGTCCCCA CGACCGCACC CAGCCCCACT
CAGGTCCCCA CCACCGTGCC CCCCAACGGT GACGTGAGCA CCGAGGCCAG CGAAGTGGTC
CGGCTGGTTA ACGCCGAACG TGCGAAGGCC GGCTGCGAGG CGCTGAGTAT CAACGAGAAG
CTGATGACCG CTGCCCAGCA ACACAGCCAG GACCAGGCCG ACCACCAGAA GATGTCACAC
ACGGGCAGCA ACGGCAGTAG CCCCGGCGAC CGGATCAACG CTGTCGGCTA CGAGTGGCGC
GCCTACGGGG AGAACGTCGC CTGGAACCAG CAGTCACCCG AGGCCGTGAT GGACGCGTGG
ATGAATAGCT CCGGCCACCG GGCGAACATC CTGAACTGCT CCTTCACCGA GATCGGAGTC
GGTGTCGCGA GCAGCAACGG ACCGTACTGG ACACAGGTCT TCGCCGCGCC TCGCTGA
 
Protein sequence
MYGWSESMEP DDNRRRSEPT ADGWDRPADG WDGPTDGWDR PADGWDRRQQ NAHWRAESET 
PQGWGTAGGS YHDQPSQSGE SVPTGQQRGS TDGWQQELSG GWRRRQPTDW RNESSGGWRH
AAGATGGPEV TGGWRPDETT ARPTSGHADD LTSTRPIAGA APADATPASK GHRAGHRNRR
PVLIGAAAAA TLVVSLGVGT AALSDGGDTI PTSAHKDMAA TSPTSYERGM TSSSTNTAWG
NGSRSRSDSA HRKSASKASR HSAKSRSDQE RKRARHRPKP AHTTTAPSPT QVPTTAPSPT
QVPTTAPSPT QVPTTAPSPT QVPTTVPPNG DVSTEASEVV RLVNAERAKA GCEALSINEK
LMTAAQQHSQ DQADHQKMSH TGSNGSSPGD RINAVGYEWR AYGENVAWNQ QSPEAVMDAW
MNSSGHRANI LNCSFTEIGV GVASSNGPYW TQVFAAPR
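
The relationship between the two sequences can be illustrated by translating the first row of the gene sequence. This is a minimal sketch, not the database's own pipeline. It relies on translation table 11 sharing the standard amino acid assignments, and on the alternative start codon GTG being read as methionine at initiation. The function name `translate_cds` is hypothetical, and the code assumes the input begins at the initiation codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS (or its 5' fragment) that starts at the initiation codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if protein:
        # Assumption: the first codon is the start codon, so it is read as
        # Met even when it is an alternative start such as GTG.
        protein[0] = "M"
    return "".join(protein)


# First row of the gene sequence in this record (60 nt, 20 codons).
first_row = "GTGTATGGCT GGAGCGAATC GATGGAGCCG GATGACAACC GCCGGCGATC CGAACCAACG"
print(translate_cds(first_row))  # MYGWSESMEPDDNRRRSEPT
```

The output matches the first 20 residues of the protein sequence above. Without the initiation rule, the leading GTG would be translated as valine (V) rather than M.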