Gene Sare_0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0940 
Symbol 
ID5708051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1064028 
End bp1065446 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content72% 
IMG OID641270458 
Producthypothetical protein 
Protein accessionYP_001535846 
Protein GI159036593 
COG category 
COG ID 
TIGRFAM ID[TIGR02958] secretion protein snm4 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.462432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGTA TCGACGGCGG TATGGTTGCC GTTGCGCCGC CAGCCGCGAC AGCGGGGGAC 
GTGTGCCGGA TCACCATCGA CACCGCGGCG GGACGCGTTG ACCTCGCGGT GCCGGTCGAC
ACACCGATGG CCGAGGTACT CCCGCTGGTG GTCCGCCACG TTGACCCTGC ACTGACCGGT
AGCGGCGCCA CCGCCACGTA CGTGTTGCAA CGCCTCGGCG AGAGGGCCCT CGACGAGGAC
CGTACCCCAG CCGCGTTGGG GCTACGGGAC GGCGACGTGC TGTACCTGCG GCCCAGCGAG
GATCCGCTGC CGCCCATTGA TGTGGACGAC GTCGTGGATG GTGTCGGGGC GGTGATTCGA
GACCGGCCGG ACGCCTGGCA TCCGGGGTAC ACCCGCCGAC TCCTGCTGGG CCTGACCGTG
ATGGTGCTGG CGGGGATGCT CATCGGCCTG CTGCTTCCTG CTCCGGCGGG GTGGCGCGCC
GGCGCCGCGG CAGGCGTCGC CCTGTTCCTC GTCGCCGCGA GTGGCACCAG TTCCCGCGCA
CTCGGCGATG GGGGCATCGG CGTCCTGCTC GGGATTGCGG CGGTGCCGTT CGCCGGACTG
GCCGGAGCAT CGGTGCCTTT CGCGGCCGGC GCCGACGCCT GGAACGGCAC CCAGCTGATG
GCCGCCGGGG CGGCGGCCAC CGCGACCGCG ACAGTGGTGG CGCTCGCGGT GGCGGTTGCG
CGGCCACTGT TCGTCGGGAT GGCGGTTGCC GCTGGATACG CCGTGCTCGC AGGTGTCCTC
ATCGTCGCGA TGAGGACGAG TGGGGTCGGT GCGGCGGTGA TCGTGGCCAG CCTCGCCTAC
TTCACCGGCG TGGCCAGCCC TACCGTGGCT GTCCGAGTTG CCCGCCTGCG CCCGCCGCGG
CTTCCGACCA CAGCCGAGGA ACTCCAACAG GACATCGACC CAATCCCCGA GGACCTGGTG
CGGTCCCGTA CGGTCGTCGC CGACCGGTAC CTGTCGGCGT TGTTCGCCGC CGCTGGTGCC
GCCGTGGTGG CGGCCCTGGT GGCGCTTTCC ACCGACGCGG GGTGGGCCCC GACATCGTTC
GTCGTCGTGC TCAGCCTGGC GCTGCTGCTG CGCGCGCGGA CACTGGTGAA CGCCTGGCAA
CGGCTCGCCA CCGCCGTGCC CGGCGCGGTC GGTCTGCTCC TGCTCGCGCT GGCGCTGGCA
GCCCGTGCGG ACGCCTCAGC CCGCAGCGCA CTGCTGACGG TGGGAGCGGT CTGTGTCGGC
GCCATCGTCG CCGTGGTGCA CCACCTGCCG CCACACCGCT CGTCGCCGTG GTGGGGCCGG
TCGGCCGACG TTCTGGAGAC ACTGGCTGCG ATCGCCATAG CTCCGTTGGC ACTTGCCGTG
CTGGGCGTCT ACGCCCGGGT ACGCGGACTG GGTGGCTGA
 
Protein sequence
MTGIDGGMVA VAPPAATAGD VCRITIDTAA GRVDLAVPVD TPMAEVLPLV VRHVDPALTG 
SGATATYVLQ RLGERALDED RTPAALGLRD GDVLYLRPSE DPLPPIDVDD VVDGVGAVIR
DRPDAWHPGY TRRLLLGLTV MVLAGMLIGL LLPAPAGWRA GAAAGVALFL VAASGTSSRA
LGDGGIGVLL GIAAVPFAGL AGASVPFAAG ADAWNGTQLM AAGAAATATA TVVALAVAVA
RPLFVGMAVA AGYAVLAGVL IVAMRTSGVG AAVIVASLAY FTGVASPTVA VRVARLRPPR
LPTTAEELQQ DIDPIPEDLV RSRTVVADRY LSALFAAAGA AVVAALVALS TDAGWAPTSF
VVVLSLALLL RARTLVNAWQ RLATAVPGAV GLLLLALALA ARADASARSA LLTVGAVCVG
AIVAVVHHLP PHRSSPWWGR SADVLETLAA IAIAPLALAV LGVYARVRGL GG