Gene Sare_4506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4506 
Symbol 
ID5707027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5093468 
End bp5094973 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content73% 
IMG OID641273920 
ProductNLP/P60 protein 
Protein accessionYP_001539269 
Protein GI159040016 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.918319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0864721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGACA GCGAGTACGG ACGACGACCT CGACGAAGCC CGGTGATCTC GCCGGTGCTC 
CGCCCGAAGC TCTGGTCCGC GTTGCTGGGT GCCATCGCCG CCGCCGCGCT CAGCGCGCCG
GCCCACGCCG ATCCGTCGCT ACCCAGCACC GTGCCCGACA CCGGCGCCCG CCCGATCGTG
TCCGGCCCGC TGAGCCTGCC CGGCGGCGCA TCACTCACCC CGTCCTCCCC GCCGCCGGTC
ACCAACCTCG TCAACGGGCC GCTCGCCGCC AAGATCTACG CGGCCGAGGC GCGCGTCGGC
CAGCTGAGCG ACGAACTACT CCTACTCAAG CAGCAACGCA CCGAAGCGCA GACGCAGCTC
ACGGCCGCCG AGCAGGATCT GAACCGGGCC CAGGCAGTGC TGGCCAGAGC GCAGGAACGG
GCCGATTCCG CGGTCGCCGA CGCGATCAAG GCCGCCGCGG CGTTGCCGCC CGCGCCGTTC
GCCACCGACC TCCAGGACCT GAACGAGATT TCCGGGATCA CCCGGGGAGA GAAGGTCACG
GGCGGAGAGA CCACCGCCGC GGCCCGAGAG TTCAACCGTG CCCGCACCAG CGAACAGGTC
GCGCAACAGG CGATGGCCGC GGCACAGGCC CGGGTGCGCT CCGTCGACAC CGCCTACTCG
ACCACGGAGC AGGCGCTGCG CGGCGAGGAG GCGGCGCTCG CCACCCTCCG GCGGGACAAC
GCCGCACAGT TGCTCGAGCT GGAGCGCCAG CAGGAGGCAG CGGAGCAGGC GCTCGGCGCG
CAGTGGGTCG CCAACGAGAC GGCGAACGGG CTCACCGCCC ACCCCACCGC CCGCAAGGCG
GTCGAGTACG CGCTGGCCCA GCTCGGCGAT CCGTACCTGT GGGCGGCCGA GGGACCGGAC
CGGTTCGACT GCTCCGGCCT GGTCTGGGCC GCCTACCGAT CGGCCGGCTA CCGCGCCCTG
CCCCGGGTCT CCCGCGACCA GTACTACGCG ACCCGGAGTC GCACCGTGGC CCGGACCGGC
CTCCTGCCCG GCGACCTACT CTTCTTCGCC TCCGGCTCGA GTTGGACGAG CATCCACCAC
ATGGGCATGT ACATCGGCCG CGGGCGCATG GTGCACGCCC CGCGCAGTGG CGACGTGGTC
AAGATCTCAA CTGTGACCTG GTCGCGCCTC TACGCCGCGA CGCGGGTGGT CAACGGGGTC
CCCATCCCGA CCACTCCCAC GCCCACGCCC ACCGTGTCGG CCACTCCCAC ACCAACACCG
TCGGCCACCC CGAAACCGAC GCCGCCTCCC TCGGCGACGC CGTCCCCCTC GGCCACGGGC
ACCCCGTCGC CCACCACGAC TCCGACCAGC ACACCATCGC CCACCCCGTC CCCCACCAGC
ACTCCCACCA CGACCCCGAC CGGCACACCA CCACCCGCCA CCTCTGCCCC GGCGTCGACC
TCGGCAGCCC CCACCTCACC GGCGCCCACC ACGCCCACGG CGACCGGCGG CTCACCCCTG
CCGTAG
 
Protein sequence
MGDSEYGRRP RRSPVISPVL RPKLWSALLG AIAAAALSAP AHADPSLPST VPDTGARPIV 
SGPLSLPGGA SLTPSSPPPV TNLVNGPLAA KIYAAEARVG QLSDELLLLK QQRTEAQTQL
TAAEQDLNRA QAVLARAQER ADSAVADAIK AAAALPPAPF ATDLQDLNEI SGITRGEKVT
GGETTAAARE FNRARTSEQV AQQAMAAAQA RVRSVDTAYS TTEQALRGEE AALATLRRDN
AAQLLELERQ QEAAEQALGA QWVANETANG LTAHPTARKA VEYALAQLGD PYLWAAEGPD
RFDCSGLVWA AYRSAGYRAL PRVSRDQYYA TRSRTVARTG LLPGDLLFFA SGSSWTSIHH
MGMYIGRGRM VHAPRSGDVV KISTVTWSRL YAATRVVNGV PIPTTPTPTP TVSATPTPTP
SATPKPTPPP SATPSPSATG TPSPTTTPTS TPSPTPSPTS TPTTTPTGTP PPATSAPAST
SAAPTSPAPT TPTATGGSPL P