Gene Information |
Locus tag | Sare_4506 |
Symbol | |
ID | 5707027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5093468 |
End bp | 5094973 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641273920 |
Product | NLP/P60 protein |
Protein accession | YP_001539269 |
Protein GI | 159040016 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
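The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 5093468
END_BP = 5094973
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start_bp, end_bp):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1506
print(expected_protein_length(GENE_LENGTH_BP))  # 501
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.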
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.918319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0864721 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGACA GCGAGTACGG ACGACGACCT CGACGAAGCC CGGTGATCTC GCCGGTGCTC CGCCCGAAGC TCTGGTCCGC GTTGCTGGGT GCCATCGCCG CCGCCGCGCT CAGCGCGCCG GCCCACGCCG ATCCGTCGCT ACCCAGCACC GTGCCCGACA CCGGCGCCCG CCCGATCGTG TCCGGCCCGC TGAGCCTGCC CGGCGGCGCA TCACTCACCC CGTCCTCCCC GCCGCCGGTC ACCAACCTCG TCAACGGGCC GCTCGCCGCC AAGATCTACG CGGCCGAGGC GCGCGTCGGC CAGCTGAGCG ACGAACTACT CCTACTCAAG CAGCAACGCA CCGAAGCGCA GACGCAGCTC ACGGCCGCCG AGCAGGATCT GAACCGGGCC CAGGCAGTGC TGGCCAGAGC GCAGGAACGG GCCGATTCCG CGGTCGCCGA CGCGATCAAG GCCGCCGCGG CGTTGCCGCC CGCGCCGTTC GCCACCGACC TCCAGGACCT GAACGAGATT TCCGGGATCA CCCGGGGAGA GAAGGTCACG GGCGGAGAGA CCACCGCCGC GGCCCGAGAG TTCAACCGTG CCCGCACCAG CGAACAGGTC GCGCAACAGG CGATGGCCGC GGCACAGGCC CGGGTGCGCT CCGTCGACAC CGCCTACTCG ACCACGGAGC AGGCGCTGCG CGGCGAGGAG GCGGCGCTCG CCACCCTCCG GCGGGACAAC GCCGCACAGT TGCTCGAGCT GGAGCGCCAG CAGGAGGCAG CGGAGCAGGC GCTCGGCGCG CAGTGGGTCG CCAACGAGAC GGCGAACGGG CTCACCGCCC ACCCCACCGC CCGCAAGGCG GTCGAGTACG CGCTGGCCCA GCTCGGCGAT CCGTACCTGT GGGCGGCCGA GGGACCGGAC CGGTTCGACT GCTCCGGCCT GGTCTGGGCC GCCTACCGAT CGGCCGGCTA CCGCGCCCTG CCCCGGGTCT CCCGCGACCA GTACTACGCG ACCCGGAGTC GCACCGTGGC CCGGACCGGC CTCCTGCCCG GCGACCTACT CTTCTTCGCC TCCGGCTCGA GTTGGACGAG CATCCACCAC ATGGGCATGT ACATCGGCCG CGGGCGCATG GTGCACGCCC CGCGCAGTGG CGACGTGGTC AAGATCTCAA CTGTGACCTG GTCGCGCCTC TACGCCGCGA CGCGGGTGGT CAACGGGGTC CCCATCCCGA CCACTCCCAC GCCCACGCCC ACCGTGTCGG CCACTCCCAC ACCAACACCG TCGGCCACCC CGAAACCGAC GCCGCCTCCC TCGGCGACGC CGTCCCCCTC GGCCACGGGC ACCCCGTCGC CCACCACGAC TCCGACCAGC ACACCATCGC CCACCCCGTC CCCCACCAGC ACTCCCACCA CGACCCCGAC CGGCACACCA CCACCCGCCA CCTCTGCCCC GGCGTCGACC TCGGCAGCCC CCACCTCACC GGCGCCCACC ACGCCCACGG CGACCGGCGG CTCACCCCTG CCGTAG
|
Protein sequence | MGDSEYGRRP RRSPVISPVL RPKLWSALLG AIAAAALSAP AHADPSLPST VPDTGARPIV SGPLSLPGGA SLTPSSPPPV TNLVNGPLAA KIYAAEARVG QLSDELLLLK QQRTEAQTQL TAAEQDLNRA QAVLARAQER ADSAVADAIK AAAALPPAPF ATDLQDLNEI SGITRGEKVT GGETTAAARE FNRARTSEQV AQQAMAAAQA RVRSVDTAYS TTEQALRGEE AALATLRRDN AAQLLELERQ QEAAEQALGA QWVANETANG LTAHPTARKA VEYALAQLGD PYLWAAEGPD RFDCSGLVWA AYRSAGYRAL PRVSRDQYYA TRSRTVARTG LLPGDLLFFA SGSSWTSIHH MGMYIGRGRM VHAPRSGDVV KISTVTWSRL YAATRVVNGV PIPTTPTPTP TVSATPTPTP SATPKPTPPP SATPSPSATG TPSPTTTPTS TPSPTPSPTS TPTTTPTGTP PPATSAPAST SAAPTSPAPT TPTATGGSPL P
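The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, and the codon table is the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons (table 11 additionally permits alternative start codons, which does not matter here because the gene begins with ATG). Only the first three and the final blocks of the gene sequence are used as input.

```python
# Standard codon-to-amino-acid mapping in NCBI "TCAG" order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 6 bases of the gene sequence in the record above.
gene_start = "ATGGGCGACA GCGAGTACGG ACGACGACCT"
gene_end = "CCGTAG"

print(translate(gene_start))     # MGDSEYGRRP
print(translate(gene_end))       # P
print(gc_percent("ATGGGCGACA"))  # 60.0
```

The translated fragments match the first ten residues and the final residue of the protein sequence above. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.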