Gene Information |
Locus tag | Sare_0056 |
Symbol | |
ID | 5707257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 65444 |
End bp | 67114 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641269582 |
Product | sortase family protein |
Protein accession | YP_001534983 |
Protein GI | 159035730 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3764] Sortase (surface protein transpeptidase) |
TIGRFAM ID | [TIGR01076] LPXTG-site transpeptidase (sortase) family protein |
|
|
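The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. Neither convention is stated in the record, and the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Sare_0056.
length_bp = gene_length(65444, 67114)
print(length_bp)                  # 1671, matching "Gene Length"
print(protein_length(length_bp))  # 556, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1671 bp and protein length of 556 aa. The strand field does not affect the length calculation.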
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.761592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGGG CGCCAGACGG CTGGCAACAC GATCGGGGCG ACGACCCGAC GGCCTTCATC CCGAAGGTGG AGCACCCCTG GCAACAGCCC ACGCACGACC CGGAGCAGCC GTCAGCTCAG CTGGATCCGC CACCCCAGTC AGGCTCGCCG AGCCACCCAC CGCGGTCAAC CGCCGCATCA GCGCGCCCGG CCTCCCCGCC AGGCGCACCG CCACCACGCT CGGCGCACCA GCCGACCGGG CACCGACCCG ACCTGCCGCC ACCGTCGGGA CCACCGCCAC CGCCGGGTCC GCCAGCACCA CCGGCACACG CGACCACCCC TACCTACACC CACCCGGCCG GCCCGACGTT CGACACTCGG CGCCCACCTG CCTCCGAAAA ACCGCCGGCA CCCCACCAGC CGCCCGCGGC CCACCGCGAT CTCGGGCCCG AACCCCCATC CCACACCCCG GGAACGGCCA CCCCGACACC GTCATCACAC GATCGCCGCC ACCACTCGGG CTCGGAACCG CGGATCGACC CGGCGACGGT CGACCCGGCC CCGCCCACGT GGGACCACAC CCCACAGACG GCTGACGCTC CGACCGCGCT GATCCCCGCG ACACCGCTGC GGAGCGCCAC CCCGGGTGGC AGCACGGTCA CCGGCACCAA CCAGGGCACC GCCGCCCCCA CCGATCCGAC TGGCACCACG ACCGACGCCG CGCCGCCGCC CTCCAGGCCC ACGGCGACGA CATCGCCGGC CAGCCCGACC GACCCGGCAG CCACCGCACT CATCCCGCGT GTCGTGAGCC GGACACCCCC CACCCACGAC TCCACCGCCC TGATGGGCGC TGTGCCCCGA ACGTCCCAGG GCGCGGAGCA CATGGCCGAC GCCACCGCGC CGGAGACGGC CGAGCCGCGC CGGGGCGAGC GGGTTGTGAA GCTGCGCCCC GAGCAGTCCG CCGATGGTTA CAAGAGCGTC TACTCCGAGC TGACCCGACC GACGTTCACA TCCCGGCTAC GCATCGGCGT CCGGGCGGCC GGGGAGATAC TCATCACGTT CGGCATGGTG GTCCTCCTTT TCGCAGGGTA CGAGATCTGG GGAAAGTCGG TGATCGTCAA CGCCCACCAG GACGACCTCA GCCAGCAGCT CGCCGAGGCG TGGGGCCCCA CCGGCGACCC GACCGTCACG CCCTCCGCCA CTTCCTCCTC CGCCCCCGTC CCCGCCCCGC AGACCGTCCA GGGCACCCCG ATCGCCGGGC TCTACATCCC CAGGCTGAAC AAGAGTTGGA TCGTGGTCGA GGGGATAACC CAGAAGGACC TCCGGTACGC CCCGGGCCAC TACCCGACAA GCGCCCTCCC CGGCCAGCTC GGCAACTTCT CCATCGCCGG GCACCGGAAC CGGGCCACCT TCTGGCGACT TGACGAGCTG GACGAGGGCG ATGTGATCGT CGCCGAGGAT CGAAACGAGT GGCACGTCTA CCGCGTCACC CGAAACCACA TCGTCAGACC GAGCCAGGTG GAGGTGGTGG CGCCGGTTCC CGGCGAGCCG GACGCGAAAC CGACCACCGC AATGCTCACC CTGACCACCT GCCACCCCAA GTTCGACAAC TACGAACGCC TGATCATCCA CGCGAAGCTG GACCGGACAC AGGCCAAGTC GGCGGGTCGT CCAGCGGAAC TGGGGGGCTG A
|
Protein sequence | MTRAPDGWQH DRGDDPTAFI PKVEHPWQQP THDPEQPSAQ LDPPPQSGSP SHPPRSTAAS ARPASPPGAP PPRSAHQPTG HRPDLPPPSG PPPPPGPPAP PAHATTPTYT HPAGPTFDTR RPPASEKPPA PHQPPAAHRD LGPEPPSHTP GTATPTPSSH DRRHHSGSEP RIDPATVDPA PPTWDHTPQT ADAPTALIPA TPLRSATPGG STVTGTNQGT AAPTDPTGTT TDAAPPPSRP TATTSPASPT DPAATALIPR VVSRTPPTHD STALMGAVPR TSQGAEHMAD ATAPETAEPR RGERVVKLRP EQSADGYKSV YSELTRPTFT SRLRIGVRAA GEILITFGMV VLLFAGYEIW GKSVIVNAHQ DDLSQQLAEA WGPTGDPTVT PSATSSSAPV PAPQTVQGTP IAGLYIPRLN KSWIVVEGIT QKDLRYAPGH YPTSALPGQL GNFSIAGHRN RATFWRLDEL DEGDVIVAED RNEWHVYRVT RNHIVRPSQV EVVAPVPGEP DAKPTTAMLT LTTCHPKFDN YERLIIHAKL DRTQAKSAGR PAELGG
|
| |
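The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, assuming the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and that the spaces are only display grouping in blocks of ten. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, so the standard codon assignments are used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping and differs only in allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two display blocks of the gene sequence from the record.
prefix = "ATGACCAGGG CGCCAGACGG"
print(translate(prefix))  # MTRAPD, the first six residues of the protein
```

To check the whole record, paste the full gene sequence into `translate` and compare the result with the protein sequence after passing it through `clean`. Calling `gc_fraction` on the full gene sequence gives a value to compare with the listed GC content of 72%.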