Gene Sare_0056 details


Gene Information

Locus tag: Sare_0056
Symbol:
ID: 5707257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 65444
End bp: 67114
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 72%
IMG OID: 641269582
Product: sortase family protein
Protein accession: YP_001534983
Protein GI: 159035730
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3764] Sortase (surface protein transpeptidase)
TIGRFAM ID: [TIGR01076] LPXTG-site transpeptidase (sortase) family protein
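
The start, end, gene length, and protein length fields agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon (the gene sequence in this record ends in TGA). A minimal Python sketch of that arithmetic follows; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record; names are hypothetical.
start_bp = 65444
end_bp = 67114

# Inclusive, 1-based coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp;", protein_length, "aa")
```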


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.761592
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCAGGG CGCCAGACGG CTGGCAACAC GATCGGGGCG ACGACCCGAC GGCCTTCATC 
CCGAAGGTGG AGCACCCCTG GCAACAGCCC ACGCACGACC CGGAGCAGCC GTCAGCTCAG
CTGGATCCGC CACCCCAGTC AGGCTCGCCG AGCCACCCAC CGCGGTCAAC CGCCGCATCA
GCGCGCCCGG CCTCCCCGCC AGGCGCACCG CCACCACGCT CGGCGCACCA GCCGACCGGG
CACCGACCCG ACCTGCCGCC ACCGTCGGGA CCACCGCCAC CGCCGGGTCC GCCAGCACCA
CCGGCACACG CGACCACCCC TACCTACACC CACCCGGCCG GCCCGACGTT CGACACTCGG
CGCCCACCTG CCTCCGAAAA ACCGCCGGCA CCCCACCAGC CGCCCGCGGC CCACCGCGAT
CTCGGGCCCG AACCCCCATC CCACACCCCG GGAACGGCCA CCCCGACACC GTCATCACAC
GATCGCCGCC ACCACTCGGG CTCGGAACCG CGGATCGACC CGGCGACGGT CGACCCGGCC
CCGCCCACGT GGGACCACAC CCCACAGACG GCTGACGCTC CGACCGCGCT GATCCCCGCG
ACACCGCTGC GGAGCGCCAC CCCGGGTGGC AGCACGGTCA CCGGCACCAA CCAGGGCACC
GCCGCCCCCA CCGATCCGAC TGGCACCACG ACCGACGCCG CGCCGCCGCC CTCCAGGCCC
ACGGCGACGA CATCGCCGGC CAGCCCGACC GACCCGGCAG CCACCGCACT CATCCCGCGT
GTCGTGAGCC GGACACCCCC CACCCACGAC TCCACCGCCC TGATGGGCGC TGTGCCCCGA
ACGTCCCAGG GCGCGGAGCA CATGGCCGAC GCCACCGCGC CGGAGACGGC CGAGCCGCGC
CGGGGCGAGC GGGTTGTGAA GCTGCGCCCC GAGCAGTCCG CCGATGGTTA CAAGAGCGTC
TACTCCGAGC TGACCCGACC GACGTTCACA TCCCGGCTAC GCATCGGCGT CCGGGCGGCC
GGGGAGATAC TCATCACGTT CGGCATGGTG GTCCTCCTTT TCGCAGGGTA CGAGATCTGG
GGAAAGTCGG TGATCGTCAA CGCCCACCAG GACGACCTCA GCCAGCAGCT CGCCGAGGCG
TGGGGCCCCA CCGGCGACCC GACCGTCACG CCCTCCGCCA CTTCCTCCTC CGCCCCCGTC
CCCGCCCCGC AGACCGTCCA GGGCACCCCG ATCGCCGGGC TCTACATCCC CAGGCTGAAC
AAGAGTTGGA TCGTGGTCGA GGGGATAACC CAGAAGGACC TCCGGTACGC CCCGGGCCAC
TACCCGACAA GCGCCCTCCC CGGCCAGCTC GGCAACTTCT CCATCGCCGG GCACCGGAAC
CGGGCCACCT TCTGGCGACT TGACGAGCTG GACGAGGGCG ATGTGATCGT CGCCGAGGAT
CGAAACGAGT GGCACGTCTA CCGCGTCACC CGAAACCACA TCGTCAGACC GAGCCAGGTG
GAGGTGGTGG CGCCGGTTCC CGGCGAGCCG GACGCGAAAC CGACCACCGC AATGCTCACC
CTGACCACCT GCCACCCCAA GTTCGACAAC TACGAACGCC TGATCATCCA CGCGAAGCTG
GACCGGACAC AGGCCAAGTC GGCGGGTCGT CCAGCGGAAC TGGGGGGCTG A
 
Protein sequence
MTRAPDGWQH DRGDDPTAFI PKVEHPWQQP THDPEQPSAQ LDPPPQSGSP SHPPRSTAAS 
ARPASPPGAP PPRSAHQPTG HRPDLPPPSG PPPPPGPPAP PAHATTPTYT HPAGPTFDTR
RPPASEKPPA PHQPPAAHRD LGPEPPSHTP GTATPTPSSH DRRHHSGSEP RIDPATVDPA
PPTWDHTPQT ADAPTALIPA TPLRSATPGG STVTGTNQGT AAPTDPTGTT TDAAPPPSRP
TATTSPASPT DPAATALIPR VVSRTPPTHD STALMGAVPR TSQGAEHMAD ATAPETAEPR
RGERVVKLRP EQSADGYKSV YSELTRPTFT SRLRIGVRAA GEILITFGMV VLLFAGYEIW
GKSVIVNAHQ DDLSQQLAEA WGPTGDPTVT PSATSSSAPV PAPQTVQGTP IAGLYIPRLN
KSWIVVEGIT QKDLRYAPGH YPTSALPGQL GNFSIAGHRN RATFWRLDEL DEGDVIVAED
RNEWHVYRVT RNHIVRPSQV EVVAPVPGEP DAKPTTAMLT LTTCHPKFDN YERLIIHAKL
DRTQAKSAGR PAELGG
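
The relationship between the gene sequence and the protein sequence above can be spot-checked with a short translation routine. The sketch below is only an illustration, not the database's own method: it builds a codon table from the amino acid assignments of translation table 11 (which are the same as those of the standard genetic code) and translates the first 30 and last 51 nucleotides of the gene sequence as listed in this record. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons, which table 11 permits, are not handled because this gene begins with ATG.

```python
# Codon table in the conventional T, C, A, G ordering.
# The amino acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-nt blocks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 51 nt of the gene sequence, copied from the record.
first_30 = "ATGACCAGGG CGCCAGACGG CTGGCAACAC"
last_51 = "GACCGGACAC AGGCCAAGTC GGCGGGTCGT CCAGCGGAAC TGGGGGGCTG A"

print(translate(first_30))
print(translate(last_51))
```

Both fragments start on a codon boundary (the last line of the gene sequence begins at nucleotide 1621), so the results can be compared directly with the first block and the final line of the protein sequence.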