Gene Sare_4963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4963 
Symbol 
ID5706485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5637008 
End bp5638150 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content75% 
IMG OID641274358 
Productpeptidoglycan-binding LysM 
Protein accessionYP_001539700 
Protein GI159040447 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.867782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.121696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACAC CGGCTCGTGT CCTCGGACGG GTCCTCACCG GGTTCGGCGC GCTTGCCCTG 
CTCTGCGCGT TGCTGATCAG CGCCCCGATG GCGCTGCTCG CGTTCGCCGG TAACCCCCTG
CCGGCGCAGG TGCCCACCCT CGACGAGGTC GGTGCCATGC TGACCACCCG AGACGACGGT
CAGCTCTTCC TCCGAGCACT GGCCTTGGTC GGCTGGGCGG GCTGGGCCAC GTTCGCCCTG
TCGGTACTGG TCGAGCTGGG TGCTCTCGCC TGCCGGCGCC CCGCACCCCG GTTGCCGGGG
ATGAACCGGC AGCAACGGGC CGCCGCCGCG TTGGTCGGCT CCGTCACGTT GATCTTGGCA
GCCAGTCCCG TGGCGGCGAG CGCGGCGGCG GTGGCGGGGC CGCCGGCACC CGCTGCCACC
TCGGTCAGCG TGGCGCTACC ACCGTCCCCG GTGGAACGCC CAGTGCTGGT CGCGCTGCCC
GAGCCGGTGC GGCAGCAGGC GGTCAGCGCG GCGCCGAGCA CCGCGCGGAC CGCAGAACCG
GAGCGGGAAC CGGTCTACCG GGTGGCCCGG GGCGACCGCC TCGGATCGAT CGCCGCGCGG
TACCTGGACG GTTTCGACGA CTACCCGACC CTGGCCCGGC TGAACCGGTT GGCCGACCCG
GACCGCATCC ATCCAGGTCA GCTCCTGCGG CTACCCACCA GGGCCCAGGA CCGTGGTGCC
GGCCCCCACG CCACCGGGCG GCTGGTCGCG CGCCCGACCC CGCCCCGGCC GTCCGCACCG
GCAGCGGCGC CGGCCGGGCC GTCGACGCGA CCATCGGTTT CGGACACGGC GGTTTCGGAC
GCCTCAGTCT CGGATACATC GACGACGGGC GCGTCGGCGT CGAACCGGAC AGGCCCGGAC
CCGGTGCAGG ACGTGCCGAT CGTGGCCGTG GGCGCGGCTG GGCCGGGCGA CCCGAGCCGG
GTGAATCGGC CGCTCGCGGT GTCGGCGGTC CTCGCCGCGT CGGGCATCGT CGGCGCGCAG
ATCGGTGCGG TGCTCGGCCT GCGGCGGCGT CCGGCGACCG CTCGTGCCGG AACCGACCGT
AAGGCGGCGC CGACCGGCCG AGGGCAGTGG GAACTGCCCG CTGGCCGGCA CCGGCGGGAG
TGA
 
Protein sequence
MLTPARVLGR VLTGFGALAL LCALLISAPM ALLAFAGNPL PAQVPTLDEV GAMLTTRDDG 
QLFLRALALV GWAGWATFAL SVLVELGALA CRRPAPRLPG MNRQQRAAAA LVGSVTLILA
ASPVAASAAA VAGPPAPAAT SVSVALPPSP VERPVLVALP EPVRQQAVSA APSTARTAEP
EREPVYRVAR GDRLGSIAAR YLDGFDDYPT LARLNRLADP DRIHPGQLLR LPTRAQDRGA
GPHATGRLVA RPTPPRPSAP AAAPAGPSTR PSVSDTAVSD ASVSDTSTTG ASASNRTGPD
PVQDVPIVAV GAAGPGDPSR VNRPLAVSAV LAASGIVGAQ IGAVLGLRRR PATARAGTDR
KAAPTGRGQW ELPAGRHRRE