Gene Sare_4502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4502 
Symbol 
ID5707023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5089030 
End bp5090916 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content69% 
IMG OID641273916 
Productpeptidase M14 carboxypeptidase A 
Protein accessionYP_001539265 
Protein GI159040012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.040263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTCC GCACCCCCAC CCTTCGCCGC CGCCTCGCGC TCACCATCGC CGCGGGGTTC 
GGCCTGCTCA CCTTCGCCGC GGCACCGGCG GCCGCGAAAC CCGTACCGGG CCCCACCGAT
GAGCAGGCGG TTGTCCAGTA CCGGGTGCTC GGCCCGCGTA CGGTGGCCGA CCGCAGCGCG
GTCGCCCGCA CCGGTGCCTC GATCGACTAC TCCGAGCACG GCGTTCTACA CGTCTCCGCC
ACCAGCACCG AGGCCACCGC GATCACCCGG CTGGGCTTCC AACTGGAGGA GGTCCCGGCG
CAGTCCGCCG ACCACCACGG CCACGCGCAC GCCGACGAAG ATGTCGGCCT GTTCGACTTC
CCGCCCGCCG ACTCCGACTA CCACAACTAC GCCGAGCTGA CAGCGGTGGT GAACCAGGTG
GTGGCGGACC ACCCGTCCAT CGCGCAAAAG ATCAGCATCG GCACCTCGTA CGAGGGTCGG
GACCTGATGG CGGTGAAGAT CTCCGACAAC GTCGGGACCG ACGAGGACGA ACCGGAGATC
CTCTTCAACT CGCAGCAGCA CGCCCGCGAG CACCTGACCG TGGAGATGGC GATCTACCTG
CTCCACCTCT TCACCGACAA CTACGGCAGT GACTCCCGGG TCACCAGCGT CGTCAACAGC
CGGGAACTGT GGATCGTGCC GACCGTCAAC CCAGACGGCA GCGAGTACGA CATCGCCACC
GGCTCGTACC GGTCGTGGCG CAAGAACCGG CAGCCGAACA GCGGTTCGTC GTGGGTCGGT
ACCGACCTGA ACCGCAACTG GGACTACCTG TGGGGCTGCT GCGGCGGTTC CTCCGGCTCC
CCCTCGTCGA ACACCTATCG GGGTCCGTCG GCCTTCTCCG CGCCGGAGAC CGACGCGGTG
CGCGACTTCG TCGACGGTCG CGTCGTGGAC GGAGTCCAGC AGATCAAGGC CAACATCGAC
TTCCACACCT ACTCCGAGCT GGTGCTCTGG CCGTTCGGCT ACACCTACAG CAACACCGCC
CCCGGAATGA CGGCCGACCA GTACAACACC TTCGCCACCA TCGGCCAGCA GATGGCGGCC
ACCAACGGCT ACACCCCGCA GCAGTCCAGC GACCTCTACA TCACCGACGG CAGCAGCATC
GACTGGATGT GGGGACAGCA CGGGATCTGG GCGTACACCT TCGAGCTGTA CCCCGGCTCC
GCCTCGGGCG GTGGCTTCTA CCCGCCCGAC GAGGTCATCC CGGCAGAGAC CGCCCGTAAC
CGCGACGCCG TGCTGCTTCT CTCCGAATAC GCCGACTGCC CGTACCGGGC GATCGACAAG
GAGGAGCAGT ACTGCGGTGA CGGTGGCGGG ACCACGGTCT GGGCGGACAA CTTCGAGACC
GCGACCGGCT GGACGATCGA CCCCAACGGC ACCGACACCG CCACCACCGG CCAGTGGGAG
CGGGGCGCCG CCCAGTCGAC CAGCTACTCC GGCGCCAAGC AGCTCACCCC GTACGCCGGC
AGCAACGACC TGGTCACCGG CCGGCTGGCC GGCTCCTCGG TGGGCTCACA CGACATCGAC
GGCGGCGTGA CCAGCGCCCG GTCCCCGGCG GTGTCCCTGC CGTCGAGCGG CACGCTGACC
CTGTCACTGG CCTGGTACCT GGCGCACTAC TCGAACGCGT CCTCCGCGGA CTACTTCCGC
GTCAGTGTCG TACACAGCGG CGGCACCACC ACCCTGCTCG ACCAGGCCGG CGCGGCAACC
AACCGCAGCG CCTCCTGGTC AGTGGCCAGC CTCGACCTGA CGCCGTACGC CGGCCAGTCG
ATCCAGATCC AGGTCGAAGC GGCAGACGCC GCCGGCGGCA GCCTCGTCGA GGCGGCAGTC
GACAACGTCA CCATCACCGC CTCCTGA
 
Protein sequence
MAFRTPTLRR RLALTIAAGF GLLTFAAAPA AAKPVPGPTD EQAVVQYRVL GPRTVADRSA 
VARTGASIDY SEHGVLHVSA TSTEATAITR LGFQLEEVPA QSADHHGHAH ADEDVGLFDF
PPADSDYHNY AELTAVVNQV VADHPSIAQK ISIGTSYEGR DLMAVKISDN VGTDEDEPEI
LFNSQQHARE HLTVEMAIYL LHLFTDNYGS DSRVTSVVNS RELWIVPTVN PDGSEYDIAT
GSYRSWRKNR QPNSGSSWVG TDLNRNWDYL WGCCGGSSGS PSSNTYRGPS AFSAPETDAV
RDFVDGRVVD GVQQIKANID FHTYSELVLW PFGYTYSNTA PGMTADQYNT FATIGQQMAA
TNGYTPQQSS DLYITDGSSI DWMWGQHGIW AYTFELYPGS ASGGGFYPPD EVIPAETARN
RDAVLLLSEY ADCPYRAIDK EEQYCGDGGG TTVWADNFET ATGWTIDPNG TDTATTGQWE
RGAAQSTSYS GAKQLTPYAG SNDLVTGRLA GSSVGSHDID GGVTSARSPA VSLPSSGTLT
LSLAWYLAHY SNASSADYFR VSVVHSGGTT TLLDQAGAAT NRSASWSVAS LDLTPYAGQS
IQIQVEAADA AGGSLVEAAV DNVTITAS