Gene Sare_4835 details


Gene Information

Locus tag: Sare_4835
Symbol:
ID: 5707740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5479626
End bp: 5483132
Gene Length: 3507 bp
Protein Length: 1168 aa
Translation table: 11
GC content: 71%
IMG OID: 641274231
Product: hypothetical protein
Protein accession: YP_001539576
Protein GI: 159040323
COG category:
COG ID:
TIGRFAM ID:
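The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5479626, 5483132)
print(length, protein_length_aa(length))  # 3507 1168
```

Both values match the Gene Length and Protein Length fields of this record.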


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.391306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000518688
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCCGGAAG AGAAAGACGA GTGGTCGCAT GTCGCCGAGG AGGAGGCGAC GTTCGTCCCG 
CCCACCATCG ACACGAGTGA CCTCCGCGAC TACCACACGT GGGCTCAGGA GGAGCGAGGG
TGGCGCAAGC TACTCGCCGC GGTCAACGGT GGGGCGGCTA TCGCCACAGC TGATGGAGAA
GCGTTGGCCG CGCGCCTGGT CAGTCCTCAG TCGGTGATGG ATGCCGGCGT AGCATTCCAG
CATGCCTACG ACACCCTGAA CTGGCTGGAA CAGTTCGTGC GGGACCAGTC ACAGGCCATC
GCCGGGGAGG ACCGGGCGTG GCAGGGCGAG GCGGCCGATG CCTTCCTGGC AAAGATGACC
TTTTTTGCCG ACTATATTGG CGCGCAGGCG GAGCGGATCG TCGGCGGCGA CGGCACCGGT
GGTGTCAACT CCGTCCCGAA CCAGCTCTAC CAGAGCGCGA ACTACCTCGC CTGGGCGCAG
AGCACCATGA AGTACCTCGA CCGGTCCTGG GCGACAATCG CTGCGCAAAA CGGTGTCGGC
AGCAATGACC ACGACATGGT GGCCATCAGC GGCACCAAGT TCGAGAAGCC GATGACCGAG
CAGATGCTTC AGGTTGTTGA CACGCTCGCG ACCCAGTACG ACTTCACCTA CAGCTCGGTC
ACCCCGCCCG ATGGCGACGG AGAACCGAAG ATCAGCACGC CGGAAGTGAC GCCCCCGGAT
TTGTCGGTGC CGGAAATCTC AACGCCGGAG GTGTCGATGC CCGAGATCTC TACGCCGGAG
GTGTCGATGC CGGAGATATC GACACCGGAC GTGTCAACCC CAGATTTGTC TACGCCGGAC
GTGTCGCCGC CGGGCGAAAT CCCGTCACCC GAGGTGTCGT CGCCGGGCGA ACTTCCGCCA
CCCGGGGTGA CGCCGCCGGG CGAGATCCCG CCACCCGGCA TGACGCCGCC AGGAGACGGC
CCCTCCCTCC TCAACGGCGG TCCAGGCCTG GAGGACCCGA ATCTTGGTGG CATGCCCGGG
CTGGACGACC CGAGCGGTGG TGCCGGTGGG GGCGCCGGTG ACCAGGTGGT GCCGCCCATC
GTCCCGCCTC TTCCCGGGCT AGGTAACGGC AGTGGAGGGC CCGGGGACAC AGGTTCATCG
AATCCACAGG CGCCGAGCGT GCCGGGCATC GACGCTGGCG AGGGTCTCGG CGTGCCACCT
CCGACGGGGG TATTCACCCC GGCCCCCGTC CCGAGACCCG GCAGTGGCAG TGGCGGTGGC
GGTGGCGGTG GCACGCCCAG CATTCCGCCA CCACCCAGCG CCGGTGATCT GGTCGACGGC
GACCAAGACG ACTGGACCGG TGGGGCTGCC GGAGACGTAC CCCTGCCAAA CCCACCAGGT
GACACGACGA TGCCCGGCGC CGACAGCGAC GTCCCGCTAC CGACCGCGCC ACCATTGGGC
GCCGACAGCG ACGTCCCCCT TCCGACCGCG CCACCACTCG GCGCCGGCGG GACGGGCGAC
GCCCCCACCG CGCCGATGGT GCCGGGCATG CCGATGATGC CGGGAGCGGG TGGGGTTCCA
GGGGCCGGCG GTGCCGGCGG ACCGGAGAAG CCCGACGCCA ACGGACTGAT CGAAGGCCCA
CCCGACGACT GGCAACCACC AGCCTTCACC GGCGTGGACA TCCCAGGCGC ACCCGACGGC
ACCTCACCAG GTGGCGCCGG CCTGGACGAC ACGACACCAG GTGCCGGCCT CAACGACACG
ACACCACCCA CAACCAACGA CACCACCACA CCCACCGTGC CAGGCGCACC AATGATGCCC
GGCATGCCAG GACCAGGTGG CACACCAGGC GCCGCCAGCG GCCCCGGCGG ACCCGAGAAA
CCCGACGCCA ACGGACTGAT CGAAGGCCCA CCCGACGACT GGCAACCACC AGCCTTCACC
GGCGTGGACA TCCCAGGCGC ACCCGACGGC ACCTCACCAG GTGGCGCCGG CCTGGACGAC
ACGACACCAG GTGCCGGCCT CAACGACACG ACACCACCCA CAACCAACGA CACCACCACA
CCCACCGTGC CAGGCGCACC AATGATGCCC GGCATGCCAG GACCAGGTGG CACACCAGGC
GCCGCCGGCG GCCCCGGCGG ACCCGAGAAA CCCGACGCCA AGGGACTGAT CGAAGGCGGT
ACGGACGCGT GGGAGCCGTC CGGCGTCGGC GATGTCGGCG CGCTGGGAGC TCCGCACGGC
ACGGCAGCGG GTGGCGCGGG GCTCGACGTC CCGCAGCAGA ACGGAGGCTC CACCAACCCC
CTGTGGGACG TGGCGCCCGC AGGCGGAACG GCCACCAACA CGCCGATTCT GCCAGGCATG
CCCGGCAGCG TGGCCTCCCC CGGTGACACC AACACCCCTG ACCATCCGGG TGCGTCCACG
CTGGTCGCGG CAGACGACAC CAGCTGGCAG CCGTCGACAC CCGACGGTAC GGGGCCGGAT
GCGCCGGACG GCGCGTCCGC CGGCGGCGTC GGCCTGACCC CGTCCACTCC CCCGATCACC
GAGGTGGCCG TGCCGCCGGT GAGCCAGGGT TCCCCCGAGC CACCGATACC GGTCCCCGGC
GTGCTGCCGA GCCCGGGGCG AGCCCCGGCC GGCGATGCCC CGAGGTATCC GGGGCCTACG
CGGGCCACGC CGGTCGACGC GGCGCCAACG CCCCCGGAGG AAGGGCCAGA GTCCCCGACC
CCCTCGGCGA CGGACTCGGG TGACCCGTGC GACGCCACCG ATCCGCGCTT GTCGGCTGAC
CCCTCCGGTG GTGATGTCGC ACGGCCGACC GCGCCGCCGA CAGTCCCGGA ACCCGTCGCG
CCGCCAACGG CCGTGAGTCT GCCGACGGGC TTCCCCGCGG CCGGCGTGCC GTTCCCCGAG
GCTCCGGCCG CGGACGCCCT GGACAGGCCG AGGTCGCAGA AGGACATCTC GCTGGTGGCC
AGCGTTCCGG ACTCGCCCCA CCAGCACCGC AGGGCTGCGG CGCCCCCGGC GGACGCCGAA
CCCGACGGCG GTTCAGCTAC CGCTGACGGA AGTGCCGCCC CCACCGCGGC GACTGCCGAT
GATGCGCCCC CGGGTCTCGT ACCCGAGGCC ACCGCCGATC GGGAACCGAG CACGCCGGTG
GCGGTGGTCG GTGCGTCCGC CGGCACCGGG ACCGCGGCGA CCCGCCACGC CCGTCGGGTT
GAGGCCGCGG GGCCGGATGT CGGGGCGTCG GGGACGGTGG CTGGCGACCG GCCGTATCCC
GGTGTACACC CCGCACGCCG GTCCGAGCCG ACCGATGAAC GGGAACCGAT CGACCGACCC
GACGCCGCCG AACTTCTGCG CGACGAGCAC GACAGTTGGG GCGGGGACAC GCCCGATGCA
CAGCTGCCAC CCGGCGACGA CTACGTACCC ATGGTCCAAC CGGACAGCGG CGACGCGGAC
ACGTCCGAAT GGGACGACCT CGACGACTAC GCGTGGCTGA CCGAGGCGTA CGCCGACGAG
CAGGACGAGA GACTGACAGA TGCCTGA
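
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC content figure such as the 71% reported for this gene could be derived, the hypothetical helper below counts G and C bases in a whitespace-formatted string. It is demonstrated only on the first 10-base block of the sequence, not on the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


first_block = "ATGCCGGAAG"  # first 10 bases of the gene sequence
print(gc_fraction(first_block))  # 6 of these 10 bases are G or C
```

Applying the same function to the whole sequence, pasted as a single multi-line string, would give the gene-wide value.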
 
Protein sequence
MPEEKDEWSH VAEEEATFVP PTIDTSDLRD YHTWAQEERG WRKLLAAVNG GAAIATADGE 
ALAARLVSPQ SVMDAGVAFQ HAYDTLNWLE QFVRDQSQAI AGEDRAWQGE AADAFLAKMT
FFADYIGAQA ERIVGGDGTG GVNSVPNQLY QSANYLAWAQ STMKYLDRSW ATIAAQNGVG
SNDHDMVAIS GTKFEKPMTE QMLQVVDTLA TQYDFTYSSV TPPDGDGEPK ISTPEVTPPD
LSVPEISTPE VSMPEISTPE VSMPEISTPD VSTPDLSTPD VSPPGEIPSP EVSSPGELPP
PGVTPPGEIP PPGMTPPGDG PSLLNGGPGL EDPNLGGMPG LDDPSGGAGG GAGDQVVPPI
VPPLPGLGNG SGGPGDTGSS NPQAPSVPGI DAGEGLGVPP PTGVFTPAPV PRPGSGSGGG
GGGGTPSIPP PPSAGDLVDG DQDDWTGGAA GDVPLPNPPG DTTMPGADSD VPLPTAPPLG
ADSDVPLPTA PPLGAGGTGD APTAPMVPGM PMMPGAGGVP GAGGAGGPEK PDANGLIEGP
PDDWQPPAFT GVDIPGAPDG TSPGGAGLDD TTPGAGLNDT TPPTTNDTTT PTVPGAPMMP
GMPGPGGTPG AASGPGGPEK PDANGLIEGP PDDWQPPAFT GVDIPGAPDG TSPGGAGLDD
TTPGAGLNDT TPPTTNDTTT PTVPGAPMMP GMPGPGGTPG AAGGPGGPEK PDAKGLIEGG
TDAWEPSGVG DVGALGAPHG TAAGGAGLDV PQQNGGSTNP LWDVAPAGGT ATNTPILPGM
PGSVASPGDT NTPDHPGAST LVAADDTSWQ PSTPDGTGPD APDGASAGGV GLTPSTPPIT
EVAVPPVSQG SPEPPIPVPG VLPSPGRAPA GDAPRYPGPT RATPVDAAPT PPEEGPESPT
PSATDSGDPC DATDPRLSAD PSGGDVARPT APPTVPEPVA PPTAVSLPTG FPAAGVPFPE
APAADALDRP RSQKDISLVA SVPDSPHQHR RAAAPPADAE PDGGSATADG SAAPTAATAD
DAPPGLVPEA TADREPSTPV AVVGASAGTG TAATRHARRV EAAGPDVGAS GTVAGDRPYP
GVHPARRSEP TDEREPIDRP DAAELLRDEH DSWGGDTPDA QLPPGDDYVP MVQPDSGDAD
TSEWDDLDDY AWLTEAYADE QDERLTDA
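
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first and last lines of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in these assignments). The table is built from the conventional TCAG ordering, and the names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order: TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCCGGAAG AGAAAGACGA GTGGTCGCAT"))  # MPEEKDEWSH
print(translate("CAGGACGAGA GACTGACAGA TGCCTGA"))     # QDERLTDA
```

The two outputs match the first and last residues of the protein sequence listed above. Each full sequence line holds 60 bases, a multiple of 3, so every line starts in frame.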