Gene Sare_3084 details


Gene Information

Locus tag: Sare_3084
Symbol:
ID: 5706819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3490450
End bp: 3492786
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 64%
IMG OID: 641272520
Product: hypothetical protein
Protein accession: YP_001537888
Protein GI: 159038635
COG category: [S] Function unknown
COG ID: [COG3472] Uncharacterized conserved protein
TIGRFAM ID:
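
The listed lengths follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3490450, 3492786)
print(length)                     # 2337
print(protein_length_aa(length))  # 778
```

Both values agree with the Gene Length and Protein Length fields of this record.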


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00108127
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCAAAGA CGTTTGAGAG TGGCGATGTT CCGCTGGCAG ACCTGCTGAA TCAAGCACGC 
CAGGGTGTGC TGCAGCTTCC TGACTTCCAA CGGGGATGGG TCTGGGACGA CGACCACATC
GTGAGCCTGC TCGCCTCGGT CTCCCGTTCG TTTCCCATCG GTGCGGTGAT GACGCTGGAG
ACCGGCAACG CTGAGGTCCG CTTTCGGCCC CGACCGTTGG AGGGAGTCCC GAGCGGCACA
GCGGCTACCA ACCCGAAGTA CCTGCTGCTC GATGGCCAGC AGCGGACTAC CTCGCTCTAC
CTGGCGCTTC GTTCTGGGGC ACCGGTCCGT ACCCGGGACA CCAGGAAGAA GGAGGTGGAG
CGTTGGTACT TCGCCGACAT CGACGCCTGC ATCGACCCGG ACGCCGACCG GACCGAGGCC
ATCCTGTCGC TCCCGGCTGA CAAGCGGCGG CTGAGCTCCC GAGGCGAGGT GCTGCTCGAC
GCCTCCACGA CCGAGGCGCA GGTGCGAGCC GGCAAGGCAG GTCTCTTCCC GCTCGACCTC
GTCCTCGATC AGAATGCCAC CCTTGACTGG CAATACGCCT ACCTGCAGGC GGGCCCGTCG
ATGGATCAGC AGCTCCAGAT CTGGAAGCAG TTTCAGGAGT CGCTGATCTC GCCCTTCCTG
CACTACAGCG TTCCTCGGAT CGCCCTCGAC CAGAAGACTT CCAAGGAGGC CGTTTGCCAG
GTCTTCGAGA AGGTGAACAC CGGTGGTGTC GAGCTGACGG TCTTCGAACT GCTGACGGCG
ACGTTCGCGG CGGAGAACTT CCGGTTGCGC GACGACTGGG AGCGGCGACA AGCGACCTGG
GCTGATGAGC CGCTGCTCGC CGACCTGGAC GCCACGACCT TCCTGCAGAT CGTGACACTG
CTCTGGACGC GAAGCCGTTG GGAGGAACGT ATGCGGGAGC GTGTTCGTGG TGACCGTGTC
CCGGCGGTCT CCGCGAAACG TCGGGAGATG CTGTCGCTGC CATTGAGCGG CTACCGGGGC
TGGGCGGATG CTGTTACCGA CACGCTGCAG CGGGTGGTGC GTTTCCTGCA CGGCGAGCGG
ATCTTCCGAA GCCGTGATCT CCCGTACAGC ACGCAGTTGG TTCCGTTGAC CGCGATCCTT
ACGCTGCTCG GGGAGGACGC CTTCACGCCA GGGCCCCGCG CGAAGCTTCG TCAGTGGTAC
TGGTCCGGGG TGTTCGGAGA GCTGTACGGC GGCACCACCG ACACCCGTTT CGCCAACGAC
CTGCAAGACG TGCTCGCCTG GATCCTGAAC GACGGCGAGG AACCCCGCAC CGTCCGCGAG
TCTCAGTTCC AGGCCGAGCG GTTGCTGGGC CTCCGGACCC GGAACAGCGC CGCCTACAAG
GGCCTGTACG CGCTCGCCAT GAAACGCGGC GGCCGAGATT TCCGGACCGG CGACACAATC
GACGCCAAGG CCTACGCGGC CGACTCCATC GACATCCGCC ACGTCTTCCC GCAGAAATGG
TGCGCGGCGA ACGGCATCGA CAGCAACTAC GCCAACTGCA TCGTGAACAA GACAGCCATC
GACGGGCAGA CCTGGGGATA CATCAGCAAC AACGCGCCAA GCCAGTACCT GGCCGCGATC
GAAAGTGACC TGCCGGTGAG CTCGCAGGAC CTGGACGCGA TCATCGCCAG TCACGACATC
GACCCCGTCG CACTGCGGCA GGACGACTTC CGTGCCGTCT TCGACGCCCG CTACGAGCGC
TTGATCCGAC AGATTGAGGA CGCGACGGGC CGGCCGGTGA ACCGAGGAGA CAGTCACGGC
AGTCCCTTCG CCACGCATCA GGGCGGAGCC GCGCTGGCCC GTAGCATCCA GGCGCTCATC
AGGGCCGGCG AGAGCAAGAT CGTCGAGTTC AACTCGACCG GGCGGAAGAA CCTTTCCACC
GGCCAGAAAG ACCGCGAGAT CGAGTGGGCG GTCACCAAGA CGATCGCCGG CTTCATGAAC
GGTCACGGCG GGACGCTGCT GGTCGGCGTT GAGGACGACG GAAAGGTCAT CGGTTTGGAA
GAAGACCTGA CGATCTTCAC CAAGAAGAAC ACCGACGCCT GGGAACAGTG GCTCACTCAT
CTGCTCATCC AAGATTTCGG CAAGGCCCCG ACGGCCAACG TGACCGTCAG GTTCGGCACC
ATCGAGGACC GGACGGTTGC CCGGATCGAT GTCGCGCTCA CCTCGGAGCC GGTGTATACA
CTTCGCACGA AGACCGGGAT GAAGGGGGCG GTCTTCCTCG TGCGCCTCAA CCACACGACG
CAGGAGGTCG CTGGGCCCGA GGCTTACGCC TACCAGCACA ACCGATGGTT CAAGTGA
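
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. A minimal sketch, assuming only plain-text input like the rows above; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence as it appears in this record.
first_row = "ATGGCAAAGA CGTTTGAGAG TGGCGATGTT CCGCTGGCAG ACCTGCTGAA TCAAGCACGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions over all rows of the gene sequence would give a value to compare against the GC content field.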
 
Protein sequence
MAKTFESGDV PLADLLNQAR QGVLQLPDFQ RGWVWDDDHI VSLLASVSRS FPIGAVMTLE 
TGNAEVRFRP RPLEGVPSGT AATNPKYLLL DGQQRTTSLY LALRSGAPVR TRDTRKKEVE
RWYFADIDAC IDPDADRTEA ILSLPADKRR LSSRGEVLLD ASTTEAQVRA GKAGLFPLDL
VLDQNATLDW QYAYLQAGPS MDQQLQIWKQ FQESLISPFL HYSVPRIALD QKTSKEAVCQ
VFEKVNTGGV ELTVFELLTA TFAAENFRLR DDWERRQATW ADEPLLADLD ATTFLQIVTL
LWTRSRWEER MRERVRGDRV PAVSAKRREM LSLPLSGYRG WADAVTDTLQ RVVRFLHGER
IFRSRDLPYS TQLVPLTAIL TLLGEDAFTP GPRAKLRQWY WSGVFGELYG GTTDTRFAND
LQDVLAWILN DGEEPRTVRE SQFQAERLLG LRTRNSAAYK GLYALAMKRG GRDFRTGDTI
DAKAYAADSI DIRHVFPQKW CAANGIDSNY ANCIVNKTAI DGQTWGYISN NAPSQYLAAI
ESDLPVSSQD LDAIIASHDI DPVALRQDDF RAVFDARYER LIRQIEDATG RPVNRGDSHG
SPFATHQGGA ALARSIQALI RAGESKIVEF NSTGRKNLST GQKDREIEWA VTKTIAGFMN
GHGGTLLVGV EDDGKVIGLE EDLTIFTKKN TDAWEQWLTH LLIQDFGKAP TANVTVRFGT
IEDRTVARID VALTSEPVYT LRTKTGMKGA VFLVRLNHTT QEVAGPEAYA YQHNRWFK
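
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds a codon table from the standard NCBI base ordering (T, C, A, G); translation table 11 uses the same codon-to-residue assignments as the standard code and differs only in its permitted start codons. The `translate` helper is illustrative, not a database tool.

```python
from itertools import product

BASES = "TCAG"
# Residues in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCAAAGACGTTTGAGAGTGGCGATGTT"))  # MAKTFESGDV
# Last 12 bases, ending in the TGA stop codon.
print(translate("TGGTTCAAGTGA"))  # WFK
```

The two outputs match the first ten and last three residues of the protein sequence above.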