Gene Information |
Locus tag | Sare_2283 |
Symbol | |
ID | 5706042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2621371 |
End bp | 2623452 |
Gene Length | 2082 bp |
Protein Length | 693 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641271761 |
Product | sulfatase |
Protein accession | YP_001537132 |
Protein GI | 159037879 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
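The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2621371, 2623452)
print(length)                  # 2082
print(protein_length(length))  # 693
```

Both results match the Gene Length (2082 bp) and Protein Length (693 aa) fields of this record.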
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00157811 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGCTGATC AGCCCGCCAC GCCCGAACCG GCACCACCCG AGGACGGCGG TACCCAGGTG GGTCGATCGG CCGGACCAGT GGCTCCCGAC GGTGGGTCAC GGCGTGGCTG GCGAGCGGAG GGCGGCCGGC TGCTGGAGAT CACCGCACTG CTCGGGCTCG CGGTCACTCA GCCGTTGCTG GACGTGCTCG GCCGCAGTCC GGACTTCTTC CTGTTCCATC GTGCCGGCCG GGGTGAGATT CTGCAGTTGG TCGCACTGGT GGCGATCGTG CCGACCGTCG CGGTCGGTCT GGTCGCGGCG GCATCGCGGT TGGCCGGCCG CACCGCCCGG AAACTGACCC ACGCACTGCT CGTGGGTCTC CTGCTCACCG CACTGGCCGT GCAGGTCGGT CGACACACGA CACCAGTGCG GGGCCTACCG CTGTTGGTGC TGGCGGTTGT CGTCGCGGCG GCCGGGGTGG CCGCCTACCG GCGTTGGCGC GCCCTGGGGC GGGTGCTGCG GGTCGCGGCG GTCGGGCCGG CGGTCTTCGT CGTGTTGTTC CTGGTTGCCT CCCCGACCTC GACCGTGGTG TTGCCGCGCG GGGACGGTGG TGCCGCCGGG TTGGCCCGCG CCGGCGGTCA CCCACCAGTG GTCCTGCTGG TCCTCGACGA GCTGCCCCTG GTTTCCCTGC TGGCCCCGAA CGGTCGGATC GACGCAGCTC GGTTCCCGCA CTTCGCGGAG CTGGCCGCCG GCTCGACCTG GTACCGCAAC GCGACCGGGG TCAGCGGCTG GACACCGTAC GCGCTGCCGG CAATGCTGAC CGGCCGCTAT CCGGCCACCG GGGCGGCCCC ACACTACTCG CAGCACCCGG ACAACCTGTT CACCGCGTTC GGCGGCCTGT ACGACATTCG TGCCGAGGAG AGCATCACCC GCCTCTGCCC GCCCAGCCGC TGCGACACAC CGCCGGACCG GGAGCAGGGG ATGGGGGTGC TGGTACGGGA GAGCACGAAA CTGCTGGCCC GGCTCTCCGC GCCGGCGGAC AGCCGGGTCG ATCCCGCCGA CTCGTACCGG GAGCGGACCG CCGCCGAGGC GGGCATCGAC GCCGCCGAGC CCATTCCGGA CGATCCGAAG TTCCGCTGGG ACCGGTTGAA CGCCAACCAG CCGGCCCGGT TCAGCAGTTT CCTCGCCGGG CTCCGGCCGT CTGACCGCCC AACGCTGCAC TTCCTGCACC TGCTGATGCC GCATTCGCCG TGGGCGTACC TGCCCTCGGG CGTGCGCTAC GAGGCACCCG AGGACTTCCC GAACGAGGGG GAGGGCTGGG TGGAGTTGGC CCGCCAGCGG CACCTGGCCC AACTCGGGTA CACCGACCGG CTGATCGGCG AAACTCTGCG TACGCTGCGC GCCACCGGAC TGTACGACGA TGCCCTGCTG GCGGTCACCG CCGACCACGG GGTGAGCTTC ACCAAGGGGG CGCAGGGGCG GGGGATGGGC GCCATCGAGG CCGCCGCCGA CGAGGTGGCC TGGGTGCCGC TGTTTGTCAA GTACCCCGGG CAGCGTACCG GCCGGCTCGA CGACCGGAAC TGGCAGCATG TCGACCTGCT GCCCACCCTT GCCGACGAGG CGGCGATCCG GCTGCCCTGG TCGGTCGACG GCCAGTCGGC GCGGGAGGCG CCCCGGGCCG AGGCGGGCAA GGTCTTCTAT GACCGGCCCG CCCAGCCGAC TCCGATCAAC GGTGGGGTTC CCGCCGCGAT ACCGCCCGCC GCGCCGCATC CGCTGGTCGG TACCACCGTG CCGGACCAGC CGGTGGCAGG CTCGGCCCGG 
GTCGGGAACC TGGCCGCCTT TCGCGAGGTG GACCCGGACC GCGGCTCGCT GCCCGCGTTG GTCTGGGGTG ATCTGCCCGA CGACATCCCC GACGGCACCC CGCTGGCGGT CGCCGTCAAC GACCGGGTCG CCGTTGTGGT GCCGGTGGTT CCCCGGGACG AGGGCGGGCG CCGGTTCGCG GCCCTGATTG CCGACGACCG ACTCTTCCGG TCCGGGGTCA ACCGCCTCGG CCTGTTCCTC GTCTCCGCCG ATGGCACGCT GAACCGGCTC GCGCTCTCCT GA
|
Protein sequence | MADQPATPEP APPEDGGTQV GRSAGPVAPD GGSRRGWRAE GGRLLEITAL LGLAVTQPLL DVLGRSPDFF LFHRAGRGEI LQLVALVAIV PTVAVGLVAA ASRLAGRTAR KLTHALLVGL LLTALAVQVG RHTTPVRGLP LLVLAVVVAA AGVAAYRRWR ALGRVLRVAA VGPAVFVVLF LVASPTSTVV LPRGDGGAAG LARAGGHPPV VLLVLDELPL VSLLAPNGRI DAARFPHFAE LAAGSTWYRN ATGVSGWTPY ALPAMLTGRY PATGAAPHYS QHPDNLFTAF GGLYDIRAEE SITRLCPPSR CDTPPDREQG MGVLVRESTK LLARLSAPAD SRVDPADSYR ERTAAEAGID AAEPIPDDPK FRWDRLNANQ PARFSSFLAG LRPSDRPTLH FLHLLMPHSP WAYLPSGVRY EAPEDFPNEG EGWVELARQR HLAQLGYTDR LIGETLRTLR ATGLYDDALL AVTADHGVSF TKGAQGRGMG AIEAAADEVA WVPLFVKYPG QRTGRLDDRN WQHVDLLPTL ADEAAIRLPW SVDGQSAREA PRAEAGKVFY DRPAQPTPIN GGVPAAIPPA APHPLVGTTV PDQPVAGSAR VGNLAAFREV DPDRGSLPAL VWGDLPDDIP DGTPLAVAVN DRVAVVVPVV PRDEGGRRFA ALIADDRLFR SGVNRLGLFL VSADGTLNRL ALS
|
| |
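The gene sequence begins with GTG while the protein begins with M, which follows from translation table 11: GTG is one of the alternative start codons in that table, and an initiator codon is read as methionine. The sketch below illustrates this on the first and last few codons of the record. The helper names (`translate`, `gc_content`) are hypothetical, the start-codon set is the one listed for NCBI table 11, and the input is assumed to be an in-frame, unspliced CDS on the forward strand.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in NCBI TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Start codons listed for table 11; each is read as Met when it initiates.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("GTGGCTGATC AGCCCGCCAC GCCCGAACCG"))  # MADQPATPEP
```

Applying `translate` to the full gene sequence should give the protein sequence shown in the record, and `gc_content` gives the fraction that the GC content field reports as a rounded percentage.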