Gene Csal_0014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0014 
Symbol 
ID4027333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp17225 
End bp19072 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content65% 
IMG OID637965166 
Productsulfatase 
Protein accessionYP_572078 
Protein GI92112150 
COG category[R] General function prediction only 
COG ID[COG3083] Predicted hydrolase of alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGATA CTCTGAGGCG ACGCTGGCGC GGCACGCTGG CGTTCGCGCT GTTGCAGTTG 
CCCCTCATCT GGCTGGTGGC CCTGCGCTAC ACGCCTTATC TCGCCGTGCC CGACGACCCC
ATGGGCGTGG CTTACCTGGT CCTGACCTGG ATCGGCCACT TCGGCATGCT GGCATTGCTC
GGCTGGCTGC CACTGGGCGT ACTGGCCCTG CTGCTGAAGG CGCGCTGGCT ATGGCTGCCG
GCGGCGCTGC TGGGCGCGCT CGGCCTGTGT GCGCTGTTGC TGGATACCGT GGTCTATGCC
CAGTACCGCT TCCATGTGAA CTACTTCATG GTCTCGCTGT TCTTGAACGA CGAGAATGGC
GAGATCTTCA GCTTCACGAC CTCGACCTGG CTGGTGGTGA TCGGCTGCGT GCTGCTGGCC
CTGACCCTGG AAGGCTGGCT CGCCCAGCGC CTGATCGCGG GGGGACGAGG CCGTCGCCTG
CCGGTCGGCT CGGCGTGCGG CGTGGTGCTG CTGGCCCTGC TGGGCAGCCA CGCGCTGCAT
ATCGTCGCCG ACGCCCGCTA CATGCGCAGC GTGACCCAGC AGGTCGGCGT CTATCCGCTG
CTGTTTCCCA CCACTGCCAA GGACTTCATG GAGGAACACG GCTGGCTCGA TCCCCGCGCC
GCCCGGGCCG CGAACGCCGA TATCGAGGCC CGGCAGGCAC AGAATCTCGA TTGGCCCAAG
AACCCGCTGA GCTGTCAGGC CATGCAGCCG CCCAATGTGC TGGTGGTGCT GATCGACTCC
TGGCGCGCCG ACGAATACGG ACCGAAGAAC ACGCCCAACC TGCATGCCGC ACTGAACGAG
AGCGGCCGGC GCTATCTGAA TCACTACAGC GGCGGCAACG CGACCCGCAA CGGCACCATG
AGCCTGTTCT ACGGCCTGAC CGGCAACTAC TACGCCTATC TGAACGACTC CCAGACGCCC
CCGCTGCTGC TGACGCAGTT GCAGAAGCAG GATTACGCGC TGGGCATCTT CTCGTCCGCC
AGCCTCGGCA GTGTCGGCTT CGACCGCACG ATCTTCTCGT CGATCGAGTC ACTGCGGATG
GACACCCAGG GAGACTCGCC CGCGGACCGG GACCGGCAGA TGACCGAGGA CTGGATGCAC
TGGCTCGGCC GGCAGGAACG GCAGGACGCT ACCCCGTGGT TCGGCATGCT GTTCTACGAT
GCGCCCCACG GCTATGACGT CCCGGCCGAC GCCGCCCAGC CCTTCCAGCC GTCGGTACAG
AACATGGACT ATCTCGAGCT GGGTCCCGAG ACCGATCCCC TGCCGTACTT CAACCGTCAT
CGCAACGCCG TGCATTACGA CGACGTCCTG CTCGGCAAGA CCATCGACGA CCTGAAAGCC
AAGGGCGAAT GGGACGAGAC CCTGCTGGTG GTCACATCCG ACCATGGCCA GTCATTCGAT
GATTTCGACA AGAACTATTG GGGCCACAAC GGCCACTTCG CCTCGCCGCA GACCCGTGTG
CCGATGCTCG TCAACGGCCC CGGCGTCGAG CCGGGCGAGG TCACGGGCAT GACCAGCCAC
CTCGACGTGG CCCCCATGCT GATGCGCCAC GCCCTGGGGT GCAGCAACCC GCTCTCCGAC
TATGCCATGG GCGAGGACCT GCTGAAGCCC GGCATCGACC ATCCCTGGGT GCAATCCAGC
AGCTACATCG ACTACGGCAT CATCGAGCCG AACCGGATCA CGGTGGTCGA TGGCACCGGT
CAGTGGGAGA TCGTCGACCG CCAGCTCGAT CCGATCGAAG GCGCCGAATT CTCGCCGGCG
GTGTTCGACG CGATGCAGTG GTTCCGCCGC TTCTATCGCC AGGGCTGA
 
Protein sequence
MQDTLRRRWR GTLAFALLQL PLIWLVALRY TPYLAVPDDP MGVAYLVLTW IGHFGMLALL 
GWLPLGVLAL LLKARWLWLP AALLGALGLC ALLLDTVVYA QYRFHVNYFM VSLFLNDENG
EIFSFTTSTW LVVIGCVLLA LTLEGWLAQR LIAGGRGRRL PVGSACGVVL LALLGSHALH
IVADARYMRS VTQQVGVYPL LFPTTAKDFM EEHGWLDPRA ARAANADIEA RQAQNLDWPK
NPLSCQAMQP PNVLVVLIDS WRADEYGPKN TPNLHAALNE SGRRYLNHYS GGNATRNGTM
SLFYGLTGNY YAYLNDSQTP PLLLTQLQKQ DYALGIFSSA SLGSVGFDRT IFSSIESLRM
DTQGDSPADR DRQMTEDWMH WLGRQERQDA TPWFGMLFYD APHGYDVPAD AAQPFQPSVQ
NMDYLELGPE TDPLPYFNRH RNAVHYDDVL LGKTIDDLKA KGEWDETLLV VTSDHGQSFD
DFDKNYWGHN GHFASPQTRV PMLVNGPGVE PGEVTGMTSH LDVAPMLMRH ALGCSNPLSD
YAMGEDLLKP GIDHPWVQSS SYIDYGIIEP NRITVVDGTG QWEIVDRQLD PIEGAEFSPA
VFDAMQWFRR FYRQG