Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2674 |
Symbol | |
ID | 5900129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2906332 |
End bp | 2908011 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641563165 |
Product | sulfatase |
Protein accession | YP_001684299 |
Protein GI | 167646636 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.687796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCT GGACTTGGCT GGCCGGACTG GCGCTGATCG TCGCTGTCGC CCTGGGCTGG GCGGCCACGC ACAAGCAGGC CGTCTTCATG TGGATCGCCC ATATGCGCCT GCCGCATGTG GAGCCCAACC ATGCGGTGGC CTGGTCGGAG GGGCCGGAAG CCGCGCCGAG CGGGCCACGT CCGCCCAACG TGATCGTCAT CCTGGCGGAC GACATGGGTT TCAACGACAT CACCTTCAAC GGCGGCGGGG TGGCCGGCGG TCTCGTGCCG ACGCCCAATA TCGACTCCCT GGGCCATGAC GGGGTCAGCT TCGCCAACGG CTATGACGGC AACGCCACCT GCGCCCCGTC GCGCGCGACG ATCATGACCG GCCGCTACGC CACGCGCTTC GGCTTCGAGT TCACGCCCGC GCCGGTCGCC TTCGAGAAGA TGGTCGGCAG TGAAGGCGCG GCGGGCGATA TCGTTCTGCC CCGGTTCTAT CCCGATCGCC TCAAGGCCAT GCCGCCGGGC TCGACCGCGC CGACGCCCGA CGCGGTGAAT GAGCTGAGCA TGCCGGCCAG CGAGATCACC GTCGCCCAAC TTCTGAAGAC GCGCGGCTAC CACACCCTGC ATTTCGGCAA GTGGCATCTG GGCGGAAAGG CCGGATCGCG CCCGGAACAG AAGGGCTTCG ACGAAAGTCT GGGCTTCATC GCTGGCGGGT CCATGTACCT GCCCGAAGGC GACCCGGGCG TCGAGAACGC CAAGCAACCT TGGGATCCGA TCGATCGCTT CCTGTGGCCG AACCTTCCGT ATGCGGTGCA GTTCAACGGC TCGCCGATGT TCAGGCCGGG TGGCTACATG ACTGACTATC TGACCGATGA GGCCGTCAAG GCGGTCAGGG CCAACCGCAA CCGGCCGTTC TTCATGTATT TCGCGCCCAA TGCGATCCAC ACGCCGCTCC AGGCCACCAA GGCCGACTAC GACGCCCTTC CGGAAATCAA GGATCATCGC CTGCGCGTCT ATGGCGCGAT GGTGCGCAAC CTCGATCGCA ACGTCGGCCG GCTGCTGCAG GCGTTGAAGG AGGAGGGACT CGACCAGAAC ACGCTGGTGA TCTTCACCAG CGACAACGGC GGCGCCAACT ATATCGGCCT GCCGGACATC AACCGGCCCT ATCGCGGCTG GAAGGCGACG TTCTTCGAGG GCGGCATCCA TTCGCCGTTC TTCATGCGCT GGCCGGCCGT GATCCCCGCC AATTCCCGCT ACAGCGCGCC AGTGGGCCAC ATCGACATCT TCGCCACGGC GGCGGCGGCG GCGGGCGCGC CCTTGCCGAA GGATCGGGTG ATCGACGGGG TCGACCTGGT TCCCTTCGTC CAGGGAAAGG CGACGGGGCG CCCGCACCAG ACGCTGTTCT GGCGCTCGGG CAGCTACAAG GTCGTGCTCG ACGGCGACTG GAAGCTGCAG TCGAGCGAGG CCCAGAACAA GATTTGGCTG TTTAATTTGG CCCAGGACCC GACCGAACAA CACGAGCTCA GCGCGGCGCA GCCGGAGCGG GTGAAGGCGA TGTTGGCCCT GCTCCGCCAG CAGGACGCCC AAAACGCCAA GCCAATCTGG CCGTCGCTGC TCCAGGGGCC GATCTTCATC GACCACCCCT CGGGAGTCCC GCAGAAGAAG GGGCAGGAAT ACATCCTGTG GGACAACTGA
|
Protein sequence | MKLWTWLAGL ALIVAVALGW AATHKQAVFM WIAHMRLPHV EPNHAVAWSE GPEAAPSGPR PPNVIVILAD DMGFNDITFN GGGVAGGLVP TPNIDSLGHD GVSFANGYDG NATCAPSRAT IMTGRYATRF GFEFTPAPVA FEKMVGSEGA AGDIVLPRFY PDRLKAMPPG STAPTPDAVN ELSMPASEIT VAQLLKTRGY HTLHFGKWHL GGKAGSRPEQ KGFDESLGFI AGGSMYLPEG DPGVENAKQP WDPIDRFLWP NLPYAVQFNG SPMFRPGGYM TDYLTDEAVK AVRANRNRPF FMYFAPNAIH TPLQATKADY DALPEIKDHR LRVYGAMVRN LDRNVGRLLQ ALKEEGLDQN TLVIFTSDNG GANYIGLPDI NRPYRGWKAT FFEGGIHSPF FMRWPAVIPA NSRYSAPVGH IDIFATAAAA AGAPLPKDRV IDGVDLVPFV QGKATGRPHQ TLFWRSGSYK VVLDGDWKLQ SSEAQNKIWL FNLAQDPTEQ HELSAAQPER VKAMLALLRQ QDAQNAKPIW PSLLQGPIFI DHPSGVPQKK GQEYILWDN
|
| |