Gene Caul_2674 details


Gene Information

Locus tag: Caul_2674
Symbol:
ID: 5900129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2906332
End bp: 2908011
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 66%
IMG OID: 641563165
Product: sulfatase
Protein accession: YP_001684299
Protein GI: 167646636
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
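
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 1680 bp length given above, and that the final codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2906332, 2908011)
print(length, protein_length_aa(length))  # 1680 559
```

Both values agree with the Gene Length and Protein Length fields of this record.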


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.687796
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTCT GGACTTGGCT GGCCGGACTG GCGCTGATCG TCGCTGTCGC CCTGGGCTGG 
GCGGCCACGC ACAAGCAGGC CGTCTTCATG TGGATCGCCC ATATGCGCCT GCCGCATGTG
GAGCCCAACC ATGCGGTGGC CTGGTCGGAG GGGCCGGAAG CCGCGCCGAG CGGGCCACGT
CCGCCCAACG TGATCGTCAT CCTGGCGGAC GACATGGGTT TCAACGACAT CACCTTCAAC
GGCGGCGGGG TGGCCGGCGG TCTCGTGCCG ACGCCCAATA TCGACTCCCT GGGCCATGAC
GGGGTCAGCT TCGCCAACGG CTATGACGGC AACGCCACCT GCGCCCCGTC GCGCGCGACG
ATCATGACCG GCCGCTACGC CACGCGCTTC GGCTTCGAGT TCACGCCCGC GCCGGTCGCC
TTCGAGAAGA TGGTCGGCAG TGAAGGCGCG GCGGGCGATA TCGTTCTGCC CCGGTTCTAT
CCCGATCGCC TCAAGGCCAT GCCGCCGGGC TCGACCGCGC CGACGCCCGA CGCGGTGAAT
GAGCTGAGCA TGCCGGCCAG CGAGATCACC GTCGCCCAAC TTCTGAAGAC GCGCGGCTAC
CACACCCTGC ATTTCGGCAA GTGGCATCTG GGCGGAAAGG CCGGATCGCG CCCGGAACAG
AAGGGCTTCG ACGAAAGTCT GGGCTTCATC GCTGGCGGGT CCATGTACCT GCCCGAAGGC
GACCCGGGCG TCGAGAACGC CAAGCAACCT TGGGATCCGA TCGATCGCTT CCTGTGGCCG
AACCTTCCGT ATGCGGTGCA GTTCAACGGC TCGCCGATGT TCAGGCCGGG TGGCTACATG
ACTGACTATC TGACCGATGA GGCCGTCAAG GCGGTCAGGG CCAACCGCAA CCGGCCGTTC
TTCATGTATT TCGCGCCCAA TGCGATCCAC ACGCCGCTCC AGGCCACCAA GGCCGACTAC
GACGCCCTTC CGGAAATCAA GGATCATCGC CTGCGCGTCT ATGGCGCGAT GGTGCGCAAC
CTCGATCGCA ACGTCGGCCG GCTGCTGCAG GCGTTGAAGG AGGAGGGACT CGACCAGAAC
ACGCTGGTGA TCTTCACCAG CGACAACGGC GGCGCCAACT ATATCGGCCT GCCGGACATC
AACCGGCCCT ATCGCGGCTG GAAGGCGACG TTCTTCGAGG GCGGCATCCA TTCGCCGTTC
TTCATGCGCT GGCCGGCCGT GATCCCCGCC AATTCCCGCT ACAGCGCGCC AGTGGGCCAC
ATCGACATCT TCGCCACGGC GGCGGCGGCG GCGGGCGCGC CCTTGCCGAA GGATCGGGTG
ATCGACGGGG TCGACCTGGT TCCCTTCGTC CAGGGAAAGG CGACGGGGCG CCCGCACCAG
ACGCTGTTCT GGCGCTCGGG CAGCTACAAG GTCGTGCTCG ACGGCGACTG GAAGCTGCAG
TCGAGCGAGG CCCAGAACAA GATTTGGCTG TTTAATTTGG CCCAGGACCC GACCGAACAA
CACGAGCTCA GCGCGGCGCA GCCGGAGCGG GTGAAGGCGA TGTTGGCCCT GCTCCGCCAG
CAGGACGCCC AAAACGCCAA GCCAATCTGG CCGTCGCTGC TCCAGGGGCC GATCTTCATC
GACCACCCCT CGGGAGTCCC GCAGAAGAAG GGGCAGGAAT ACATCCTGTG GGACAACTGA
 
Protein sequence
MKLWTWLAGL ALIVAVALGW AATHKQAVFM WIAHMRLPHV EPNHAVAWSE GPEAAPSGPR 
PPNVIVILAD DMGFNDITFN GGGVAGGLVP TPNIDSLGHD GVSFANGYDG NATCAPSRAT
IMTGRYATRF GFEFTPAPVA FEKMVGSEGA AGDIVLPRFY PDRLKAMPPG STAPTPDAVN
ELSMPASEIT VAQLLKTRGY HTLHFGKWHL GGKAGSRPEQ KGFDESLGFI AGGSMYLPEG
DPGVENAKQP WDPIDRFLWP NLPYAVQFNG SPMFRPGGYM TDYLTDEAVK AVRANRNRPF
FMYFAPNAIH TPLQATKADY DALPEIKDHR LRVYGAMVRN LDRNVGRLLQ ALKEEGLDQN
TLVIFTSDNG GANYIGLPDI NRPYRGWKAT FFEGGIHSPF FMRWPAVIPA NSRYSAPVGH
IDIFATAAAA AGAPLPKDRV IDGVDLVPFV QGKATGRPHQ TLFWRSGSYK VVLDGDWKLQ
SSEAQNKIWL FNLAQDPTEQ HELSAAQPER VKAMLALLRQ QDAQNAKPIW PSLLQGPIFI
DHPSGVPQKK GQEYILWDN
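
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch that translates codons and computes GC content. It uses only the Python standard library. The helper names (clean, gc_content, translate) are hypothetical, and the codon table is written in the NCBI T, C, A, G ordering. Translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, no special handling of alternative start codons is included here.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G), third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAAGCTCT GGACTTGGCT GGCCGGACTG"
print(translate(first_blocks))    # MKLWTWLAGL
print(gc_content("ATGAAGCTCT"))   # 0.4
```

The translated fragment matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same functions would be the way to check the remaining residues and the GC content field.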