Gene Information |
Locus tag | Caul_0237 |
Symbol | |
ID | 5897511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 260923 |
End bp | 262947 |
Gene Length | 2025 bp |
Protein Length | 674 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641560721 |
Product | sulfatase |
Protein accession | YP_001681872 |
Protein GI | 167644209 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
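The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record above (Caul_0237).
length = gene_length(260923, 262947)
print(length)                  # 2025
print(protein_length(length))  # 674
```

Under these assumptions both results match the Gene Length (2025 bp) and Protein Length (674 aa) fields listed above.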
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.871196 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGG CGAGCCGGCC CAATATCCTG CTGATCACCT GCGACCAGTA TCGCTTTCCC CGGTTCTCCT ACGGGGCGGA CGCGGGGTTT AGCGAGCCGC TGAAGCGCAT CCTGGGTTTC CAGCGCGAGG ACGACGCTCA AAACCCGTAC GCCCCGTACT TTCCCGGCTT GCTGGCCTTG CGCCAGAACG CGGCGCTGTT GCGCAACCAC ACCATCGCCG CCAGCGCCTG CACGCCCAGT CGGGCGGTGA TCTATACCGG CCAGTACGGG ACCAAGACCG GGGTCACCCA GACCGACGGC CTGTTCAAGA GCGGCGACTC CTACAACTTT CCCTGGCTGG CCGCCGACGG TATCCCCACC CTGGGAACCT GGATGCGCGA GGCCGGCTAC TCCACCCACT ATTTCGGCAA ATGGCACGTC AGCAACCCGC CCGAGCACTC GCTGGACCGT TACGGCTTCG ACGACTGGGA GGAATCCTAT CCCGAGCCGC ACGGCGCGGC GATCAACAAC CTGGGCGTCT ATCGCGACGC CGGCTTCACC GACCAGGCCT GCGCCTTCAT CCGCCGCAAG GCCCTGGCCC TGAACTACAA CCGCGCCCAG GCCGTCGAGC AGGCGCGGGA CCCCTACGCC GCCGGGCCGG ACGCCGATAA CATCCCGCCC TGGTTCGCGG TCGCCTCGTT CACCAATCCC CACGACATCG CCACCTATCC GGCGGTGATC GCCCAGGCTC TGCCGACGCC GGACAATTCC GGCACGCAGT CGATCTTCGG TCCGCTGACC GTTCCGTTGC AGGGGCAGAA GACGCCGCCG CCGACCGCCG GCACGATCCA GATCGCGCTC AATGCCCTGG GCTTTCCGCA GGACTGCGCC AAGCCGTCGC CCACCCAGAA CGAGTCCCTG GCCGACAAGC CCAGCTGCCA GCGCGACTAC GCCTACAAGG TGGGCCTGGC CCTGAACGCC AAGACCGGCT TCAACATCGT CAACACCGTC GGGTCCAAGC TGCACGACCA GTTCCCCAAT CTCTCCGAGA CCCCGGACCT GGCGCGGCGG GCGGCGGTCC AGCAGGCGCT GAAGGGGACA ATCCCCTTCC AGCTGAGCGA CGATCCGGAC GGCTACGCCC TGCAGTTCCT GCAGCTCTAT GGCTGGCTGC ACGCCGTGGT CGACACCCAC GTGACGGCCG TGCTGAAGAC GCTGGAAGAG ACGGGCCAGG CCGACAACAC CATCGTCATC TTCCTGGCCG ACCACGGCGA GTACGCGGCG GCCCACGGCA TGATGATCGA GAAGTGGCAC ACGGCCTATC AGGAGGCCCT GCATGTGCCG GTGGTCGTGC GCTTCCCGCC ATCGACGAAG GTGGTCGAGA ACGAACCCGG GACGGGGGAG GGGCCGCTGG GCTTCACGCC GCGCCAGATC GACGCCCTGA CCAGCCATAT CGACATCCTG CCCACCGTGC TGGGCCTGGC TGGGGTGACG CCCGATCAGC GGACGACGAT CGCCGAGCGC CTGGGCCGGC ATCGCCCCAC GCCGCCCCTG CCGGGAGTTG ACCTGTCGGG CCTGCTGAAG GGCGAGATCC ACGCGGTGAT CGAGCCGGAC GGCCGCGAGC GGCAGGGCGT ATTGTTCATC ACCGACGACG AGATCACCGC CCCCTCGGCC TCGAACGATG ATCCCGCCAA CCTCAAGTGC GACAAGGAGT TCGAGGTCTA CAGGCAGGTG GTCGAGACGG TGAACGATCA GCATCGGTTG CTGAACCTGG CGCCAGGTTC GGTGCGCCAG CCCAACCACG TGCGGTGCGT GCGAACCCTG 
CGCCACAAGC TCAGCCGCTA TTTCGACCCG TCAGGCGAAG CGGCGGAGGA GTGGGAGATG TATGATCTCG AGCGCGATCC CAACGAGGCG GTGAACCTGG TGCGGGTGGC CTCGCCGCTG ACCGCGCGAA CGGACCTGCC GTCGCCGTTC GTGACGGCCG AGGTGCAGGC GGAGGCGGAC CAACTGGCGA AGCTGCTGGC GGAACTGGAA GCGCGGGATC TGTAA
|
Protein sequence | MTQASRPNIL LITCDQYRFP RFSYGADAGF SEPLKRILGF QREDDAQNPY APYFPGLLAL RQNAALLRNH TIAASACTPS RAVIYTGQYG TKTGVTQTDG LFKSGDSYNF PWLAADGIPT LGTWMREAGY STHYFGKWHV SNPPEHSLDR YGFDDWEESY PEPHGAAINN LGVYRDAGFT DQACAFIRRK ALALNYNRAQ AVEQARDPYA AGPDADNIPP WFAVASFTNP HDIATYPAVI AQALPTPDNS GTQSIFGPLT VPLQGQKTPP PTAGTIQIAL NALGFPQDCA KPSPTQNESL ADKPSCQRDY AYKVGLALNA KTGFNIVNTV GSKLHDQFPN LSETPDLARR AAVQQALKGT IPFQLSDDPD GYALQFLQLY GWLHAVVDTH VTAVLKTLEE TGQADNTIVI FLADHGEYAA AHGMMIEKWH TAYQEALHVP VVVRFPPSTK VVENEPGTGE GPLGFTPRQI DALTSHIDIL PTVLGLAGVT PDQRTTIAER LGRHRPTPPL PGVDLSGLLK GEIHAVIEPD GRERQGVLFI TDDEITAPSA SNDDPANLKC DKEFEVYRQV VETVNDQHRL LNLAPGSVRQ PNHVRCVRTL RHKLSRYFDP SGEAAEEWEM YDLERDPNEA VNLVRVASPL TARTDLPSPF VTAEVQAEAD QLAKLLAELE ARDL
|
| |
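The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation such as the GC content reported in the Gene Information table. The following is a minimal sketch, assuming the GC content field is the fraction of G and C bases over the whole gene sequence. `clean_sequence` and `gc_content` are hypothetical helper names chosen for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence in this record.
print(gc_content("ATGACCCAGG CGAGCCGGCC"))  # 0.75
```

Passing the full Gene sequence field to `gc_content` would give the value for the whole gene, which can then be compared with the GC content field.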