Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3942 |
Symbol | |
ID | 5901404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4266675 |
End bp | 4268696 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564463 |
Product | sulfotransferase |
Protein accession | YP_001685565 |
Protein GI | 167647902 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.749375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACCA CCACCACCGA GCCCTCCGGC AGCCTCGCCA CGGCGCTCGC CCACACCCAG CGCCTGCTGG CCGCCGATCC GGCCATGGCC GCCGAGCAGG CCCGCGCGAT CCTGGAGGCC GTGCCGCGTC ACGCCGGGGC CACCCTGATG CTGGCCGCGG CCCTGCGGCT GTCGGGCGAC CTCGACCAGG CCCTGGAGGT CGTCGATCCC CTGGCCCGCG CCCTGGTCCA GTCGCCGGAG GTTCAGCTGG AGCACGGCTT GGTCCTGGCC CGCCTGGGCC AGACCCAGGC CGCGATCGCC GCCTTCAAGC GCGCCACGGC CCTGGATCCC GACCTGGCCG AGGGTTGGCG GGGCCTCGCC GAAGCGCTGG ACCTGGCCGG CGATGCGGCG GGCGCCCAGG CCGCCCAGGC CCGCCAGATC AAGGCCGGCG TCCGCGATCC GGCCCTGATG AGCGCCGCCG CCGCCCTGGT CGATGGCAAG CTGGGCGTCG CCGAGCAGAT CCTGCGCGAC GTGCTGCGGG TCCGGCCCGA CGAGCCGGCG GCGATCCGGA TGCTGGCCGA GGTCGCCGCC CGGCTGGGTC GCCACGACGA CGCCGAGACC CTGCTGGTCC GCTGCCTGGA GCTGGCCCCC GGCTTCACCG CCGCCCGTCA CAACCTGGCC ACCGTGCTTT ACCGGCAGGG CCGCTCCGAG GACGCCCTGG TCGAGTTGGC GCAACTGCTG GCCGGCGCGC CGCGCAACCC GGCCTATCTG AACCTCAAGG CCGCCGCCCT GGCCCGGATC GGCGAATACG CCCAGGCCAT CGAGCTCCTG GAGGACGTGC TGGCCCGCTT CCCCCAGCAG CCCAAGGGCT GGATGAGCTA TGGCCACGCG CTCAAGACCG TCGGCCGATC CGCCGACAGC GTGGCCGCCT ACCGCAAGGC CGTCGACCAG GCCCCGTCGC TGGGCGAGGC CTGGTGGAGC CTGGCCAACA TGAAGACCTA CCGCTTCGGC GACGCGGACC TGGCGGCGAT GGAAGCGGCG CTGGCCCAGC AGGATCTCGG CGAGGACGAC CGCCTCCACC TGCACTACGC CCTGGGCAAG GCCCACGAGG ACGCCGCCCG CTACGCCGAG TCCTTCGCCC ACTACGCCAG GGGCGCCGAC CTGCGGCGGG CGCAGATCGC CTACGATCCC GGCGTGATCC GCGAGCATGT GGCGCGCGGC AAGGCGGTCC TGACGGCGGA CCTGTTCGCG GCGCGGGCCG GCCAGGGCTG CCCCGCGCCC GACCCGATCT TCATCCTGGG CCTGCCGCGG TCCGGCTCGA CCCTGATCGA ACAGATCCTG GCCAGCCACT CGGCGGTCGA GGGCACGATG GAACTGCCCG ACATCACCTC GATGGCCCGC CGGCTGAGCG GGGCCAAGAC CAGCAAGGAG GCCTCGGCCT ATCCCGAGAT CCTGGCGACC CTGGGTCCGG AGGATCTCAA GGCGCTGGGC GAGGAGTTTC TCGAGCGCAC CCGGGTGCAG CGCAAGACCG CCCGGCCGCT GTTCATCGAC AAGATGCCCA ACAACTGGGC CCATGTCGGG CTGATCGCGC TGATGCTGCC CAACGCCAAG ATCATCGACG CGCGTCGCCA CCCGATGGGC TGCTGCTTCT CGGGCTTCAA GCAGCACTTC GCCCGGGGCC AGAACTTCAG CTACGGCCTG GACGACATCG GCCGCTACTA CGCCGACTAT GTCGAGCTGA TGGCCCATTT TGACGCCGTG CTGCCGAGCC GCGTGCACCG GGTGATCTAC GAAGAGATGG TCGAGGATCC AGAAACCCAG ATCCGCGCCC TGCTGGACTA TTGCGGCCTG CCGTTCGAAG CCGCCTGCCT GAACTTCCAC GAAAACGACC GCGCCGTGCG GACCGCAAGT TCAGAACAGG TCCGCCGGCC GATCTTCAAG GACGCGGTCG AGCATTGGCA GAACTACGAA TCGTGGCTGG GGCCGCTGAA GACCGCCCTG GGTCCGGTCT TGGCCAGCTA CCCGGCCGCG CCTGAATTTT GA
|
Protein sequence | MSTTTTEPSG SLATALAHTQ RLLAADPAMA AEQARAILEA VPRHAGATLM LAAALRLSGD LDQALEVVDP LARALVQSPE VQLEHGLVLA RLGQTQAAIA AFKRATALDP DLAEGWRGLA EALDLAGDAA GAQAAQARQI KAGVRDPALM SAAAALVDGK LGVAEQILRD VLRVRPDEPA AIRMLAEVAA RLGRHDDAET LLVRCLELAP GFTAARHNLA TVLYRQGRSE DALVELAQLL AGAPRNPAYL NLKAAALARI GEYAQAIELL EDVLARFPQQ PKGWMSYGHA LKTVGRSADS VAAYRKAVDQ APSLGEAWWS LANMKTYRFG DADLAAMEAA LAQQDLGEDD RLHLHYALGK AHEDAARYAE SFAHYARGAD LRRAQIAYDP GVIREHVARG KAVLTADLFA ARAGQGCPAP DPIFILGLPR SGSTLIEQIL ASHSAVEGTM ELPDITSMAR RLSGAKTSKE ASAYPEILAT LGPEDLKALG EEFLERTRVQ RKTARPLFID KMPNNWAHVG LIALMLPNAK IIDARRHPMG CCFSGFKQHF ARGQNFSYGL DDIGRYYADY VELMAHFDAV LPSRVHRVIY EEMVEDPETQ IRALLDYCGL PFEAACLNFH ENDRAVRTAS SEQVRRPIFK DAVEHWQNYE SWLGPLKTAL GPVLASYPAA PEF
|
| |