Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2992 |
Symbol | |
ID | 5900447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3251084 |
End bp | 3252988 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641563489 |
Product | sulfotransferase |
Protein accession | YP_001684617 |
Protein GI | 167646954 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.49268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACATCG GCCGCGACGC TTTATTGCTA GAGGCGAGAG ACCTGCGCTC GCGCCAACGC CTGCCAGACG CCTTGGCCGC CCTCGCGCGG CTGGAGACGC TGCATCCCCG GTTCAGCCGT CTGCATCAGG AGCGCGGCCA CTGTCATGTC CTGCTCGGGG ATGCGCCGGC GGCCATCGAC GCGCTGCGCG AGGCGGTGCG TCTCAACCCG ACGCTACCGG CGAGCTGGGA CATGCTGGAG CAGCTTTACC GCATGCAGGG CGCCACCGCC CAGGCGGTCA ATGCGGCGCG GCATCTGGCC ACGCTGAGAC AGCTGCCGTC GGCCGTGGTG GCGGCCAACG GCCTGTTCGC CGATGGCGAC CTGTCGCCGG CGGAGGAGAT CCTTCGGGAC TACCTGAGCC GGGATGGCGA CAATGTGGGC GCCCTGCGTC TGCTGGCGCG GATCCGCATG CAGAGCGACG CGCTCGACGA GGCCGAGGCG TTGCTGAAAT CCGTGATCGA GCGCGCGCCG GACTACCACG CCGCGCGCCT GGACTACGCC ATGGTGCTCT TGCAGCGGCA AAAGCCTCTG GAGGCGCGGC GGGAGGCCGA ACATTTGCTC GCGCACGACC CGGACAACCG CGACTACCTC AAGCAGTACG GCGCGGCCTG CGTCAGCCTG GGCGACCACG AGCCGGTGAT CGATCTCTAT GAAAGGTTGC TGGCGGGGCG ACCTCAGTCG GGCGCCGAGG TCGCCGACCT GCGGCTTTGG CGGGCGAACG CCCTGAAGAT CACTGGCCGG CGGCAGGAGG CCATCGCGGA CTATCGCGCC GCCCTGGCGG CGCGGCCGGA CCATGGCGTT GCCTGGTTCA GCCTCGCCAA CCTCAAGACC TACCGCTTCA CCGACGATGA CGTCTCGCGA ATGCAGGTGG CCGAGGCCCA GCCGGGCATC CAGACCATGG ACCGGGTCTA TATGGCCTTC GCGTTGGGCA AGGCGCTGGA GGACCGGGGC GACTACGCGG CCTCGTGGCG GCGCTACGAG CGCGGCAATG CGGTGCGGCG CGCGGCCGGC CGCTATCGCC CGGAGATCGC GGAAGCCTGC GCGCTCCGAT TGAAGCAAAT CTTCACCGCC GATTTCTTCG CCGAACGCGC CGGCTGGGGC GTGGACGATC CGGCGCCCAT CTTCATCCTG GGCCTACCCC GTTCGGGCTC GACCTTGATC GAGCAGATCC TGGCCTCCCA TTCCCGCGTG GAGGGCACGC AGGAACTGAC CGAGATCGGC CGATATGCCG GCGAACTCTG CGGTCGCGAT CCGGATTGCG GTTTGCCACT GGACCCCGAG GCGTTGTCGC GCCTTAAGGC GGAGGATGTC CGAGCGCTCG GCGAACGCTT CCTGGCCGAA ACCCGCGCCT ATCGTCGGCT GGGCAGACCG TCCTTCATCG ACAAGATGCC AAATAACTTC TGGCACATCG GGCTGATCCA CCTGATCCTG CCTCGCGCGA CAATCATCGA TGTGCGGCGC GAACCGATGG CCTGCTGCTT CAGCAATCTC AAGCAGTTGT TCGGCACGAC CAACCAGGAA TTCACCTACG GCGTCGACGA CATCGCCCGC TACTACCGCA CCTATCTCGA CGTCATGCGG CACTGGGGCG ATGTGTTGCC GGAGAGGGTT CTGAAGGTCC GGTACGAGGA CGTGGTCGAG GATCTCGAAG GCGGCGTGCG GCGTATGCTG GAGCACTGCA AACTGCCCTT CGAGCCGGCC TGCCTGACCT TCCACGAGAC CAAGCGCAGC GTGCGCACGC CCAGTTCCGA GCAGGTGCGC CAGCCCATCG GTCGCGAGGG GCTCACGCAA TGGGAGCACT ACGCGCCTTG GCTCAACGAC CTGCGGGACG CGCTGGGCGA CGCCATGACC GGCTACAGGG ACTGA
|
Protein sequence | MNIGRDALLL EARDLRSRQR LPDALAALAR LETLHPRFSR LHQERGHCHV LLGDAPAAID ALREAVRLNP TLPASWDMLE QLYRMQGATA QAVNAARHLA TLRQLPSAVV AANGLFADGD LSPAEEILRD YLSRDGDNVG ALRLLARIRM QSDALDEAEA LLKSVIERAP DYHAARLDYA MVLLQRQKPL EARREAEHLL AHDPDNRDYL KQYGAACVSL GDHEPVIDLY ERLLAGRPQS GAEVADLRLW RANALKITGR RQEAIADYRA ALAARPDHGV AWFSLANLKT YRFTDDDVSR MQVAEAQPGI QTMDRVYMAF ALGKALEDRG DYAASWRRYE RGNAVRRAAG RYRPEIAEAC ALRLKQIFTA DFFAERAGWG VDDPAPIFIL GLPRSGSTLI EQILASHSRV EGTQELTEIG RYAGELCGRD PDCGLPLDPE ALSRLKAEDV RALGERFLAE TRAYRRLGRP SFIDKMPNNF WHIGLIHLIL PRATIIDVRR EPMACCFSNL KQLFGTTNQE FTYGVDDIAR YYRTYLDVMR HWGDVLPERV LKVRYEDVVE DLEGGVRRML EHCKLPFEPA CLTFHETKRS VRTPSSEQVR QPIGREGLTQ WEHYAPWLND LRDALGDAMT GYRD
|
| |