Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4734 |
Symbol | |
ID | 5902196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5121055 |
End bp | 5122185 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641565253 |
Product | CBS domain-containing protein |
Protein accession | YP_001686352 |
Protein GI | 167648689 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3448] CBS-domain-containing membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0061944 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACAGA CGCTCACCCT GGCCCTGCGC GGCCGCAATC CGATCCGACC GGCGGACATC CTGCGATCGG GCCTCGGCGC CCTGCTGGGC GTTCCCGCCA CCGGCCTGCT GGCGCATATG GTCGCCAGCG GCCATGCCTC CGCCCTGCCG TTGCTGGTTC CGCCGATCGG GGCGTCGGCG GTGCTGGCCT TCGCCGTGCC CGCCAGCCCG CTGGCCCAGC CGCGCGCGGT CATCGGCGGC AACATGGTCT CGGCCCTGGC CGGCGTGACC TGCGCCCTGG CCTTCCATCC GCACCCCGCC CTGGCGGCGG CGGCGGCCGT GGCCTGCGCG ATCATCGCCA TGGGCCTGTT GGGGTGCCTG CACCCGCCCG GGGGCGCCGT CGCGCTCGGC GCCGCCCTGG TCGCCGGTCC GGTCGGCCCG GCCTCCTATG CCTATGTCTT CGTCCCGATC GGCCTGTGCT CGGGCCTGCT GGTGCTGGCC GCGATGGCCT ATGCGCGGGT CGCCGGACGA TCCTATCCGC ACCGGGTCCC GCCGCCGGCC AACGTGCACG CCACCCTCGA CGCCCCGCCC TCCCAGCGGG TCGGCTTCAC CGCCGCGGAC ATCGACAACG CCCTGGCCCA TTACGGCGAC CTGCTCGACG TCGATCGCGA GGACCTGGAC GCCCTGTTCC GCGAGGTCGA GCTTCAGGCC CACCGGCGTA TCCACGCCCA CATCCTGTGC AGCGACATCA TGTCCCGCGA CGTGTTGAGC GTGGACCTCC ACCAGACCGC CGAAAGCGCC CTGGCCTACA TGCGGACCCA CGATCTGCGC GCCGCGCCGG TCGTCGACGC CGATCGCAGG GTGGTCGGCA TGGTCCGCCG CGCCGAACTC CAGACCGGAC GGGAGGGCCT GGTCGAGGCG GTGTTGGATC CCTTCGTGCA CAAGGTTCGC CCCGGCACCG CGATCGAGGC CCTGCTGCCG ATCCTGTCCA GCGGGGTGGC GCACGAGGCC ATGGTGGTCG ACGAACACCG CGTGCTGCTG GGCATCATCA CCCAGACCGA TCTGCTCGGC GTGCTCTACC GGGCGCACAT CGTCGAGGCG GTGGCGCTGC AGCGGGCGGA GGAGGCCGGC GCGATCGATC CGACCATCTA G
|
Protein sequence | MRQTLTLALR GRNPIRPADI LRSGLGALLG VPATGLLAHM VASGHASALP LLVPPIGASA VLAFAVPASP LAQPRAVIGG NMVSALAGVT CALAFHPHPA LAAAAAVACA IIAMGLLGCL HPPGGAVALG AALVAGPVGP ASYAYVFVPI GLCSGLLVLA AMAYARVAGR SYPHRVPPPA NVHATLDAPP SQRVGFTAAD IDNALAHYGD LLDVDREDLD ALFREVELQA HRRIHAHILC SDIMSRDVLS VDLHQTAESA LAYMRTHDLR AAPVVDADRR VVGMVRRAEL QTGREGLVEA VLDPFVHKVR PGTAIEALLP ILSSGVAHEA MVVDEHRVLL GIITQTDLLG VLYRAHIVEA VALQRAEEAG AIDPTI
|
| |