Gene Caul_4734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4734 
Symbol 
ID5902196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5121055 
End bp5122185 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content73% 
IMG OID641565253 
ProductCBS domain-containing protein 
Protein accessionYP_001686352 
Protein GI167648689 
COG category[T] Signal transduction mechanisms 
COG ID[COG3448] CBS-domain-containing membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0061944 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGACAGA CGCTCACCCT GGCCCTGCGC GGCCGCAATC CGATCCGACC GGCGGACATC 
CTGCGATCGG GCCTCGGCGC CCTGCTGGGC GTTCCCGCCA CCGGCCTGCT GGCGCATATG
GTCGCCAGCG GCCATGCCTC CGCCCTGCCG TTGCTGGTTC CGCCGATCGG GGCGTCGGCG
GTGCTGGCCT TCGCCGTGCC CGCCAGCCCG CTGGCCCAGC CGCGCGCGGT CATCGGCGGC
AACATGGTCT CGGCCCTGGC CGGCGTGACC TGCGCCCTGG CCTTCCATCC GCACCCCGCC
CTGGCGGCGG CGGCGGCCGT GGCCTGCGCG ATCATCGCCA TGGGCCTGTT GGGGTGCCTG
CACCCGCCCG GGGGCGCCGT CGCGCTCGGC GCCGCCCTGG TCGCCGGTCC GGTCGGCCCG
GCCTCCTATG CCTATGTCTT CGTCCCGATC GGCCTGTGCT CGGGCCTGCT GGTGCTGGCC
GCGATGGCCT ATGCGCGGGT CGCCGGACGA TCCTATCCGC ACCGGGTCCC GCCGCCGGCC
AACGTGCACG CCACCCTCGA CGCCCCGCCC TCCCAGCGGG TCGGCTTCAC CGCCGCGGAC
ATCGACAACG CCCTGGCCCA TTACGGCGAC CTGCTCGACG TCGATCGCGA GGACCTGGAC
GCCCTGTTCC GCGAGGTCGA GCTTCAGGCC CACCGGCGTA TCCACGCCCA CATCCTGTGC
AGCGACATCA TGTCCCGCGA CGTGTTGAGC GTGGACCTCC ACCAGACCGC CGAAAGCGCC
CTGGCCTACA TGCGGACCCA CGATCTGCGC GCCGCGCCGG TCGTCGACGC CGATCGCAGG
GTGGTCGGCA TGGTCCGCCG CGCCGAACTC CAGACCGGAC GGGAGGGCCT GGTCGAGGCG
GTGTTGGATC CCTTCGTGCA CAAGGTTCGC CCCGGCACCG CGATCGAGGC CCTGCTGCCG
ATCCTGTCCA GCGGGGTGGC GCACGAGGCC ATGGTGGTCG ACGAACACCG CGTGCTGCTG
GGCATCATCA CCCAGACCGA TCTGCTCGGC GTGCTCTACC GGGCGCACAT CGTCGAGGCG
GTGGCGCTGC AGCGGGCGGA GGAGGCCGGC GCGATCGATC CGACCATCTA G
 
Protein sequence
MRQTLTLALR GRNPIRPADI LRSGLGALLG VPATGLLAHM VASGHASALP LLVPPIGASA 
VLAFAVPASP LAQPRAVIGG NMVSALAGVT CALAFHPHPA LAAAAAVACA IIAMGLLGCL
HPPGGAVALG AALVAGPVGP ASYAYVFVPI GLCSGLLVLA AMAYARVAGR SYPHRVPPPA
NVHATLDAPP SQRVGFTAAD IDNALAHYGD LLDVDREDLD ALFREVELQA HRRIHAHILC
SDIMSRDVLS VDLHQTAESA LAYMRTHDLR AAPVVDADRR VVGMVRRAEL QTGREGLVEA
VLDPFVHKVR PGTAIEALLP ILSSGVAHEA MVVDEHRVLL GIITQTDLLG VLYRAHIVEA
VALQRAEEAG AIDPTI