Gene Caul_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4042 
Symbol 
ID5901504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4378039 
End bp4379298 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content68% 
IMG OID641564563 
Productdiguanylate phosphodiesterase 
Protein accessionYP_001685665 
Protein GI167648002 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.18678 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGGC TGATGCTGGC GCTGCTGACC GGCGCCTATC TCTGTCTGGC TCTTGTGACC 
TCCCTGGCCC TGTGGCGCAT GGGAGCGGCT CCGGCCGTCG GCCTGGCCGC CTTCATCGGC
GCCATGGGCC TGTGCTTCGC CATCCACGGC GTGATCGCCG GCGTCCTGCA GGGCGCGACC
CTGCGCGTCG ATATCGAGAC CATCCGCGAG GCCCACGGCA TCCTGCTGGA ACAGATCGAG
AAGGTCGACG CCCGCATCAC CGGCCTGCTG GAGACCGTCG CCGACGACGC CCAGCGCCGC
TCGGCCGAAC TGACCAGCGA GGTCCATCAG TTGGAAGACC TGGTCGTGCG CATGAGCGAC
CGGCTGGAAA ACCAGCTGAC CCACCATGTC GCCGCCGCCC GCGACGAGCC GCGCGGCCGC
TCGTCGCAGT CCAGCGCCTT GCTCGGCGTG GTGCAGGACG CCCTGGCCGA CAACCGGGTC
GACCTCTATC TGCAGCCGGT CGTCAGCCTG CCCCAGCGCC GGACCGTCTT CTACGAGAGC
TTCTCGCGCC TGCGCGACGA GACCGGCCGG GTGCTGATGC CCGCCGAATA CCTGGCCGTG
GCCGAGCCCG AGGGCCTGAC CGCCGCGATC GACAACCTGC TGCTGTTCCG CTGCGTGCAG
ATCGTCCGTC GCCTGGCCAA GCAGGACCGC AAGGTCGGGA TTTTCTGCAA CATCTCGCTG
GCCAGCCTGG CCGACGAGGC GTTCTTCGCC CAGTTCCTCG AATTCCTGCA GGTCAACAAG
GACCTGTCGG GCGCCCTGAT CTTCGAACTG GGCCAGGCCG CCTTCAACGA CCGAGGCCCG
GTCGAGGCCC GTCACATGGC CCGCCTGGCC AGCCTGGGTT TCCGCTTCAG CCTTGACAAG
GTCACCGACC TGGACCTGGA CTTCCAGGAC CTGGCCCGCG CCGACGTCAA GTTCCTGAAG
ATTGGCGCCC AGCTTCTTCT GGACCAGTTG GAAGAGCAGG GCGGCAAGCT GGTCATCGCC
TCGTTGCCCG ACCTCAATGC CGAGGACTTC GCCGGCCTGA CCCGTCGCTA CGGCATCGAG
GTGATCGCCG AGAAGGTCGA GCACGAGCGC CAGGTGGTCG ACGTGCTGGA GCTCGACATC
GGCTACGGCC AGGGCCACCT GTTCGGCGAG CCCCGCGCCA TCCGCGACTC GATCATCGCC
GAAGCCGACC CGCCGCAGGA CTTCATGCGC GGCGCGATGC GGCGCGGGAT GGGGCGGTAG
 
Protein sequence
MRRLMLALLT GAYLCLALVT SLALWRMGAA PAVGLAAFIG AMGLCFAIHG VIAGVLQGAT 
LRVDIETIRE AHGILLEQIE KVDARITGLL ETVADDAQRR SAELTSEVHQ LEDLVVRMSD
RLENQLTHHV AAARDEPRGR SSQSSALLGV VQDALADNRV DLYLQPVVSL PQRRTVFYES
FSRLRDETGR VLMPAEYLAV AEPEGLTAAI DNLLLFRCVQ IVRRLAKQDR KVGIFCNISL
ASLADEAFFA QFLEFLQVNK DLSGALIFEL GQAAFNDRGP VEARHMARLA SLGFRFSLDK
VTDLDLDFQD LARADVKFLK IGAQLLLDQL EEQGGKLVIA SLPDLNAEDF AGLTRRYGIE
VIAEKVEHER QVVDVLELDI GYGQGHLFGE PRAIRDSIIA EADPPQDFMR GAMRRGMGR