Gene Caul_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1024 
Symbol 
ID5898479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1084434 
End bp1085426 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content71% 
IMG OID641561506 
Producthypothetical protein 
Protein accessionYP_001682652 
Protein GI167644989 
COG category[R] General function prediction only 
COG ID[COG4188] Predicted dienelactone hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.401843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATCC GCTTCCGACT TATCGCTTCC CTGGCCGCCC TGGCCCTCGC GGCGAGCGCC 
GCGCCCGCCC TGGCCGCCAA CGCCGGCTTC ACCGTGGTCC AGGCCGCCGA CCCGCGAGGC
GCGCCGATCA CCGTGGGGAT CTGGTATCCC ACCGACGCCC CGGCCAAGCC CATGAAGCTG
GGCATCGGCG ACCAGGTCGT CGCCCCCGGC GCGCCCCTGG TCGGCGACCA CCTGCCGTTG
ATCGTCATGT CGCACGGCAA CGGCGGCTTC TTCGGCGGCC ACGCCGACAC CGCCCAGGCG
CTGGCCGAGG CCGGCTTCGT GGTCGCGGCC TTGACCCACA CCGGCGACAA CTACGCTGAT
CAGAGCCGAG CCACCGACAT GCCCAACCGG CCCCGCCAGC TGTCGGTGCT GATCGACTAC
ATGCTGACCG CATCGCCGAT GCACGCGGCG ATCGACCCCG CGCGGGTCGG GGCGTTCGGA
TTCTCGTCCG GCGGCTTCAC GGTGCTGGTG GCGGCCGGCG CCGAGCCGGA CCTGAAGACG
ATCGCCCCGC ACTGCGAGGC CCATCCCGAC TTCTTCGACT GCAAGCTGAC CGCCGGCCAT
CCGCTGCCCG CCGACGTTTC CAAGGCGGTC TGGACCCACG ACACGCGGAT CAAGGCCGTC
GTCTCCGCCG CGCCGGCCCT GGGCTACAGC TTCTCCAAGG CGGGCCTCTC GAAGGTGACC
CTGCCGCTGC AGCTGTGGCG GGCCGGCAAC GACGAAATCC TGCCCGATCC GTTCTACGCC
TCCAACGTCC GCGCCAATCT GCCCAAGGCG CCCGACTACC AGGTGGTCGC CAATGCCGGG
CATTTCGACT TCCTGACGCC CTGCAACGAC CAGGGCCGCG CCACCGCCGC CGCCATCTGC
GGCAGCGCGC CGGGCTTCGA CCGCGCCGCC TTCCACAAGG ATTTCGACCG CGAGGTGGTG
GGGTTCTTCA AGGGCAGTCT GGGCCAGCCC TAA
 
Protein sequence
MPIRFRLIAS LAALALAASA APALAANAGF TVVQAADPRG APITVGIWYP TDAPAKPMKL 
GIGDQVVAPG APLVGDHLPL IVMSHGNGGF FGGHADTAQA LAEAGFVVAA LTHTGDNYAD
QSRATDMPNR PRQLSVLIDY MLTASPMHAA IDPARVGAFG FSSGGFTVLV AAGAEPDLKT
IAPHCEAHPD FFDCKLTAGH PLPADVSKAV WTHDTRIKAV VSAAPALGYS FSKAGLSKVT
LPLQLWRAGN DEILPDPFYA SNVRANLPKA PDYQVVANAG HFDFLTPCND QGRATAAAIC
GSAPGFDRAA FHKDFDREVV GFFKGSLGQP