Gene Caul_3018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3018 
Symbol 
ID5900473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3285949 
End bp3287961 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content66% 
IMG OID641563519 
Productpoly(R)-hydroxyalkanoic acid synthase, class I 
Protein accessionYP_001684643 
Protein GI167646980 
COG category[I] Lipid transport and metabolism 
COG ID[COG3243] Poly(3-hydroxyalkanoate) synthetase 
TIGRFAM ID[TIGR01838] poly(R)-hydroxyalkanoic acid synthase, class I 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.27509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACGA CTGAAACCTC TCCCAAACCG CGAAAAAAGG CCGTCTCCAA GGCGGATGCG 
ACGCCGACCG CCGCGAAGAC CAAGTCCAAG GCCGCGCCGA AGGCTCCTCC GAAAATCGCC
CCCGAACCGG CAGCCCGCGC CAAGCCCGCC GAATCGGCCT CGCGCCGCGA ACCGCCTCGG
GCGGCGGCGC CCACCGGCGC CATGCCCGAG CTCGAGGCCC TCCTGTCGCC CGACCAGCGC
CAGATGCTCG AGACCCTGTC GGCCAACCTG GCCCGCGCCG CCGTCACCGC CCAGGGGGCG
ATCGCCGAGG CCGCCCTGCG CCAGGCCGAC CGGCCGGCGG CCCTGAGCGC CGACCCGTTC
CACGTCGGCC CCGCCTTCAA CGAGGTGATG ACCAGCCTGG CCGCCCAGCC CGACCGCCTG
CTGCGGGCCC AGGCCGACCT CTTTACCCGC TACATGGACC TGTGGCAGTC GGCCGCCCGC
AGAATGACCG GCGAGCAGAC CCAACCCATC GTCGCGCCAA CCAGCGGCGA CAAGCGGTTC
AGCGACCCCG ATTGGGCCAC CAATCCGATG TTCGACATGA TGAAGCAGAG TTACCTGCTC
TCTTCCAATT GGCTGAACGA TCTGGTGTCG CAGGCCGAGG GCGTCGATCC CAGCGCCAAG
CGCCGGGTCG AGTTCTTCAC CAAGATGCTG ACCGACGCCT TCTCGCCGTC GAACTTCCTG
ATCTCCAACC CGGCCGCCCT GCGCGAGGTG ATGCAGAGCA AGGGCGAGAG CCTGGTGCGC
GGCATGCGAA ACTTCGCCGC CGATCTCGAG CGCGGCGGCG GCCAACTGGC CATCAGCCAG
ACCGATCTGG CCAAGTTCAA GGTCGGCGAG AATGTCGCCA CCGCCCCCGG CAAGGTGGTC
TATCAGAACG ACATCCTGCA GCTGCTGCAG TTCGATCCGA CCACGGAGCA GGTGCACGAG
ATCCCGCTGC TGATCTTCCC GCCGTGGATC AACAAGTTCT ATATCCTCGA CCTGCGGCCC
GAGAACTCGA TGATCCGCTG GCTGACCGGC CAGGGCTTCA CGGTGTTCGT GGCCTCGTGG
GTCAATCCCG ACAGTGAACA AGCGACCAAG ACCTTCGAGG ACTACATGTT CGAGGGGATC
TACGACGCCA GCCAGCAGGT GATGAACCAG ACCGGCGTCA ACAAGGTCAA CACCGTCGGC
TACTGCATCG GCGGCACCCT GCTGTCCTGC GCCCTGGCCC ACATGGCGGC CAAGGGCGAC
AAACGGATCA ATTCGGCCAC CTTCTTCGCC GCCCAGCAGG ACTTCTCCGA GGCCGGAGAC
CTGCTGCTGT TCACCGACGA GGAATGGCTG AAGTCGATCG AGACGCTGAT GGACCAGAAG
GGCGGCTACC TGCCCAGCCA GTCGATGGCC GACACCTTCA ACAGCCTGCG CGGCAACGAC
CTGATCTGGT CGTTCTTCAT CAACAACTAC CTGATGGGCA AGGAGCCGCG GCCCTTCGAC
CTGCTGTTCT GGAACGCCGA CCAGACGCGC ATGCCCAAGG CCCTGCACCT GTTCTATCTG
CGCAACTTCT ACAAGGACAA CGCCTTGACC ACGGGTCACC TGACCCTGGG CGGCGTGAAG
CTGGACCTGT CGAAGGTCAA GACCCCGATC TATGTCCAGT CATCCAAGGA CGACCACATC
GCCCCGTTCC GCAGCGTCTA TCGCGGCGCG CGAGCCTTCG GCGGGCCGGT CACCTTCACC
ATGGCCGGCT CGGGCCACAT CGCCGGGGTG ATCAACCATC CCGACGCCAA GAAGTACCAG
CACTGGACCA ACGACCAGTT GCCCGGCTCG GTCGAGGACT GGCGCGCCGG CGCGGTCGAG
CATCCCGGCT CGTGGTGGCC GCACTGGGCG ACCTGGCTGA AGGCCCGATC AGGCAAGCTG
GTCCCGGCCC GCGATCCGGC CAAGGGCCTG CTGAAACCGT TGGAGGACGC GCCGGGCAGC
TTCGTGCGGG TGCGGTCGAA CGCGGCGGCC TGA
 
Protein sequence
MATTETSPKP RKKAVSKADA TPTAAKTKSK AAPKAPPKIA PEPAARAKPA ESASRREPPR 
AAAPTGAMPE LEALLSPDQR QMLETLSANL ARAAVTAQGA IAEAALRQAD RPAALSADPF
HVGPAFNEVM TSLAAQPDRL LRAQADLFTR YMDLWQSAAR RMTGEQTQPI VAPTSGDKRF
SDPDWATNPM FDMMKQSYLL SSNWLNDLVS QAEGVDPSAK RRVEFFTKML TDAFSPSNFL
ISNPAALREV MQSKGESLVR GMRNFAADLE RGGGQLAISQ TDLAKFKVGE NVATAPGKVV
YQNDILQLLQ FDPTTEQVHE IPLLIFPPWI NKFYILDLRP ENSMIRWLTG QGFTVFVASW
VNPDSEQATK TFEDYMFEGI YDASQQVMNQ TGVNKVNTVG YCIGGTLLSC ALAHMAAKGD
KRINSATFFA AQQDFSEAGD LLLFTDEEWL KSIETLMDQK GGYLPSQSMA DTFNSLRGND
LIWSFFINNY LMGKEPRPFD LLFWNADQTR MPKALHLFYL RNFYKDNALT TGHLTLGGVK
LDLSKVKTPI YVQSSKDDHI APFRSVYRGA RAFGGPVTFT MAGSGHIAGV INHPDAKKYQ
HWTNDQLPGS VEDWRAGAVE HPGSWWPHWA TWLKARSGKL VPARDPAKGL LKPLEDAPGS
FVRVRSNAAA