Gene Caul_2971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2971 
Symbol 
ID5900426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3226716 
End bp3228500 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content68% 
IMG OID641563468 
Productpoly-beta-hydroxybutyrate polymerase domain-containing protein 
Protein accessionYP_001684596 
Protein GI167646933 
COG category[I] Lipid transport and metabolism 
COG ID[COG3243] Poly(3-hydroxyalkanoate) synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.745288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.158557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG TCCAAATCCC GCCAGGCGCT GGGACGTTCG CCGAGCCTGC GCAAGACTTG 
GCCGAAGATC CTTGCGACGG CCTGGCCGAC CTTGTCGACC GAACGGCGAG CGCGGCCCTG
GCCACCGCCA CGGGCGGCCT GTCGCCGGCC TCGGTCCTGG GCGCCTTCGC CGACTGGGCC
ACGCACCTGG CCATCTCGCC GGGACGTCAG TGGCGGTTGG CGGTCAAGGC CGCCCGCAAG
ACCGGCCGGC TCACCGATTT CGCCATGCGA GCCTTGTGGG AGGGCGCTGG GGCCACGCCC
TGCATCGCGC CCTTGCCGCA GGATCGCCGT TTCGACGATC CGGCCTGGCG CACCTGGCCC
TACAACGTGG TGCAGCAGGG ATTTCTCCTC CAGCAGCAAT GGTGGGATGT GGCCTGCACG
GGCGTGCCCG GGGTCACCGC GCGCCATGAG GCCATGACCC GCTTCGCCGT CCGCCAGGCG
CTCGACACCG TGGCGCCAAG CAACTTCATC GCCACCAATC CGGTCCTCCA GGCCAAGATC
GCCACGACCG GCGGGGCCTG CCTGGCCCAC GGCTGGACCA ATTTCGCCGA CGACCTGCGC
CGGCTGATCG AACGCAGGGG TCCCGATGGG GTTGAAGCCT TCCGGCCTGG TCATGAGGTG
GCTGTCACCC CCGGCCAGGT GGTCTTCCGC AATGACTTGA TCGAGCTGAT CCGTTATGCG
CCGACCACCG CAACGGTGCG TCCGCAGCCC ATCCTGATCA CGCCCGCCTG GATCATGAAG
TACTACATCC TCGATCTGTC GCCCGGCAAT TCGCTGGTCC GCTGGCTCGT CGATCAGGGC
TACTGCGTGT TCATGATCTC CTGGCGCAAT CCCGGCCAGG CGGACCGCGA GTTGACCTTG
GACGACTATC GCCGGCTGGG CTTTCTGGCG GCGCTCGACG CGGTGGTGGA GGAGACCGGG
GCGGCGAGCA TTCACGCGGT GGGTTATTGC CTGGGCGGAA CCTTGCTGGC CATCGCGGCG
GCGGCCATGG CCCGTGACGG CGACGACCGA TTGGCCAGCA TCACCTTGCT GGCCGCCCAG
ACCGAATTTA GCGAGCCCGG AGAGCTGGGT CTCTTCATCG ACGAAGGCCA GGTCCGTTTC
CTGGAGGACC TGATGTGGTC GCAGGGCTAT CTCGACGCTC GCCAGATGGG TGGGGCGTTC
CAGCTGCTGC GTTCCAACGA CCTGATCTGG TCGCGCGTCG TGCGCGAGTA CCTGCTGGGC
GAGCGCGCCC CGATGAGCGA CCTGATGGCC TGGAACGCTG ACGGCACGCG TTTGCCTTAC
GCCATGCACA GCCAGTACCT GCGCGCGCTT TTCCTTGACG ATGACTTGGC GGAAGGACGG
TTTCAGGTCG ATGGGCGGCC GGTGGCCCTG GAGGACATCC GCAGCCCCGT GTTCGCGGTC
GGCGCCGAGC GCGACCACGT CGCGCCCTGG CGTTCGGTGT TCAAGATCCA CCTTTCGGTG
GGGGCGGCGG TCACCTTCCT TCTGACCAGC GGCGGGCATA ACGCCGGCAT CGTCTCCGAG
CCGGGTCGAC CCGGACGCCA TTGGCGGCGT CGCACCCGAC CCGCTGGCGG GCGCTATGTC
GGTCCCGAGG CTTGGCTTGA TCTGGCCGAA GCGGGCGAAG GGTCCTGGTG GCCCGCCTGG
ACCCAGTGGC TGGACGAGCG GTCGGGCGCG CCGATCGAGC CTGAGGCCCT TCATACGGCC
GCGGACCTGG GGCCAGCGCC GGGAACCTAT GTCTTCGGCC GCTAA
 
Protein sequence
MSAVQIPPGA GTFAEPAQDL AEDPCDGLAD LVDRTASAAL ATATGGLSPA SVLGAFADWA 
THLAISPGRQ WRLAVKAARK TGRLTDFAMR ALWEGAGATP CIAPLPQDRR FDDPAWRTWP
YNVVQQGFLL QQQWWDVACT GVPGVTARHE AMTRFAVRQA LDTVAPSNFI ATNPVLQAKI
ATTGGACLAH GWTNFADDLR RLIERRGPDG VEAFRPGHEV AVTPGQVVFR NDLIELIRYA
PTTATVRPQP ILITPAWIMK YYILDLSPGN SLVRWLVDQG YCVFMISWRN PGQADRELTL
DDYRRLGFLA ALDAVVEETG AASIHAVGYC LGGTLLAIAA AAMARDGDDR LASITLLAAQ
TEFSEPGELG LFIDEGQVRF LEDLMWSQGY LDARQMGGAF QLLRSNDLIW SRVVREYLLG
ERAPMSDLMA WNADGTRLPY AMHSQYLRAL FLDDDLAEGR FQVDGRPVAL EDIRSPVFAV
GAERDHVAPW RSVFKIHLSV GAAVTFLLTS GGHNAGIVSE PGRPGRHWRR RTRPAGGRYV
GPEAWLDLAE AGEGSWWPAW TQWLDERSGA PIEPEALHTA ADLGPAPGTY VFGR