Gene Caul_4464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4464 
Symbol 
ID5901925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4836467 
End bp4839670 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table11 
GC content74% 
IMG OID641564983 
Productpyruvate carboxylase., methylmalonyl-CoA carboxytransferase 
Protein accessionYP_001686082 
Protein GI167648419 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
[COG4799] Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0495259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTCC AGCGCGTCCT GATCGCCAAT CGCGGCGAGA TCGCCATCCG CATCGCTCGC 
GCGGCGGCCG AGCTGGGGAT CGCCAGCATT GCGGTGTTCG CCACCGACGA GGCCGACGCG
CCGCACGTGT CGGCGGCGGA CGAAGCGGTC GCCCTGCCTG GGACCGGCCC CCGAGCCTAT
CTCGACATCG CCGCCATCGT GGCCGCCGCC AAGGCCGCCG GCTGCGACGC CCTGCATCCC
GGCTACGGCT TCCTGGCGGA GAACCCGGCC CTGGCCCGGG CGTGCGCGGC GATCGGGATC
GCCTTCATCG GCCCCTCTCC CGCGCACCTG GAAACCTTCG GCGACAAGGC CTTGGCCCGC
GCCCTGGCCG ACGCCAAGGC CATCCCCCTG CTGGAAGGGA CCGGCGCCTT GGACCTGGCC
GGCGCCCAGA GGTTCCTGGC GCGCGAGGGG CCGATGATGC TAAAGGCGGC GGCCGGCGGC
GGCGGGCGCG GCATGCGGCC GGTGCGCGAC GCCGGCGAGG TCGAGGCCGC CTTCGCCGCC
TGCGCCCGCG AGGCCCTGGC GGCGTTCGGC GACGGCACGC TCTATGCCGA GCGGCTGATG
GAAAACGCCC GCCATATCGA GGTGCAGATC GCGGGCGACG GGACGCACGT GGCGGCGGTG
GGCGACCGCG ACTGCTCGCT GCAGCGCCGT CGGCAGAAGC TGGTCGAAAT CGCCCCCGCC
CCGAACCTGC CCGATGAGAC CCGCGCGGCC CTGTTCGACG CCGCCGTCCG CCTGACCGCC
GGCTATCGCA CCCTGGCGAC AGTGGAGTTC CTGGTCGCTC CTGACGGCGA TTTCGCGTTC
ATCGAGGTCA ACGCCCGGCT CCAAGTCGAG CATACGGTGA CCGAGGAGGT GACGGGCGTC
GACCTGGTCC GCGCCCAGTT CGAGCTGGCC GACGGGGCCA GCCTGGCGGA CGTCGGGCTG
GCGACCACGC CGCACGCCCA GGGCTACGCG ATCCAGGGCC GGGTCAATCT GGAGACCCTA
AGCGTCGACG GGACGCCGCG ACCCTCGACC GGCACGATCA CCGCCTACCA GCCGCCCGGC
GGTCCTGGCG TTCGGGTCGA CGGCGCGGGC GCGGCGGGAC TGGTCGTCAC CGGCGCCTAT
GACAGCCTGC TGGCCAAGGT CATCGGGCGC GGCCGGACGG CGGCCGAGGC GGCGGGACGC
ACGGCCCGGG CCCTGGCCGA GTTCCGCATC GAGGGCGTCG ACACCACGAT CCCGCTGCTA
CGCGCGCTGC TGGCCGAACC CGCGACCGTG GACGGAACCG CGACCACCGA CTTCATCGAC
CGCGAGGCCG GCCGGCTGTT CGAGGCCGCC GGCGCCCCGC CCCAAAGCGC CGAACCCGTC
TTCACCGAGG ACGGCGCGGT GACGGCTCCG CTCCAGGCCC TGGTCGGGAT GATCGAGGTG
GTCGAGGGCG ACTTGGTGCG GCCGGGCCAG GCGGTGGCGG TGCTCGAGGC CATGAAGATG
GAGCACCTGG TCCATGCCGA GACCGGCGGC CGAGTGATCC GGGTCGCCGT CACCGCCGGC
GAGGCCGTGC GGCCCGGCCA GCCCCTGCTC TATCTGGAAC CGGCCGAGGT CGAGGAGGGC
GAGGCCCTGG CGGCCGAGGC GGTCGATCCC GACGCCCTTC GCCCCGACCA CGCCGAGGTG
ATCGCCCGCC ACCGATTCAC GCTCGACGAG GCCCGGCCCG AGGCGGTCGC CAAGCGCCGC
AAGACCGGCC ACCGCACGGC CCGGGAAAAC ATCGACGACC TGGTCGATCC CGGCAGCTTC
CTCGAATACG GCGCCCTGGC CATCGCCGCC CAGAAGCGCC GGCGCTCGGT CGAGGACCTG
ATCGCCGCCA CGCCCGCCGA CGGCCTGATC ACCGGCATCG GCAGCGTCAA CGGCGCCCTC
TTCCCACCCG ACAAGGCCCG GGTCGCGGCC CTGGCCTATG ACTTCACCGT GCTGGCCGGC
ACGCAAGGGG CGATGAACCA CCGCAAGTCC GACCGGCTGC TGGCCATCGT CGCCGAGCAA
CGCCTGCCAG TCGTCTGGTT CGCCGAGGGC GGCGGCGGGC GGCCGGGCGA CACCGACACC
ACGGCGGTGG CGGGGCTGGA CGTGCCGACC TTCCGCAGCA TGGCGGCCCT GTCCGGCGTC
GTGCCCAAGA TCGCCATCGT CGCCGGCCGC TGTTTCGCCG GCAACGCCGC CATCGCCGGC
CTGTCGGAGA TCATCGTCGC CACCCGGGAC TCCAACCTCG GCATGGGCGG GCCGGCGATG
ATCGAGGGCG GCGGCCTGGG CGTCTTCCGG CCCGAACAGA TCGGCCCCTC CGCCCACCAG
TGGGCCAACG GCGTGATCGA CCTGCTGGCC GACGACGAGG CCCACGCCAC GCGGCTGGCC
AGGCAGGCGC TGTCCTATTT CCAGGGCGCG GTGAAGGACT GGACCTGTCC GGACCAGCGC
CCCTTGCGTC GCGCGGTGCC GGAGAACCGG CTGCGGGTCT ATGACGTGCG GGCGCTGATC
GCGGTTCTGG CGGACACCAG CTCCTTCCTG GAACTGCGCG GCGGCTTCGC GGCGGGGATG
ATCACCGGCC TGGTGCGGAT CGAGGGCCGG CCGCTGGGGC TGATCGCCAA CGACCCGCGC
CACCTGGGCG GGGCGATCGA CGGCGACGGG GCCGAGAAGG CCGCGCGGTT CCTGCAGCTG
TGCGACGCCT TCGGCCTGCC GGTGCTGAGC CTGTGCGACA CCCCCGGCTT CATGGTCGGG
CCGGCGAGCG AGGACGCGGG CGCGGTGCGG CGGGTCAGCC GGCAGTTCAT CGCCGGGGCC
AAGCTGCGCA CGCCGCTGCT GACCGTGGTC ACCCGAAAGG GCTATGGCCT GGGCGCCCAG
GCCATGGCGG GGGGCAGCTT CCACAGCCCC CTGTTCATCG CCGCCTGGCC GACCGGCGAG
TTCGGCGGCA TGGGCCTGGA GGGCGCCGTG CGGCTGGGCT ACCGCAAGGA GCTGGAGGCC
GAGACCGATC CCGCCAAGCA GAAGGCGCTC TACGACCAGC TGGTGGCGCG GCTCTACGCG
GCGGGCAAGG CGACCAGCAT GGCGGCGGCC CTGGAGATCG ACGCGGTGAT CGACCCGGCC
GATACGCGGC GCTGGATCGT TGGCGGGCTG GACGCGGCGG CGGGGACGGT GCGGCCGTGG
GAAGTGCGGG TGGATAGCTG GTAA
 
Protein sequence
MPFQRVLIAN RGEIAIRIAR AAAELGIASI AVFATDEADA PHVSAADEAV ALPGTGPRAY 
LDIAAIVAAA KAAGCDALHP GYGFLAENPA LARACAAIGI AFIGPSPAHL ETFGDKALAR
ALADAKAIPL LEGTGALDLA GAQRFLAREG PMMLKAAAGG GGRGMRPVRD AGEVEAAFAA
CAREALAAFG DGTLYAERLM ENARHIEVQI AGDGTHVAAV GDRDCSLQRR RQKLVEIAPA
PNLPDETRAA LFDAAVRLTA GYRTLATVEF LVAPDGDFAF IEVNARLQVE HTVTEEVTGV
DLVRAQFELA DGASLADVGL ATTPHAQGYA IQGRVNLETL SVDGTPRPST GTITAYQPPG
GPGVRVDGAG AAGLVVTGAY DSLLAKVIGR GRTAAEAAGR TARALAEFRI EGVDTTIPLL
RALLAEPATV DGTATTDFID REAGRLFEAA GAPPQSAEPV FTEDGAVTAP LQALVGMIEV
VEGDLVRPGQ AVAVLEAMKM EHLVHAETGG RVIRVAVTAG EAVRPGQPLL YLEPAEVEEG
EALAAEAVDP DALRPDHAEV IARHRFTLDE ARPEAVAKRR KTGHRTAREN IDDLVDPGSF
LEYGALAIAA QKRRRSVEDL IAATPADGLI TGIGSVNGAL FPPDKARVAA LAYDFTVLAG
TQGAMNHRKS DRLLAIVAEQ RLPVVWFAEG GGGRPGDTDT TAVAGLDVPT FRSMAALSGV
VPKIAIVAGR CFAGNAAIAG LSEIIVATRD SNLGMGGPAM IEGGGLGVFR PEQIGPSAHQ
WANGVIDLLA DDEAHATRLA RQALSYFQGA VKDWTCPDQR PLRRAVPENR LRVYDVRALI
AVLADTSSFL ELRGGFAAGM ITGLVRIEGR PLGLIANDPR HLGGAIDGDG AEKAARFLQL
CDAFGLPVLS LCDTPGFMVG PASEDAGAVR RVSRQFIAGA KLRTPLLTVV TRKGYGLGAQ
AMAGGSFHSP LFIAAWPTGE FGGMGLEGAV RLGYRKELEA ETDPAKQKAL YDQLVARLYA
AGKATSMAAA LEIDAVIDPA DTRRWIVGGL DAAAGTVRPW EVRVDSW