Gene Information |
Locus tag | Caul_4464 |
Symbol | |
ID | 5901925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4836467 |
End bp | 4839670 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641564983 |
Product | pyruvate carboxylase, methylmalonyl-CoA carboxytransferase |
Protein accession | YP_001686082 |
Protein GI | 167648419 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit; [COG4799] Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) |
TIGRFAM ID | |
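The start and end coordinates, gene length, and protein length listed in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_4464.
length_bp = gene_length(4836467, 4839670)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the 3204 bp gene length and 1067 aa protein length given in the table.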
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0495259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTTCC AGCGCGTCCT GATCGCCAAT CGCGGCGAGA TCGCCATCCG CATCGCTCGC GCGGCGGCCG AGCTGGGGAT CGCCAGCATT GCGGTGTTCG CCACCGACGA GGCCGACGCG CCGCACGTGT CGGCGGCGGA CGAAGCGGTC GCCCTGCCTG GGACCGGCCC CCGAGCCTAT CTCGACATCG CCGCCATCGT GGCCGCCGCC AAGGCCGCCG GCTGCGACGC CCTGCATCCC GGCTACGGCT TCCTGGCGGA GAACCCGGCC CTGGCCCGGG CGTGCGCGGC GATCGGGATC GCCTTCATCG GCCCCTCTCC CGCGCACCTG GAAACCTTCG GCGACAAGGC CTTGGCCCGC GCCCTGGCCG ACGCCAAGGC CATCCCCCTG CTGGAAGGGA CCGGCGCCTT GGACCTGGCC GGCGCCCAGA GGTTCCTGGC GCGCGAGGGG CCGATGATGC TAAAGGCGGC GGCCGGCGGC GGCGGGCGCG GCATGCGGCC GGTGCGCGAC GCCGGCGAGG TCGAGGCCGC CTTCGCCGCC TGCGCCCGCG AGGCCCTGGC GGCGTTCGGC GACGGCACGC TCTATGCCGA GCGGCTGATG GAAAACGCCC GCCATATCGA GGTGCAGATC GCGGGCGACG GGACGCACGT GGCGGCGGTG GGCGACCGCG ACTGCTCGCT GCAGCGCCGT CGGCAGAAGC TGGTCGAAAT CGCCCCCGCC CCGAACCTGC CCGATGAGAC CCGCGCGGCC CTGTTCGACG CCGCCGTCCG CCTGACCGCC GGCTATCGCA CCCTGGCGAC AGTGGAGTTC CTGGTCGCTC CTGACGGCGA TTTCGCGTTC ATCGAGGTCA ACGCCCGGCT CCAAGTCGAG CATACGGTGA CCGAGGAGGT GACGGGCGTC GACCTGGTCC GCGCCCAGTT CGAGCTGGCC GACGGGGCCA GCCTGGCGGA CGTCGGGCTG GCGACCACGC CGCACGCCCA GGGCTACGCG ATCCAGGGCC GGGTCAATCT GGAGACCCTA AGCGTCGACG GGACGCCGCG ACCCTCGACC GGCACGATCA CCGCCTACCA GCCGCCCGGC GGTCCTGGCG TTCGGGTCGA CGGCGCGGGC GCGGCGGGAC TGGTCGTCAC CGGCGCCTAT GACAGCCTGC TGGCCAAGGT CATCGGGCGC GGCCGGACGG CGGCCGAGGC GGCGGGACGC ACGGCCCGGG CCCTGGCCGA GTTCCGCATC GAGGGCGTCG ACACCACGAT CCCGCTGCTA CGCGCGCTGC TGGCCGAACC CGCGACCGTG GACGGAACCG CGACCACCGA CTTCATCGAC CGCGAGGCCG GCCGGCTGTT CGAGGCCGCC GGCGCCCCGC CCCAAAGCGC CGAACCCGTC TTCACCGAGG ACGGCGCGGT GACGGCTCCG CTCCAGGCCC TGGTCGGGAT GATCGAGGTG GTCGAGGGCG ACTTGGTGCG GCCGGGCCAG GCGGTGGCGG TGCTCGAGGC CATGAAGATG GAGCACCTGG TCCATGCCGA GACCGGCGGC CGAGTGATCC GGGTCGCCGT CACCGCCGGC GAGGCCGTGC GGCCCGGCCA GCCCCTGCTC TATCTGGAAC CGGCCGAGGT CGAGGAGGGC GAGGCCCTGG CGGCCGAGGC GGTCGATCCC GACGCCCTTC GCCCCGACCA CGCCGAGGTG ATCGCCCGCC ACCGATTCAC GCTCGACGAG GCCCGGCCCG AGGCGGTCGC CAAGCGCCGC AAGACCGGCC ACCGCACGGC CCGGGAAAAC ATCGACGACC TGGTCGATCC CGGCAGCTTC 
CTCGAATACG GCGCCCTGGC CATCGCCGCC CAGAAGCGCC GGCGCTCGGT CGAGGACCTG ATCGCCGCCA CGCCCGCCGA CGGCCTGATC ACCGGCATCG GCAGCGTCAA CGGCGCCCTC TTCCCACCCG ACAAGGCCCG GGTCGCGGCC CTGGCCTATG ACTTCACCGT GCTGGCCGGC ACGCAAGGGG CGATGAACCA CCGCAAGTCC GACCGGCTGC TGGCCATCGT CGCCGAGCAA CGCCTGCCAG TCGTCTGGTT CGCCGAGGGC GGCGGCGGGC GGCCGGGCGA CACCGACACC ACGGCGGTGG CGGGGCTGGA CGTGCCGACC TTCCGCAGCA TGGCGGCCCT GTCCGGCGTC GTGCCCAAGA TCGCCATCGT CGCCGGCCGC TGTTTCGCCG GCAACGCCGC CATCGCCGGC CTGTCGGAGA TCATCGTCGC CACCCGGGAC TCCAACCTCG GCATGGGCGG GCCGGCGATG ATCGAGGGCG GCGGCCTGGG CGTCTTCCGG CCCGAACAGA TCGGCCCCTC CGCCCACCAG TGGGCCAACG GCGTGATCGA CCTGCTGGCC GACGACGAGG CCCACGCCAC GCGGCTGGCC AGGCAGGCGC TGTCCTATTT CCAGGGCGCG GTGAAGGACT GGACCTGTCC GGACCAGCGC CCCTTGCGTC GCGCGGTGCC GGAGAACCGG CTGCGGGTCT ATGACGTGCG GGCGCTGATC GCGGTTCTGG CGGACACCAG CTCCTTCCTG GAACTGCGCG GCGGCTTCGC GGCGGGGATG ATCACCGGCC TGGTGCGGAT CGAGGGCCGG CCGCTGGGGC TGATCGCCAA CGACCCGCGC CACCTGGGCG GGGCGATCGA CGGCGACGGG GCCGAGAAGG CCGCGCGGTT CCTGCAGCTG TGCGACGCCT TCGGCCTGCC GGTGCTGAGC CTGTGCGACA CCCCCGGCTT CATGGTCGGG CCGGCGAGCG AGGACGCGGG CGCGGTGCGG CGGGTCAGCC GGCAGTTCAT CGCCGGGGCC AAGCTGCGCA CGCCGCTGCT GACCGTGGTC ACCCGAAAGG GCTATGGCCT GGGCGCCCAG GCCATGGCGG GGGGCAGCTT CCACAGCCCC CTGTTCATCG CCGCCTGGCC GACCGGCGAG TTCGGCGGCA TGGGCCTGGA GGGCGCCGTG CGGCTGGGCT ACCGCAAGGA GCTGGAGGCC GAGACCGATC CCGCCAAGCA GAAGGCGCTC TACGACCAGC TGGTGGCGCG GCTCTACGCG GCGGGCAAGG CGACCAGCAT GGCGGCGGCC CTGGAGATCG ACGCGGTGAT CGACCCGGCC GATACGCGGC GCTGGATCGT TGGCGGGCTG GACGCGGCGG CGGGGACGGT GCGGCCGTGG GAAGTGCGGG TGGATAGCTG GTAA
|
Protein sequence | MPFQRVLIAN RGEIAIRIAR AAAELGIASI AVFATDEADA PHVSAADEAV ALPGTGPRAY LDIAAIVAAA KAAGCDALHP GYGFLAENPA LARACAAIGI AFIGPSPAHL ETFGDKALAR ALADAKAIPL LEGTGALDLA GAQRFLAREG PMMLKAAAGG GGRGMRPVRD AGEVEAAFAA CAREALAAFG DGTLYAERLM ENARHIEVQI AGDGTHVAAV GDRDCSLQRR RQKLVEIAPA PNLPDETRAA LFDAAVRLTA GYRTLATVEF LVAPDGDFAF IEVNARLQVE HTVTEEVTGV DLVRAQFELA DGASLADVGL ATTPHAQGYA IQGRVNLETL SVDGTPRPST GTITAYQPPG GPGVRVDGAG AAGLVVTGAY DSLLAKVIGR GRTAAEAAGR TARALAEFRI EGVDTTIPLL RALLAEPATV DGTATTDFID REAGRLFEAA GAPPQSAEPV FTEDGAVTAP LQALVGMIEV VEGDLVRPGQ AVAVLEAMKM EHLVHAETGG RVIRVAVTAG EAVRPGQPLL YLEPAEVEEG EALAAEAVDP DALRPDHAEV IARHRFTLDE ARPEAVAKRR KTGHRTAREN IDDLVDPGSF LEYGALAIAA QKRRRSVEDL IAATPADGLI TGIGSVNGAL FPPDKARVAA LAYDFTVLAG TQGAMNHRKS DRLLAIVAEQ RLPVVWFAEG GGGRPGDTDT TAVAGLDVPT FRSMAALSGV VPKIAIVAGR CFAGNAAIAG LSEIIVATRD SNLGMGGPAM IEGGGLGVFR PEQIGPSAHQ WANGVIDLLA DDEAHATRLA RQALSYFQGA VKDWTCPDQR PLRRAVPENR LRVYDVRALI AVLADTSSFL ELRGGFAAGM ITGLVRIEGR PLGLIANDPR HLGGAIDGDG AEKAARFLQL CDAFGLPVLS LCDTPGFMVG PASEDAGAVR RVSRQFIAGA KLRTPLLTVV TRKGYGLGAQ AMAGGSFHSP LFIAAWPTGE FGGMGLEGAV RLGYRKELEA ETDPAKQKAL YDQLVARLYA AGKATSMAAA LEIDAVIDPA DTRRWIVGGL DAAAGTVRPW EVRVDSW
|
| |
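The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as checking the reported GC content. The following is a minimal sketch, assuming the input contains only A, C, G, T and whitespace; the helper names are hypothetical, and the sample is only the first three blocks of the gene sequence, so its GC fraction will differ from the whole-gene figure in the table.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base blocks of the gene sequence, copied from the record.
sample = "ATGCCCTTCC AGCGCGTCCT GATCGCCAAT"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))
```

Pasting the full gene sequence into `sample` would allow the cleaned length to be compared with the listed gene length and the GC fraction with the listed GC content.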