Gene Caul_1443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1443 
Symbol 
ID5898898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1533967 
End bp1535397 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content71% 
IMG OID641561930 
Productpyruvate kinase 
Protein accessionYP_001683071 
Protein GI167645408 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0469] Pyruvate kinase 
TIGRFAM ID[TIGR01064] pyruvate kinase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.434005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCG CGCGCCGCTC CCGCATTGTC GCCACCATCG GCCCCGCCAG CAGCTCGCAT 
GAGATGATCG TGAAGCTGGC CAAGGCCGGG GCCGACGTCT TCCGCCTGAA CTTCAGCCAC
GGCAGCCACG ACAACCACGC CGCCGCCTAC GCCGCCATCC GCGCGGCCGA GGCGGTGGTC
GGCCGTCCGC TGGGCATCCT GGCCGACCTC CAGGGGCCGA AGCTGCGTGT GGGCAAGTTC
GCCAACGGCC CGGTGACGCT GAACGCCGGC CAGGCCTTCC GCTTCGACAA CGATCCGACA
CCGGGCGATG AAACCCGCGT GCACCTGCCG CATCCGGAAA TCCTGGTGGC CATGCGCCCC
GGCGCGACCC TGCTGCTAGA CGACGGCAAG CTGCGGATGA CCGTCACGGA CGCCGGCCCT
GGCTACGCCA ACACCAAGGT GGTCAATGGC GGCAAGCTGT CGGAGCGCAA GGGCGTGGCC
GTGCCCGACG TCGTGATCCC GATGTCGCCG CTTACCCCGA AGGACCGCGA GGACCTGGCC
TTCGCCCTGC GCCTGGGCGT GGACTGGATC GCCCTGTCGT TCGTGCAGGC TCCCGAGGAC
ATGGCCGAGC TGCGCCGCAT CGTCGAGGGC CGCGCCGCCG TGCTGGCCAA GATCGAGAAG
CCCCAGGCCC TGGAAGTGCT GGGTCCGATC CTCGACCTCT GCGACGGCGT GATGGTGGCC
CGGGGCGACC TGGGCGTCGA GATGGCCCCG GAAGAGGTGC CGGTGGCCCA GAAGGTCATC
CTGCGCGCCG CTCGCGAGCG CGGCATTCCG GTGATCGTCG CCACCCAGAT GCTGGAGTCC
ATGACCAGTT CGCCGACCCC GACCCGAGCC GAGGCCTCGG ACGTGGCCAA CGCCGTCTAC
GAGGGCGCCG ACGCGGTGAT GCTGTCGGCC GAAAGCGCGG CCGGAGATTA TCCTGAAGAA
TCCGTGGCGA TGATGAGCCG GATCATCGAG CGGGTGGAGC GCGATCCGCG CTGGCCCGAG
CTGATGCAGG CCGAGCAGCC GCACGACGAC GACGACGCCG ACGTTCTGGT GGTCGCCGCC
GCCCAGGCCG CCAAGGCCGG CTCGACCAAG TGCCTGGTAG CCTTCACGAC GACCGGCGCC
ACCGCCCGTC GCCTGGCGCG CGAGCGGCCG CTGCAGCCGG TTCTGGCCCT GTCGCCGCAG
ATCGACGCCG TGCGCCGCAT GTGCCTGGTC TGGGGAGTCG AGGCTCGCGT CAGCGGCCAG
CCCGACAGCC TGGAGGTCGT CACCTCCGAC GCCGTGGCCA AGGCGGTGGA CCTGGGCTTG
GTCGGTCCGG GCGAGCGCGT GCTGATCGTC GCCGGAACGC CGTTCGGCGC CCCCGGCGCG
GCCAACCTGC TGCGCCTGGC CCACGCGCCG TTCCCGACGC GCAAGCGGTA G
 
Protein sequence
MIRARRSRIV ATIGPASSSH EMIVKLAKAG ADVFRLNFSH GSHDNHAAAY AAIRAAEAVV 
GRPLGILADL QGPKLRVGKF ANGPVTLNAG QAFRFDNDPT PGDETRVHLP HPEILVAMRP
GATLLLDDGK LRMTVTDAGP GYANTKVVNG GKLSERKGVA VPDVVIPMSP LTPKDREDLA
FALRLGVDWI ALSFVQAPED MAELRRIVEG RAAVLAKIEK PQALEVLGPI LDLCDGVMVA
RGDLGVEMAP EEVPVAQKVI LRAARERGIP VIVATQMLES MTSSPTPTRA EASDVANAVY
EGADAVMLSA ESAAGDYPEE SVAMMSRIIE RVERDPRWPE LMQAEQPHDD DDADVLVVAA
AQAAKAGSTK CLVAFTTTGA TARRLARERP LQPVLALSPQ IDAVRRMCLV WGVEARVSGQ
PDSLEVVTSD AVAKAVDLGL VGPGERVLIV AGTPFGAPGA ANLLRLAHAP FPTRKR