Gene Caul_2723 details


Gene Information

Locus tag: Caul_2723
Symbol:
ID: 5900178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2957998
End bp: 2959788
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 71%
IMG OID: 641563215
Product: thiamine pyrophosphate protein central region
Protein accession: YP_001684348
Protein GI: 167646685
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
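
The start, end, gene length and protein length fields in this record agree with each other if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. Both of those are assumptions here, not statements from the record, and the function names below are hypothetical. A minimal Python sketch of the arithmetic:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming whole codons and one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2957998, 2959788)
print(length, protein_length_aa(length))  # prints: 1791 596
```

The printed values match the Gene Length and Protein Length fields.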


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.974431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCCTT CTCGTTCTCC GATCCGTCCC GAGGCCGAGC GCAACGTCGC CGCCTATGCG 
GTCGATCTGG CCGCGGCGCT GGGCGCCCGC GCGGCCTTCA CCCTGACGGG CGGGATGGCG
ATGTACCTGA ACCGGGCGGT CTCGACCCAT CCCGGCCTGA CGGCGGTCTA CAACCAGCAC
GAACAGGCCT GCGCCGCCGG CGCAGAGGGC TACGCCAAGG CCGCTGACTT CCGGGTCGCG
GGCCTGGCGG TGGTCACGGC CGGCCCGGGG GTCACCAACA CCATCACCAG CCTGTGCTCG
GCCTATGGCG ACAGCGCGCC GGTCATCGTG CTGGCCGGCC AGGTCAAGAC CGCCGACATT
GATCCCTTCG GCACGCGCAC CCATGGGGCG CAGGAGGTGC GGTCGCGCGA ACTGGTCAGC
CCCTGCGTCA AGCGCTTCGT CCGCCTCAGC GCCGATGGTT TTCGGGACGA ACTGGTCGAG
ACCTTCGCCG AGGCCTTCAC CGGCCGGCCG GGTCCGGTGT TCGTGGAGAT CCCGCTGGAC
GTCCAGAACC TGGTCATCGG CTACGACCCC GAGGATGTCG CCCAGGCCGT CGAAGCGATC
CGGGCTCGCA TCGCCGCCGA TGTCGATGGT TCGGACGCGG CGTCGCTGGC CGAGGCCCTG
GCCTGGTTGC TCGCGGGCCA GCGGCCGTTG GTCTATGTCG GCAACGGCTG CCGCGTGGCC
GGAGCGGGCG GCGCGGTGCG GGCCTTCCTG GAAGCGCACG GCGTGCCGGC GGTGTTCTCC
TGGCTGTCGC TCGACCAGCT TGCGGCCGAT CATCCGCTGA ACTTCGGCGC GCCGGGCGGG
CTGGCCCCGT TGTCGGCCAA CCAGGTGCTG TACCAGGCCG ACCGAGTGCT TTTCCTGGGC
GCGCGGCTGG ACCTGGGCAC GACGGCCTTC CAGCGCCACG ATTTCGGCGC CCAGGCCGAG
CGCGTGGTCG TCGACGTCGA TCCCGCCGAG TTGGCCAAGT TCGCGGGCCT GCCGAACACG
CGCGCCGTGC GCGCCGACCT GGCCGCCCTG TCGCCCGCCG TCCTGGCGAC GCAGGACGCG
CCAAGCGCCA TGGCCGCCGA CTGGCTCGCC TGGTGCCAGG CGCGACGCGC CGAGTACCTG
GCCGACGAGC GCCGGCGACT GACGGTGGAC AGCCTCAACG TCTTCGGCGT CGCCCAGCGC
CTGTCGGCCT GGTCGGCGGG CAAGGTGTTC GTGCCGACCG GTTCGGGCTC GGCCATCGAG
GCCTTCATCC GGTTCTTCGC CCCGGGCGAG GACAGCCGCT GCTTCTTCGG CGCCTCGTTG
GGCGCCATGG GCCTGGGCCT GCCGGCGGCG GTGGGCGCGG CCTTCGCCAC CGATCGCCGG
GTAATCTGCG TCGAGGCCGA CGGCGGCTTG ATGCTCAATA TCCAGGAGCT GGCGACCCTG
GCGCACTACG CGCCCAAGGG CTTCGTCCTG TTCGTGCTCA ATAACGACGG CTACACCTCG
ATCCACGCTT CGCAGAGCCG GCATTTCGGC GCGGTGGGCG GTGCGGGGCC AGACTCCGGC
GTGTTCATTC CCGACTACGG CAAGGTCGCT CCGGCCTTTG GCCTGCGCTA CGTTCGCATC
GACAGCCTCG CCGCGCTGGA CGCCTTGCTG CCTACCCTCG ACGCGGACGC CGCGCCGGTG
TTCGTGGATC TGATCATCGA CCGCGCCGAG TCGCGGGGTC CGACCGTCAA GACGGTGATC
TCGCCCGACG GCAAGCTGTC CTCGACGCCG CTGTCCGAGA TTCAGTGGTA G
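
The GC content field can be recomputed from a gene sequence laid out in space-separated blocks of ten bases, as above. The following is a minimal Python sketch; the function name `gc_percent` is hypothetical, and how the database rounds the value to a whole percent is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of ten bases of the gene sequence, used only as a small example.
print(gc_percent("ATGAGTCCTT"))  # prints: 40.0
```

Passing the full gene sequence (all lines joined into one string) gives the figure for the whole gene.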
 
Protein sequence
MSPSRSPIRP EAERNVAAYA VDLAAALGAR AAFTLTGGMA MYLNRAVSTH PGLTAVYNQH 
EQACAAGAEG YAKAADFRVA GLAVVTAGPG VTNTITSLCS AYGDSAPVIV LAGQVKTADI
DPFGTRTHGA QEVRSRELVS PCVKRFVRLS ADGFRDELVE TFAEAFTGRP GPVFVEIPLD
VQNLVIGYDP EDVAQAVEAI RARIAADVDG SDAASLAEAL AWLLAGQRPL VYVGNGCRVA
GAGGAVRAFL EAHGVPAVFS WLSLDQLAAD HPLNFGAPGG LAPLSANQVL YQADRVLFLG
ARLDLGTTAF QRHDFGAQAE RVVVDVDPAE LAKFAGLPNT RAVRADLAAL SPAVLATQDA
PSAMAADWLA WCQARRAEYL ADERRRLTVD SLNVFGVAQR LSAWSAGKVF VPTGSGSAIE
AFIRFFAPGE DSRCFFGASL GAMGLGLPAA VGAAFATDRR VICVEADGGL MLNIQELATL
AHYAPKGFVL FVLNNDGYTS IHASQSRHFG AVGGAGPDSG VFIPDYGKVA PAFGLRYVRI
DSLAALDALL PTLDADAAPV FVDLIIDRAE SRGPTVKTVI SPDGKLSSTP LSEIQW
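
The protein sequence can be checked against the gene sequence by translating codons in frame. Below is a minimal Python sketch with hypothetical names. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons; because this gene starts with ATG, the sketch does not handle alternative start codons.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 51 bases of the gene sequence in this record.
print(translate("ATGAGTCCTT CTCGTTCTCC GATCCGTCCC"))  # prints: MSPSRSPIRP
print(translate("TCGCCCGACG GCAAGCTGTC CTCGACGCCG CTGTCCGAGA TTCAGTGGTA G"))  # prints: SPDGKLSSTPLSEIQW
```

Both outputs match the start and end of the protein sequence listed above.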