Gene Caul_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0213 
Symbol 
ID5897487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp227881 
End bp228933 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content70% 
IMG OID641560697 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_001681848 
Protein GI167644185 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.209682 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGC TTCTGCTCCT GCCCGGCGAC GGGATCGGCC CCGAGGTTTG CGGGGAGGTG 
CGCCGGGTGG CCGCCGCGCT CACGCCCGAC CTGATGATCG CCGAAGCCCT CTATGGCGGC
GCCAGCTACG ATGTGCATGG CGCGCCCCTG ACCGACGACG TCCGCGACCA GGCCCTGGCC
AGCGACGCGG TGCTGATGGG CGCTGTCGGC GGTCCCAAGT GGAAGGACGC CCCCCGCCAC
CTGCGCCCCG AGGCGGGCCT GCTGCAGCTG CGCAAGGACA TGGACGTCTA CGCCAACCTG
CGCCCGGCCT ACTGCTTCGA GGCCCTGGCC GACGCCTCCA GCCTCAAGCG CGAGCTGGTT
TCGGGCCTGG ACATCATGTT CGTCCGTGAA CTGACAGGCG GGGTCTATTT CGGCCAGCCA
CGCGGCATCG AGGATCTCGG CAACGGCCAG AAGCGCGGCG TCGACACCCA GGTCTACACC
ACCGCCGAGA TCGAGCGCGT GGCCCGGGTG GCCTTCGAAC TGGCGCGCGG CCGGTCCAAT
CGCGTCGCCT CGGCCGAGAA GTCGAATGTC ATGGAGTCGG GACTGCTATG GCGGGAGGTC
GTCACCAACC TCCACGCCAA GGAATATGCC GACGTCCAGT TGGAGCACAT CCTGGCCGAC
AACTGCGCCA TGCAGCTGGT CCGCGCGCCC AAGCAGTTCG ACGTGATCGT CACCGACAAC
CTGTTTGGCG ACATCCTGTC GGACGCCGCG GCGATGCTGA CCGGCTCGCT GGGCATGCTG
CCCTCAGCGG CGCTGGGCGC GGCGGGCAAG CCGGGCCTCT ATGAGCCGAT CCACGGCTCG
GCCCCCGACA TCGCGGGCCA AGGCGTGGCC AATCCGCTGG CCGCCATCCT GTCGTTCGAG
ATGGCCCTGC GCTGGTCGCT GAACCGCGCC GACGCGGCCG ACACCCTGTT GGCGGCGGTC
AAGGCGGCGC TGGACGGCGG TGCGCGGACG CGTGACCTGG GCGGGTCGTT GTCGACCGCC
GAGATGGGCG ATGCGGTGCT GAAGAGGCTC TAA
 
Protein sequence
MSTLLLLPGD GIGPEVCGEV RRVAAALTPD LMIAEALYGG ASYDVHGAPL TDDVRDQALA 
SDAVLMGAVG GPKWKDAPRH LRPEAGLLQL RKDMDVYANL RPAYCFEALA DASSLKRELV
SGLDIMFVRE LTGGVYFGQP RGIEDLGNGQ KRGVDTQVYT TAEIERVARV AFELARGRSN
RVASAEKSNV MESGLLWREV VTNLHAKEYA DVQLEHILAD NCAMQLVRAP KQFDVIVTDN
LFGDILSDAA AMLTGSLGML PSAALGAAGK PGLYEPIHGS APDIAGQGVA NPLAAILSFE
MALRWSLNRA DAADTLLAAV KAALDGGART RDLGGSLSTA EMGDAVLKRL