Gene Caul_0216 details

Gene Information

Locus tag: Caul_0216
Symbol:
ID: 5897490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 229993
End bp: 231435
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 69%
IMG OID: 641560700
Product: isopropylmalate isomerase large subunit
Protein accession: YP_001681851
Protein GI: 167644188
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
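
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 229993
end_bp = 231435
gene_length_bp = 1443
protein_length_aa = 480

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS is one codon per residue plus a terminal stop codon.
codons = span // 3
expected_residues = codons - 1

print("span (bp):", span)
print("remainder mod 3:", span % 3)
print("expected residues:", expected_residues)
```

Under these assumptions the span equals the listed gene length, is a multiple of three, and implies the listed protein length.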


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.146559
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGCG CACCGAAGAC CCTTTACGAC AAGATCTGGG ACGCCCACGT CGTCAGCCAA 
CTGGACGGCG AGGCGATCCT CTATATCGAC CTGCACCTGA TCCACGAGGT GACCACCCCG
CAGGCCTTCG CCGGGCTCCG CGCGGCCGGC CGCAAGGTGC GCCGGCCCGA CCGCACGCTG
GCCGTGGCCG ATCACAATAT CCCAACCGAG GGCCAGGCCC TGGGCGTCGA CGCCGTGGCC
GACGAAGAGG CGCGCCTGCA ACTCCAGACC CTGGCGCGCA ACGTCAAGGA CAACGGCATA
GAGTTCTTCC CGATGGGCGA CATCCGCAAC GGGATCGTCC ACGTGGTCGG TCCCGAGCAG
GGCCGCACCC AGCCGGGCAT GACCATCGTC TGCGGCGACA GCCACACCTC GACCCACGGC
GCCTTTGGAG CCCTGGCCCA CGGCATCGGG ACCTCCGAGG TCGAGCATGT GCTGGCCACC
CAGACCCTGC GCCAGGAGAA GGCCCGGAAC ATGCTGGTGC GCGTCGATGG CCAGCTGGGT
CCCGGCGTCA CGGCGAAGGA TGTTGCGCTG GCCGTGATCG GCGAGATCGG CACCGCCGGC
GGCACCGGCT ACGTCATCGA GTTCGCCGGC GACGTGGTCC ACGACCTGTC GATGGAAGGC
CGCATGACCC TGTGCAACCT GACCATCGAG GGCGGCGCCA AGGCCGGCCT GGTCGCGCCG
GACGACAAGA CCTTCGCCTA TATCCAGGGC AAGCCTTCGG CGCCGAAAGG CGCGGCCTGG
GACATGGCCC TGTCGTACTG GAAGAGCTTC GTCAGCGACG AGGACGCCCA TTTCGACCGC
ACGGTGGTCA TCGACGGCTC GGCCCTGGTC CCGATGGTCA CCTGGGGCAC CAGCCCCGAG
GACGTCATCC CGGTGACCGG CAATGTTCCA GATCCGGAAA GTTTCGCCAC GCCCGACAAG
CGCGCCGCCG CCCACCGGGC GCTGGACTAT ATGGGCCTGA CCGCCGGCCA GCCGATCTCG
GAAGCCCGCA TCGACCGCGT CTTCATCGGC TCGTGCACCA ACAGCCGGAT CGAAGACATG
CGCGCCGCCG CCGCCGTAGT GCAGGAAGCC TTCCTGCACG GCCGCCTGGT GGCCCCGCAC
GTCAAGGCGA TGGTCGTGCC GGGCTCGGGC CTGGTGAAGG AACAGGCCGA AGAAGAGGGG
CTGGACGCCA TCTTCAAGGC TGCGGGCTTC GACTGGCGCG AGCCGGGCTG CTCGATGTGC
CTGGCCATGA ACCCCGACAA GCTGCAGCCG CACGAACGCT GCGCCTCGAC CAGCAACCGC
AACTTCGAAG GCCGCCAGGG TCGCGCCGGC CGCACCCACC TGGTCTCGCC GGCCATGGCC
GCGGCGGCGG CGATCGCGGG CCATTTGGTC GATGTGCGCG CCCTGCTCGA GGAGACCATC
TGA
 
Protein sequence
MTRAPKTLYD KIWDAHVVSQ LDGEAILYID LHLIHEVTTP QAFAGLRAAG RKVRRPDRTL 
AVADHNIPTE GQALGVDAVA DEEARLQLQT LARNVKDNGI EFFPMGDIRN GIVHVVGPEQ
GRTQPGMTIV CGDSHTSTHG AFGALAHGIG TSEVEHVLAT QTLRQEKARN MLVRVDGQLG
PGVTAKDVAL AVIGEIGTAG GTGYVIEFAG DVVHDLSMEG RMTLCNLTIE GGAKAGLVAP
DDKTFAYIQG KPSAPKGAAW DMALSYWKSF VSDEDAHFDR TVVIDGSALV PMVTWGTSPE
DVIPVTGNVP DPESFATPDK RAAAHRALDY MGLTAGQPIS EARIDRVFIG SCTNSRIEDM
RAAAAVVQEA FLHGRLVAPH VKAMVVPGSG LVKEQAEEEG LDAIFKAAGF DWREPGCSMC
LAMNPDKLQP HERCASTSNR NFEGRQGRAG RTHLVSPAMA AAAAIAGHLV DVRALLEETI
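
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the pipeline that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# These assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA string codon by codon; stop codons appear as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence as printed in this record.
first_row = "ATGACCCGCG CACCGAAGAC CCTTTACGAC AAGATCTGGG ACGCCCACGT CGTCAGCCAA"
print(translate(first_row))
```

Pasting the full gene sequence into `translate` and `gc_fraction` gives a way to compare against the listed protein sequence and GC content. The printed protein sequence omits the final stop codon, so a trailing `*` in the translation would need to be stripped before comparing.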