Gene Caul_3381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3381 
Symbol 
ID5900836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3653985 
End bp3655004 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content65% 
IMG OID641563887 
Productketol-acid reductoisomerase 
Protein accessionYP_001685006 
Protein GI167647343 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0059] Ketol-acid reductoisomerase 
TIGRFAM ID[TIGR00465] ketol-acid reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.392134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCT ATTATGATCG CGACGCCGAC CTCGCTCGTA TCCTGGACAA GAAGATCGCG 
ATCGTAGGCT ATGGTTCGCA AGGTCACGCG CACGCCCTCA ACCTTCGGGA TTCGGGCGCG
ACCAATGTCG CCGTCGCCCT GCGCGCCGGC TCGCCGACCG CCAAGAAGGC GCAGGGCGAG
GGCCTGAAGG TCATGACCGT GGCCGAAGCC GCCGCCTGGG CCGACCTGCT GATGATCCTG
GCGCCCGACG AGCATCAGGC CGCGATCTAC AAGAACGACA TCGCGCCCAA CATCCGCGAC
GGCGCGGCCC TGCTGTTCGC CCACGGCCTG AACGTCCACT TCGGCCTGAT CGAGCCCAAG
GACACCATCG ACGTGCTGAT GGTCGCCCCC AAGGGCCCCG GCCACACCGT GCGCGGCGAG
TATCAAAAGG GCGGCGGCGT GCCCTGCCTG ATCGCGGTGC ACCACAACGC CACCGGCAAC
GCCCTGGACC TCGGCCTGGC CTATGCCAGC GCCATCGGCG GCGGCCGTTC GGGCATCATC
GAGACCAACT TCCGCGAGGA ATGCGAAACC GACCTGTTCG GCGAGCAGGC CGTCCTCTGC
GGCGGCACGG TCGACCTGAT CCGCTGCGGC TTCGAAGTGC TGGTGGAAGC CGGCTACGCG
CCGGAAATGG CCTATTTTGA GTGCCTGCAC GAACTGAAGC TGATCGTCGA CCTGATCTAT
GAAGGCGGGA TCGCCAACAT GAACTACTCG ATCAGCAACA CGGCCGAATA CGGCGAATAC
GTCACCGGTC CGCGCATCGT CACCGCCGAG ACAAAGGCCG AGATGAAGCG CGTGCTGGAA
GACATCCAGT CGGGCAAGTT CGTCCGCGAC TTCATGCTGG AAAACGCCGT CGGCCAGCCC
TCGTTCAAGG CCACCCGCCG TCGCGCGAGC GAACACCAGA TCGAGGAAGT CGGCGCCCGC
TTGCGCGGCA TGATGCCCTG GATCGCCAAG AACAAGCTGG TGGACGTGAC CAAGAACTAG
 
Protein sequence
MRVYYDRDAD LARILDKKIA IVGYGSQGHA HALNLRDSGA TNVAVALRAG SPTAKKAQGE 
GLKVMTVAEA AAWADLLMIL APDEHQAAIY KNDIAPNIRD GAALLFAHGL NVHFGLIEPK
DTIDVLMVAP KGPGHTVRGE YQKGGGVPCL IAVHHNATGN ALDLGLAYAS AIGGGRSGII
ETNFREECET DLFGEQAVLC GGTVDLIRCG FEVLVEAGYA PEMAYFECLH ELKLIVDLIY
EGGIANMNYS ISNTAEYGEY VTGPRIVTAE TKAEMKRVLE DIQSGKFVRD FMLENAVGQP
SFKATRRRAS EHQIEEVGAR LRGMMPWIAK NKLVDVTKN