Gene Caul_3049 details

Gene Information

Locus tag: Caul_3049
Symbol:
ID: 5900504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3312252
End bp: 3313478
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 67%
IMG OID: 641563551
Product: 5-aminolevulinate synthase
Protein accession: YP_001684674
Protein GI: 167647011
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes
TIGRFAM ID: [TIGR00858] 8-amino-7-oxononanoate synthase
            [TIGR01821] 5-aminolevulinic acid synthase
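
The length fields in this record agree with each other if the Start bp and End bp coordinates are read as inclusive: 3313478 - 3312252 + 1 = 1227 bp, and 1227 / 3 = 409 codons, one of which is the stop codon, leaving 408 aa. A minimal Python sketch of that check follows. The function names are hypothetical and are not part of any database API; the sketch assumes inclusive coordinates and an unspliced CDS that ends in a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    length = gene_length(3312252, 3313478)
    print(length, protein_length(length))
```

The same two functions can be applied to other CDS records as a quick consistency check between the coordinate and length fields.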


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.192624
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACA AAGCCGCGTT CCGTAACACC GTGGATCAGA TCCGCGACGA AGGCCGCTAT 
CGGGTGTTCG CCGACGTGAA GCGCCATCGC GGCGCGTTCC CGCGCGCCAC CTGGACCCGC
CCGGACGGCG GCGAAAGCGA GATCGTGGTC TGGTGCTCCA ACGACTATCT GGGCCAGGGG
CAGAACCCCC TGGTGCTGGA CGCCATGCAC GCGGCGATCG ACCAGCACGG TTCGGGCTCG
GGCGGCACGC GCAACATCTC GGGCACCAAC CACCACCATG TCGAGCTGGA GGCCGAGCTG
GCCGACCTGC ACGGCAAGGA AGCGGCCCTG CTGTTCACCT CGGGCTACGT CTCCAACGAG
GCCAGCCTGT CGGCCCTGCA GAAGATCCTC CCCGGCCTGA TCATCTTCTC CGACGCCCAG
AACCACGCCT CGATGATCGC CGGCATCCGC AACGGCGGCT GCCAGCGCCA TGTGTTCCGC
CACAACGACC TGGCCCATCT TGAAGAGCTG CTGATCGCCG CCCCGGCCGA CGCGCCCAAG
CTGATCGCCT TTGAGAGCGT CTATTCGATG GACGGCGACA TCGCCGACCT GGCCGGCACC
GTGGCCCTGG CCAAGAAATA CGGCGCCATG ACCTATCTCG ACGAGGTCCA TGCCGTGGGC
ATGTACGGTC CGCGCGGCGG CGGCGTCGCC GAGCGCGACC GCCTGATGGA CCAGATCGAC
ATCATCGAAG GCACCCTGGG CAAGGCCTTC GGCGTGATGG GCGGCTACAT CACCGGCGAC
GCCGTGGTGG TCGACGCCAT CCGTCTGATG GCTTCGGGCT TCATCTTCAC GACATCCCTG
CCGCCGGCGT TGACCGCCGG CGCCTTGGCC AGCGTGAAAT ATCTCAAGCA CCACCCGGAA
GTCCGCGAAG CCCATCAGGA GCGCGCCCAG ACCCTGAAGG CGATGTTCAA GGCCGCCGGC
CTGCCGGTGA TGGAGAACGA CAGCCACATC GTGCCGGTGC TGGTCGGCGA TCCCGTCCAC
TGCAAGCTGA TCAGCGACAT GCTGCTGGCC GACCACGGCG TCTATGTGCA GCCGATCAAC
TACCCGACCG TGCCGCGCGG CACCGAGCGC CTGCGCTTCA CCCCGACGCC GTTCCACACC
GACGACATGA TGCGCAAGCT GGTCGGGGCG ATGGAAACCC TGTGGGCGCA CTGCAACGTG
GCCCGCATGG GCGGCTACGC GGCTTAA
 
Protein sequence
MDYKAAFRNT VDQIRDEGRY RVFADVKRHR GAFPRATWTR PDGGESEIVV WCSNDYLGQG 
QNPLVLDAMH AAIDQHGSGS GGTRNISGTN HHHVELEAEL ADLHGKEAAL LFTSGYVSNE
ASLSALQKIL PGLIIFSDAQ NHASMIAGIR NGGCQRHVFR HNDLAHLEEL LIAAPADAPK
LIAFESVYSM DGDIADLAGT VALAKKYGAM TYLDEVHAVG MYGPRGGGVA ERDRLMDQID
IIEGTLGKAF GVMGGYITGD AVVVDAIRLM ASGFIFTTSL PPALTAGALA SVKYLKHHPE
VREAHQERAQ TLKAMFKAAG LPVMENDSHI VPVLVGDPVH CKLISDMLLA DHGVYVQPIN
YPTVPRGTER LRFTPTPFHT DDMMRKLVGA METLWAHCNV ARMGGYAA
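
The sequences above are printed in space-separated blocks of 10 characters, so whitespace has to be stripped before they are analyzed. The following Python sketch shows one way to recompute the GC content and to translate the gene sequence with translation table 11, whose amino-acid assignments match the standard genetic code. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and chosen for illustration only; the sketch assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence in this record.
    print(translate("ATGGATTACA AAGCCGCGTT"))
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence block and the GC content field.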