Gene Caul_3314 details


Gene Information

Locus tag: Caul_3314
Symbol:
ID: 5900769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3595657
End bp: 3597573
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 70%
IMG OID: 641563820
Product: 1-deoxy-D-xylulose-5-phosphate synthase
Protein accession: YP_001684939
Protein GI: 167647276
COG category: [H] Coenzyme transport and metabolism; [I] Lipid transport and metabolism
COG ID: [COG1154] Deoxyxylulose-5-phosphate synthase
TIGRFAM ID: [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase
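
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The Python sketch below assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3595657
end_bp = 3597573
reported_gene_length = 1917      # bp
reported_protein_length = 638    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length == reported_gene_length)        # coordinates agree with gene length
print(remainder == 0)                             # gene length is a whole number of codons
print(protein_length == reported_protein_length)  # codons minus stop agree with protein length
```

Under these assumptions the three checks agree with the reported values: 1917 bp is 639 codons, which is 638 residues plus one stop codon.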


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.855639
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGCCT TGACCCCTCT CCTCGACACC ATCTCCTCGC CGGCCGACAC GCGCGGTTTC 
TCCGTGGCCG AACTGAAGCA ACTGGCGAGC GAGGTGCGGG CCGAGACCAT CGACGCCGTA
TCCGTGACCG GCGGTCACCT GGGCGCGGGC CTCGGCGTGG TCGAGCTGAC CGTCGCCCTG
CACCACGTGT TCGAGACGCC CAAGGACATC GTCATCTGGG ACGTTGGCCA CCAAGCCTAT
CCGCACAAGA TCCTCACCGG CCGCCGCGAC CGCATCAAGA CCCTGCGCCA GGGCGGCGGG
CTGTCGGGCT TCACCAAGCG GGCCGAGAGC GTCTACGACC CGTTCGGCGC GGCCCACGCG
GCCACCTCGA TCTCGGCGGC GCTCGGCTTC TGCGCCGCTC GCGACGCCAA GGGCGAGAAC
AACAGCGTCA TCGCGGTGAT CGGCGACGGT TCGATGAGCG CCGGCATGGC CTATGAGGCG
ATGAACGCGG CGGTCGACAC CACCAAGCGC CTGATCGTCA TCCTCAACGA CAACGACATG
TCGATCGCCC CGCCGGTGGG CGGCATGAGC GCCTATCTGG CCAATCTCGT CTCGGGCGGC
GCCTATCGCA AGGTGCGCAA GCTGGGCAAG ACCGTGGTCG AGAAGCTGCC GACCCCGATC
CGCGACGCGG CCCGCAAGGC CGAGGAATAT GCCCGGGGCA TGGTCACCGG CGGCACCTTC
TTCGAGGAGC TGGGCTTCCA TTATGTCGGC CCGATCGACG GCCACGACAT GGACGCCCTG
GTCAGCGTGC TCAACAACGT CAAGCACTTC GACGAGAAGC CGGTCCTGGT CCACGTCGTC
ACCCAGAAGG GCAAGGGCTA CGCCCCGGCC GAGGGCGCGG CCGACAAGCT GCACGCGGTG
GTCAAGTTCG ACGTGGTCAC CGGCCAGCAG CACAAGGCGG CCGCCGGTCC GCCCAGCTAC
ACCAAGGTCT TCGCCCAGGA GCTGATCAAA CACGCGGCGG TCGACGACAA GATCATGGCC
ATCACCGCGG CCATGCCCTC GGGCACGGGC CTGGATTTGT TCGGCAAGGC CTATCCCGAG
CGCACCTTCG ACGTCGGCAT CGCCGAGCAG CACGCGGTGA CCTTCGCCGC CGGCCTGGCC
GCCGACGGCA TGAAGCCGTT CGTGGCGATC TATTCGACCT TCCTGCAGCG CGGCTATGAC
CAGGTCGTCC ACGACGTGGC GATCCAGCGC CTGCCGGTGC GCTTCGCCAT GGACCGCGCC
GGCCTGGTCG GCGCCGACGG CCCGACCCAT GCCGGCACCT TCGACCTCGG CTTCATGGGG
GCGCTGCCCG GCATGGTGCT GATGGCCGCC GCCGACGAGG TCGAGCTGGC CCACATGGTC
TCGACCGCCG TGGCCATCGA CGACCGCCCC AGCGCCTTCC GCTATCCCCG CGGCGAGGGC
CTGGGCCTGA CCATCCCCGA CCTGGCCGCC CCGCTGGAGA TCGGCAAGGG CCGCATCGTT
CGCGAAGGAA CCAGCGTCGC CATCGTCTCG CTGGGCACGC GCCTGGCCGA ATGCCTGAAG
GCCGCCGACC TGCTGGCCGC GCGCGGTCTC TCCGCCACCG TCGCCGACGC CCGCTTCGCC
AAGCCGCTGG ATGTCGACAT GCTGCTGCGC CTGGCCCGCG AGCACGAGGC GATCATCACG
GTGGAAGAGG GCGCCATGGG CGGCTTCGGA GCCTTCGTGC TGCAGGCCCT GGCCACGCAC
GGGGCTCTCG ATCGCGGCCT GAAGATCCGC ACCCTGGTGC TGCCCGACGT CTTCCAGGAC
CAGGACAAGC CCGACCTGAT GTACGCCCAG GCCGGCCTCG ACGCCGAGGG CATCCTGCGC
GGGGCGCTGT CGGCGCTCGG CATCGACAAC ATCAGCGCGG CGGGGCGGCG GGCCTAG
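
The gene sequence is displayed in blocks of ten bases. As a minimal Python sketch, the blocks can be joined into one string and the GC fraction computed from it. The helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first displayed line is used here, so the result is not expected to equal the 70% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence display into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGCTCGCCT TGACCCCTCT CCTCGACACC ATCTCCTCGC CGGCCGACAC GCGCGGTTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full set of sequence lines to `clean_sequence` instead of one line would give the figure for the whole gene.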
 
Protein sequence
MLALTPLLDT ISSPADTRGF SVAELKQLAS EVRAETIDAV SVTGGHLGAG LGVVELTVAL 
HHVFETPKDI VIWDVGHQAY PHKILTGRRD RIKTLRQGGG LSGFTKRAES VYDPFGAAHA
ATSISAALGF CAARDAKGEN NSVIAVIGDG SMSAGMAYEA MNAAVDTTKR LIVILNDNDM
SIAPPVGGMS AYLANLVSGG AYRKVRKLGK TVVEKLPTPI RDAARKAEEY ARGMVTGGTF
FEELGFHYVG PIDGHDMDAL VSVLNNVKHF DEKPVLVHVV TQKGKGYAPA EGAADKLHAV
VKFDVVTGQQ HKAAAGPPSY TKVFAQELIK HAAVDDKIMA ITAAMPSGTG LDLFGKAYPE
RTFDVGIAEQ HAVTFAAGLA ADGMKPFVAI YSTFLQRGYD QVVHDVAIQR LPVRFAMDRA
GLVGADGPTH AGTFDLGFMG ALPGMVLMAA ADEVELAHMV STAVAIDDRP SAFRYPRGEG
LGLTIPDLAA PLEIGKGRIV REGTSVAIVS LGTRLAECLK AADLLAARGL SATVADARFA
KPLDVDMLLR LAREHEAIIT VEEGAMGGFG AFVLQALATH GALDRGLKIR TLVLPDVFQD
QDKPDLMYAQ AGLDAEGILR GALSALGIDN ISAAGRRA
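
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The Python sketch below builds the codon table by hand and translates the first 60 bases of the gene. The amino-acid assignments of table 11 are the same as in the standard code; the two differ in the permitted start codons, which this sketch does not model. The function name `translate` is hypothetical.

```python
# Codon table in the conventional TCAG ordering. The amino-acid assignments
# below apply to both the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First displayed line of the gene sequence, copied from this record.
gene_start = "ATGCTCGCCT TGACCCCTCT CCTCGACACC ATCTCCTCGC CGGCCGACAC GCGCGGTTTC"
print(translate(gene_start))  # MLALTPLLDTISSPADTRGF
```

The output matches the first two blocks of the protein sequence in this record (MLALTPLLDT ISSPADTRGF).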