Gene Caul_3542 details


Gene Information

Locus tag: Caul_3542
Symbol:
ID: 5900997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3819516
End bp: 3820859
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 68%
IMG OID: 641564049
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001685167
Protein GI: 167647504
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
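
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the database record: the function names are illustrative, and it assumes 1-based inclusive coordinates and a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3819516, 3820859)
print(length, protein_length_aa(length))  # prints: 1344 447
```

Both values agree with the Gene Length and Protein Length fields of this record.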


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.667598
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCCA AACACGTCCC GACCGACTAT CCGGATGCGG GTGCGGTCGA GCGCGTGGAG 
CAGACGCTCC GTCAGATGCC GCCGTTGGTG TTCGCTGGGG AAGCCCGCCG GCTGAAGAGC
CTGCTGGGCG ACGTGGCCGA AGGACGCGCC TTCCTGCTGC AGGGCGGCGA CTGCGCCGAG
AGCTTCAAGG AATTCCACGC CGACAACATC CGCGACACCT TCCGCCTGAT CCTGCAGATG
GCGGTGGTGC TGACCTTCGC CGGCGGCAAG CCGGTGGTGA AGGTGGGCCG CATCGCCGGC
CAGTTCGCCA AGCCCCGCTC CGAACCGATC GAGACGATCG ATGGCGTGAC CTTGCCGTCC
TACCGGGGCG ACAACATCAA CGGCATGGAC TTCACGGCGC AGGAGCGCAT TCCCGATCCG
GATCGCCTGC TGCGCGCCTA CGGCCAGTCG GCGGCGACCC TGAACCTGCT GCGCGCCTTC
GCCAGCGGGG GTTACGCCGA CCTCTACAAC ATCCATCGCT GGACCCTGGG CTTCGTGGGC
GACAGCCCGC AGGGCGCCCG GTACCGCGAG CTCAGCGAAA AGATCAGCGA AGCCCTGACC
TTCATGGCGG CGGTCGGCGT CACGCCGGAA ACCCAGCCGG ACCTGAAGCG CGTCGAGTTC
TTCACCAGCC ACGAGGCCCT GTTGCTGGGC TTCGAGGAAG CCATGACCCG CGTCGATAGC
ACCTCGGGCG ACTGGTACGA CACCAGCGCC CACCTGCTGT GGATCGGCGA GCGCACCCGT
CAGCTGGACG GGGCGCACAT CGAATATATG CGCGGCATCA AGAACCCGAT CGGCCTCAAG
TGCGGCCCGA CCATGGAGGG CGACGACCTG CTGCGGCTGA TCGACGTGCT CAACCCGGCC
AACGAGCCGG GCCGCCTGAC CCTGTACGGC CGCTTCGGCT CGGACAAGAT CGCCGACCGC
CTGCCGCGCC TGATGAGGGC CACCAAGGCC GCCGGTCGTT CGGTGGTCTG GGCCACCGAC
CCGATGCATG GCAATACGCT GAAGGCCTCG ACCGGCTACA AGACCCGGCC GTTCGACCGC
ATCCTCTCGG AGGTGAAGTC GTTCGTCGAG ATCGCCAACG CCGAGGGCGT CCACCCCGGC
GGCGTGCACC TGGAAATGAC CGGCCAGAAC GTCACCGAGT GCCTGGGCGG CGCGCGGGCG
GTGTCGGAAG GCGATCTCGC CGACCGCTAC CACACCCATT GCGACCCGCG CCTGAACGGC
GAGCAGGCTC TGGAACTGGC GTTCCTGGTG GCCGAAAAGC TGAAGTCGGC GCGCGACGAC
CAGCGGCGCC TGGCCGCGGG ATAA
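
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as the GC content reported in the Gene Information section (68%). The following Python sketch shows one way to do that; the helper names are hypothetical, and the example call uses only the first two blocks of the sequence rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence: 13 of 20 bases are G or C.
print(gc_fraction("ATGCCCGCCA AACACGTCCC"))
```

Passing the full gene sequence to gc_fraction gives the value to compare against the GC content field.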
 
Protein sequence
MPAKHVPTDY PDAGAVERVE QTLRQMPPLV FAGEARRLKS LLGDVAEGRA FLLQGGDCAE 
SFKEFHADNI RDTFRLILQM AVVLTFAGGK PVVKVGRIAG QFAKPRSEPI ETIDGVTLPS
YRGDNINGMD FTAQERIPDP DRLLRAYGQS AATLNLLRAF ASGGYADLYN IHRWTLGFVG
DSPQGARYRE LSEKISEALT FMAAVGVTPE TQPDLKRVEF FTSHEALLLG FEEAMTRVDS
TSGDWYDTSA HLLWIGERTR QLDGAHIEYM RGIKNPIGLK CGPTMEGDDL LRLIDVLNPA
NEPGRLTLYG RFGSDKIADR LPRLMRATKA AGRSVVWATD PMHGNTLKAS TGYKTRPFDR
ILSEVKSFVE IANAEGVHPG GVHLEMTGQN VTECLGGARA VSEGDLADRY HTHCDPRLNG
EQALELAFLV AEKLKSARDD QRRLAAG
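
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal Python sketch of that relationship, the code below builds a codon table and translates up to the first stop codon. It relies on the fact that table 11 assigns the same amino acids to sense codons as the standard code, and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCCCGCCA AACACGTCCC GACCGACTAT"))  # prints: MPAKHVPTDY
```

The printed residues match the start of the protein sequence listed above (MPAKHVPTDY).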