Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3542 |
Symbol | |
ID | 5900997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3819516 |
End bp | 3820859 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564049 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001685167 |
Protein GI | 167647504 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.667598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCCA AACACGTCCC GACCGACTAT CCGGATGCGG GTGCGGTCGA GCGCGTGGAG CAGACGCTCC GTCAGATGCC GCCGTTGGTG TTCGCTGGGG AAGCCCGCCG GCTGAAGAGC CTGCTGGGCG ACGTGGCCGA AGGACGCGCC TTCCTGCTGC AGGGCGGCGA CTGCGCCGAG AGCTTCAAGG AATTCCACGC CGACAACATC CGCGACACCT TCCGCCTGAT CCTGCAGATG GCGGTGGTGC TGACCTTCGC CGGCGGCAAG CCGGTGGTGA AGGTGGGCCG CATCGCCGGC CAGTTCGCCA AGCCCCGCTC CGAACCGATC GAGACGATCG ATGGCGTGAC CTTGCCGTCC TACCGGGGCG ACAACATCAA CGGCATGGAC TTCACGGCGC AGGAGCGCAT TCCCGATCCG GATCGCCTGC TGCGCGCCTA CGGCCAGTCG GCGGCGACCC TGAACCTGCT GCGCGCCTTC GCCAGCGGGG GTTACGCCGA CCTCTACAAC ATCCATCGCT GGACCCTGGG CTTCGTGGGC GACAGCCCGC AGGGCGCCCG GTACCGCGAG CTCAGCGAAA AGATCAGCGA AGCCCTGACC TTCATGGCGG CGGTCGGCGT CACGCCGGAA ACCCAGCCGG ACCTGAAGCG CGTCGAGTTC TTCACCAGCC ACGAGGCCCT GTTGCTGGGC TTCGAGGAAG CCATGACCCG CGTCGATAGC ACCTCGGGCG ACTGGTACGA CACCAGCGCC CACCTGCTGT GGATCGGCGA GCGCACCCGT CAGCTGGACG GGGCGCACAT CGAATATATG CGCGGCATCA AGAACCCGAT CGGCCTCAAG TGCGGCCCGA CCATGGAGGG CGACGACCTG CTGCGGCTGA TCGACGTGCT CAACCCGGCC AACGAGCCGG GCCGCCTGAC CCTGTACGGC CGCTTCGGCT CGGACAAGAT CGCCGACCGC CTGCCGCGCC TGATGAGGGC CACCAAGGCC GCCGGTCGTT CGGTGGTCTG GGCCACCGAC CCGATGCATG GCAATACGCT GAAGGCCTCG ACCGGCTACA AGACCCGGCC GTTCGACCGC ATCCTCTCGG AGGTGAAGTC GTTCGTCGAG ATCGCCAACG CCGAGGGCGT CCACCCCGGC GGCGTGCACC TGGAAATGAC CGGCCAGAAC GTCACCGAGT GCCTGGGCGG CGCGCGGGCG GTGTCGGAAG GCGATCTCGC CGACCGCTAC CACACCCATT GCGACCCGCG CCTGAACGGC GAGCAGGCTC TGGAACTGGC GTTCCTGGTG GCCGAAAAGC TGAAGTCGGC GCGCGACGAC CAGCGGCGCC TGGCCGCGGG ATAA
|
Protein sequence | MPAKHVPTDY PDAGAVERVE QTLRQMPPLV FAGEARRLKS LLGDVAEGRA FLLQGGDCAE SFKEFHADNI RDTFRLILQM AVVLTFAGGK PVVKVGRIAG QFAKPRSEPI ETIDGVTLPS YRGDNINGMD FTAQERIPDP DRLLRAYGQS AATLNLLRAF ASGGYADLYN IHRWTLGFVG DSPQGARYRE LSEKISEALT FMAAVGVTPE TQPDLKRVEF FTSHEALLLG FEEAMTRVDS TSGDWYDTSA HLLWIGERTR QLDGAHIEYM RGIKNPIGLK CGPTMEGDDL LRLIDVLNPA NEPGRLTLYG RFGSDKIADR LPRLMRATKA AGRSVVWATD PMHGNTLKAS TGYKTRPFDR ILSEVKSFVE IANAEGVHPG GVHLEMTGQN VTECLGGARA VSEGDLADRY HTHCDPRLNG EQALELAFLV AEKLKSARDD QRRLAAG
|
| |