Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0311 |
Symbol | |
ID | 5897585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 349911 |
End bp | 351008 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641560795 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001681946 |
Protein GI | 167644283 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.494385 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCCT CGACCGCCGC CAATCTCGCC TCGCTGCCCG TCCCCACCGA CGACCTGCGA ATCCGACAGC TTCAGACCCT CAGTCCGCCG GCCCAGGTGA TCGGCGAGGC GCCGGCCACC TCATCGACGG CCGAGGTCGT CGGCGACGCG CGCCGCGCGG TGCATGAGAT CCTGGAAGGC CGCGACGACC GCCTGGTGGT GGTGATCGGG CCATGCTCGA TCCACGATCC CAAGGCCGCC CTCGACTACG CGCGCCGCCT GGCCGTCGAG CGCGAACGTC ACGCCGGCGA GTTGGAAGTG ATCATGCGGG TGTATTTCGA GAAGCCGCGC ACCACGGTCG GCTGGAAGGG CCTGATCAAC GACCCGGACA TGGACGGCGG CTTCCGGATC AATGAAGGCC TGCGGCTGGC GCGGCGGGTG CTGCTCGACA TCAGCGCCCA GGGCCTGCCG GCGGCCTGCG AGTTCCTGGA CGTCACCACC CCGCAATACA TCGCCGACCT GGTGGCCTGG GGCGCGATCG GAGCCCGCAC CACCGAAAGC CAGATCCACC GCGAGATGGC GTCGGGCCTG TCCTGCCCGG TCGGATTCAA GAACGGCACC AACGGCGACG TCAAGGTCGC GGTCGACGCG GTCCTGGCCG CCGCCCAGCC GCACCATTTC CTGGCCGTGA CCAAGGAAGG GCGCGGCGCC ATCGCCACCA CCACGGGCAA CGCCGACTGC CACGTCGTGC TGCGCGGCGG CAAGACGCCC AACTACGACG CCGCCAGCGT CGCGGCGGTG GCCCAGGCCC TGACCGCCGC CGGCCTGCCG CCACGCGTCA TGGTGGACGC CAGCCACGCC AACAGCGGCA AGGACCACGA GAACCAGCCC GGCGTGGTCG CCGATCTCTG CGCCCAGGTG GCGACCGGCC ATTCGCCGAT CATGGGGGTG ATGATCGAGA GCCATCTCGT CGCCGGCCGG CAGGACATCG TTCCAGGCCG GCCCCTGACC TACGGTCAGT CGGTCACCGA CGCCTGCATC GACTGGGAGA CCTCGGTGCG CCTGCTGGAT CAGCTGGCGG CGGCGGTGCG CGCCGGACGC CGGTCCATCG ACCGATAG
|
Protein sequence | MPPSTAANLA SLPVPTDDLR IRQLQTLSPP AQVIGEAPAT SSTAEVVGDA RRAVHEILEG RDDRLVVVIG PCSIHDPKAA LDYARRLAVE RERHAGELEV IMRVYFEKPR TTVGWKGLIN DPDMDGGFRI NEGLRLARRV LLDISAQGLP AACEFLDVTT PQYIADLVAW GAIGARTTES QIHREMASGL SCPVGFKNGT NGDVKVAVDA VLAAAQPHHF LAVTKEGRGA IATTTGNADC HVVLRGGKTP NYDAASVAAV AQALTAAGLP PRVMVDASHA NSGKDHENQP GVVADLCAQV ATGHSPIMGV MIESHLVAGR QDIVPGRPLT YGQSVTDACI DWETSVRLLD QLAAAVRAGR RSIDR
|
| |