Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1440 |
Symbol | |
ID | 5898895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1529629 |
End bp | 1531455 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561927 |
Product | phosphogluconate dehydratase |
Protein accession | YP_001683068 |
Protein GI | 167645405 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase [TIGR01196] 6-phosphogluconate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.683477 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCA ACCCCGTCGC TCTTCATCCT GTGATCGCCG AAGTCACCGC CCGGATCATC GAGCGCAGCC GCGACAGCCG CGCGACCTAC CTGGCCAATC TCGACGCGGC CGCGGCCGCC CAGCCGGGAC GGGCAAAACT CAGCTGCGCC AACTGGGCCC ACGCCTTCGC CGCCTCGCCG TCGGTCGACA AGGTTCGCGC CCTCGATCCG AACGCCCCCA ACCTGGGCAT CGTCTCGGCC TATAACGACA TGTTGTCGGC CCACCAGCCG CTGGAGGAGT ACCCGGCGCT GATCAAGGCG GCCGCGCGCG AGGTCGGGGC CACGGCGCAA TTCGCCGGCG GCGTGCCGGC CATGTGCGAC GGCGTCACCC AGGGGCGTCC GGGCATGGAG CTGTCGCTGT TCTCGCGCGA CGTGATCGCC ATGGCCACGG GCATCGCCCT GACCCACGAC GCCTTCGACG GCGCGCTCTA CCTGGGCGTC TGCGACAAGA TCGTGCCGGG CCTGCTGATC GGCGCCCTGA CCTTCAGCCA CCTGCCGGCG ATGTTCGTGC CGGCCGGTCC GATGACTTCG GGCCTGCCCA ATTCGGAGAA GGCCCGCATC CGCGCCCTCT ACGCCGAGGG CAAGGTCGGT CGCGAGGAAC TGCTGGCCGC CGAGAGCGCC AGCTATCACG GCCCTGGCAC CTGCACCTTC TATGGCACGG CCAACACCAA CCAGATGCTG ATGGAGCTGA TGGGCCTGCA CCTGCCGGGC TCGGCCTTCG TCCATCCCCA CACCCCGCTG CGCAGCGCCC TGGTCAAGGA AGCCGCCAAA CGCGTCGCCG CCATCACCCA CAAGGGCAAT GAGTGGATTC CGGTCGGCCG GGTGATCGAC GAGAAGGCCG TGGTCAACGG CGTGGTCGGC CTGATGGCCA CCGGCGGCTC GACCAACCTG GCCCTGCACC TGGTCGCCAT GGCCCACGCG GCCGGCATCA TCCTGACCCT TGAAGACCTG GACGACATCT CGAAGAACAC GCCGCTGCTG GCCAAGGTCT ATCCAAACGG CTCGGCCGAC GTGAACCAGT TCCACGCCGC CGGCGGCATC CATTTCGTGG TCAGGGAGCT GTTGAAGGCC GGCTTGGTCC ACGAGGACGT CCTGACCGTC GTCGGCCCCG GCCTGTCGCG CTACACCCAG GAGCCGGTCC TGATCGACGG CGAGCTGGCT TGGCGGGAGG GCGCCGAGCA GTCGCTGGAT CTCAACATCC TGCGCCCGGC CTCAGACCCG TTCAGCCCGG AAGGTGGTCT GCGCCTACTG ACCGGAAATC TGGGTCGCGG GGTGATCAAG GTCTCGGCCG TCAAGCCCGA GCATCAGGTG ATCACCGCCC CGGCGGCCGT GTTCCAGGAA CAGGAAGACT TCATCGCCGC CTTCAAGCGC GGCGAGCTCG ACCGCGATGT CGTGGTGGTG GTGCGCTTCC AAGGCCCGTC GGCCAACGGC ATGCCTGAAC TGCACAACCT GTCGCCCTCG ATCTCGGTGT TGCTGGACCG CGGCTTCAAG GTCGCCCTGG TCACCGACGG CCGGATGTCG GGCGCGTCGG GCAAGACCCC GGCCGCCATC CACCTGACCC CGGAAGCGGC CAAGGGCGGC GCCCTGGCCT ATGTCGAGGA CGGCGACGTC ATCTCTCTGA ACGCCCATAC GGGCGAACTG AAAATCTTGG TGGACGAGGC GACCCTGCGC GCCCGCACGC CGGCAAAAGT CCCGGCGTCC AAGCCGGGCT ATGGACGGGA ACTCTTCGGC CTGCTGCGTT CGGGGGTCGG CGCGTCCGAC CACGGCGCAT CGGTGCTGTT TGCTTAG
|
Protein sequence | MAVNPVALHP VIAEVTARII ERSRDSRATY LANLDAAAAA QPGRAKLSCA NWAHAFAASP SVDKVRALDP NAPNLGIVSA YNDMLSAHQP LEEYPALIKA AAREVGATAQ FAGGVPAMCD GVTQGRPGME LSLFSRDVIA MATGIALTHD AFDGALYLGV CDKIVPGLLI GALTFSHLPA MFVPAGPMTS GLPNSEKARI RALYAEGKVG REELLAAESA SYHGPGTCTF YGTANTNQML MELMGLHLPG SAFVHPHTPL RSALVKEAAK RVAAITHKGN EWIPVGRVID EKAVVNGVVG LMATGGSTNL ALHLVAMAHA AGIILTLEDL DDISKNTPLL AKVYPNGSAD VNQFHAAGGI HFVVRELLKA GLVHEDVLTV VGPGLSRYTQ EPVLIDGELA WREGAEQSLD LNILRPASDP FSPEGGLRLL TGNLGRGVIK VSAVKPEHQV ITAPAAVFQE QEDFIAAFKR GELDRDVVVV VRFQGPSANG MPELHNLSPS ISVLLDRGFK VALVTDGRMS GASGKTPAAI HLTPEAAKGG ALAYVEDGDV ISLNAHTGEL KILVDEATLR ARTPAKVPAS KPGYGRELFG LLRSGVGASD HGASVLFA
|
| |