Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3613 |
Symbol | |
ID | 5901068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3898451 |
End bp | 3900253 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564124 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001685238 |
Protein GI | 167647575 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.772791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCTG CCGCCGTTCC GCCCCGCGCC CTTCGTTCGC GCGCCTGGTT CGACAATCCG GACAATGCGG ACATGACGGC GCTCTACCTG GAGCGTTACC TGAACTATGG CCTGTCGCTG GAAGAACTGC AGTCAGGCAA GCCGATCATC GGCATCGCCC AGACCGGCAG CGACCTGTCG CCCTGCAACC GCCACCACCT GGTTCTGGCC GAACGCGTGC GCGAAGGCAT CCGCACGGCC GGCGGCATCG CCCTGGAGTT TCCGGTCCAT CCGATCCAGG AGACCGGCAA GCGCCCGACC GCGGGCCTGG ACCGCAACCT CTCCTACCTC GGTCTGGTCG AAATCCTGTA CGGCTATCCG ATCGACGGCG TGGTTCTGAC CATCGGCTGC GACAAGACCA CGCCAGCCTG TCTGATGGCC GCCGCCACCG TCAACATCCC GGCCATCGCC CTGTCGGTCG GACCGATGCT GAACGGCTGG CACAAGGGTG AGCGCACGGG CTCGGGCACC ATTGTCTGGA AGGCTCGCGA AATGCTGGCG GCCGGCGAGA TCGACCGCGC CGGCTTCATC AAGCTGGTGG CCAGTTCCGC CCCCTCGACC GGCTATTGCA ACACCATGGG CACGGCCACG ACCATGAACT CGCTGACCGA GGCCTTGGGC ATGTCGCTGA CGGGCTCGGC GGCGATTCCC GCCCCGTACC GCGACCGGCA ACAGAACGCC TACGAGACCG GCCTGCGGAT CGTCGAGCTG ACCGAGCAGG ACATCAAGCC GTCCGACATC CTGACCCGCG ACGCCTTCCT CAACGCCGTG GTCGTCAATT CGGCGATCGG CGGCTCGACC AACGCCCCGA TCCACCTCAA CGCCCTGGCG CGCCATATCG GCGTCGAGCT CAGCGTCGAC GACTGGCAGG CCTATGGCGA AGAGGTGCCG CTGCTGGTCA ACCTGCAGCC GGCCGGCGAA TATCTGGGCG AGGACTATTA CCGGGCCGGC GGCGTGCCGG CCGTGGTCAA CCAGTTGATG GGCCAGGGTC TGATCCGCGA GGACGCCCTG ACCGTCTCGG GCCAGACCCT GGGCGAGGCC TGCCGGAACG CGGCGATCGA GGACGAGGCG GTGATCCGCC CCTTCGACAA GCCGCTGGTC GAGCGCGCGG GCTTCGTGGT CATGCGCGGC AACCTGTTCA ACTCGGCGAT CATGAAGACC AGCGTGATCA CCGCCGAGTT CCGCGACCGC TATCTGTCGA ACCCCGACGA CCCGGACGCC TTCGAGGGCG AGGCGGTGGT GTTCGACGGA CCCGAGGACT ACCACCGCCG CATCGACGAT CCGGCGGTCG GGATCACCGA GCGCAGCGTG CTGTTCATGC GCGGGGCCGG GCCGATCGGC TATCCTGGCG CGGCCGAGGT CGTGAACATG CGGGCCCCGG ACTACCTGAT CAAGCGCGGG ATCCACCAAC TGCCCTGCAT CGGCGACGGG CGCCAGTCGG GCACCTCGGG CTCGCCCTCG ATCCTCAACG CCTCGCCGGA GGCGGCGGCC GGCGGCGGCC TGGCCCTGCT GAAGTCCGGC GACAAGGTGC GGGTCGATCT GCGCAAGTCG CGGGTCGATG TGCTGGTCAC GCCCGAAGAG GTGGTCGCGC GGCGCGCCGC GCTCGAGGCG GCCGGCGGCT ACGCCTATCC CGAAAGCCAG ACGCCCTGGC AGGAGATCCA GCGCGGCATC ATCGGCCAGA TGGACACCGG CGCGGTGCTG GAGCCGGCGG TCAAGTACCA GCGCATCGCC CAGACCAAGG GCCTGCCGCG GGACAACCAC TGA
|
Protein sequence | MSSAAVPPRA LRSRAWFDNP DNADMTALYL ERYLNYGLSL EELQSGKPII GIAQTGSDLS PCNRHHLVLA ERVREGIRTA GGIALEFPVH PIQETGKRPT AGLDRNLSYL GLVEILYGYP IDGVVLTIGC DKTTPACLMA AATVNIPAIA LSVGPMLNGW HKGERTGSGT IVWKAREMLA AGEIDRAGFI KLVASSAPST GYCNTMGTAT TMNSLTEALG MSLTGSAAIP APYRDRQQNA YETGLRIVEL TEQDIKPSDI LTRDAFLNAV VVNSAIGGST NAPIHLNALA RHIGVELSVD DWQAYGEEVP LLVNLQPAGE YLGEDYYRAG GVPAVVNQLM GQGLIREDAL TVSGQTLGEA CRNAAIEDEA VIRPFDKPLV ERAGFVVMRG NLFNSAIMKT SVITAEFRDR YLSNPDDPDA FEGEAVVFDG PEDYHRRIDD PAVGITERSV LFMRGAGPIG YPGAAEVVNM RAPDYLIKRG IHQLPCIGDG RQSGTSGSPS ILNASPEAAA GGGLALLKSG DKVRVDLRKS RVDVLVTPEE VVARRAALEA AGGYAYPESQ TPWQEIQRGI IGQMDTGAVL EPAVKYQRIA QTKGLPRDNH
|
| |