Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2723 |
Symbol | eutB |
ID | 4028212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3052102 |
End bp | 3053097 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637967931 |
Product | threonine dehydratase |
Protein accession | YP_574769 |
Protein GI | 92114841 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR02991] ectoine utilization protein EutB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGT CCCAGCCCGA TATCAGCCTG GCATCGATCT ACCGGGCGCG TGCGCGCCTG CAGGGGCAGG TGACGCGCAC GCCGCTGGTA CGTTCGCAGG CGCTGTCGCG GCGTTTCGCG GCCGACGTGT TCCTCAAGCT GGAGACCTGC CAGCCCACCG GCGCGTTCAA GCTGCGCGGC GCCACCAACA TGCTCGCGGC CTTGCTCGAA CGGGACGGAC GCGAGGCGTT GGCGTGCGGT GTGACCACGG CCTCGACGGG CAACCATGGG CGCGCCGTGG CGTATGCGGC GCGTCAGCTC GGGCTGCCGG CGACCATCTG CGTATCGCGC CTGGTGCCCG AGAACAAGGT CGAGGCCATC GAGGCGCTGG GCGCCGAGGC CCGCCGCGTC GGAGACAGTC AGGATGATGC CTTCGCCGAG GTCGATCGCC TGGTGGCCTC GGGCATGACG GCGATTCCCC CCTTCGACGA TCCGCTGATC GTGAGCGGGC AGGGCACCAT CGGCCTCGAA CTCATGGAAG ATCAGCCGGC GCTCGACCGG GTCATCGTCG GACTGTCCGG CGGCGGCCTG CTGGGCGGCA TCGGGGCAGC CGTGAAGGCG ATTCGCCCGG CGACGCGCGT GACCGGCATC AGCCTCGCCC GGGGGGCGGC CATGTGGGAG AGCCTGCAGG CCGGGCATCC GGTGAACGTC GAGGAGGTCG CCAGTCTCGG CGACTCGCTG GGCGGCGGTA TCGGCCTGAA CAATCGTTAC ACCCTCGACC TGGTGCGCAG CGTGATGGAC GATCATCATC AGGTGTCGGA AGCCGCCATC GCCCGCGCCA TGGTCGAATT GCTCGCTACC GAGAAAATGC TGGTCGAGGG CGCGGCGGCC GTGGGGCTGG CGGCGCTCGA CGAGCATGCC CTCGACATTC GCGGCCAGCG TGTCGCACTG GTCGTCTCCG GTAACGGCGT CTCGCTCGAG ACCCTCGACC GCGCCCGGGC GCTCGCCGGC CGGTAA
|
Protein sequence | MTESQPDISL ASIYRARARL QGQVTRTPLV RSQALSRRFA ADVFLKLETC QPTGAFKLRG ATNMLAALLE RDGREALACG VTTASTGNHG RAVAYAARQL GLPATICVSR LVPENKVEAI EALGAEARRV GDSQDDAFAE VDRLVASGMT AIPPFDDPLI VSGQGTIGLE LMEDQPALDR VIVGLSGGGL LGGIGAAVKA IRPATRVTGI SLARGAAMWE SLQAGHPVNV EEVASLGDSL GGGIGLNNRY TLDLVRSVMD DHHQVSEAAI ARAMVELLAT EKMLVEGAAA VGLAALDEHA LDIRGQRVAL VVSGNGVSLE TLDRARALAG R
|
| |