Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0540 |
Symbol | |
ID | 4027679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 598395 |
End bp | 599618 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637965708 |
Product | gamma-butyrobetaine,2-oxoglutarate dioxygenase |
Protein accession | YP_572601 |
Protein GI | 92112673 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTGGC CCGCGAGCCA AGCGGTTACG AATCCATGCC AGCAGTGGAG AGCACCGTTC ATGTTCATTT CCGCCGACGT CGAACTCAAG GACGAGGGGC GCCGCCTGAT CCTGCATGCC GCCGGTCAGC GTCGCGAATT CGCCGCCCTC TGGCTGCGCG AGCGGGCGCC GGACGACACC ACGCTCGACA CCCGCACCGG ACAGCGCCTG ATCGAGGCCG CGCAACTGCC CCTGACGTTG TGCGCCGAGA CCGCAAGCTG CGAGGCAGAC TCCCTGCACG TACGCTTCAG CGACGGTCAC GCCACGGCCT ATGCCCTGAA CGACCTGCTC CTCGACACCG ACGCCGATCA CGCCGAGGTC GAGCCCGGCC TGCGTCTGTG GGACGCCGGT CTCGACGCGC TGCCCCAGGC GACCTTCGCT TCGGCGCTCG AGGACGACGG TGCCCTGCTG GCCATGCTCG AGGACCTGCA CCGCTACGGC TTCGTCAAGG TCAGCGGCGT GCCCTGCGAG GCAGACGGCA TGCAGCCGTT GATCGACCGT ATCGGCCCGT TGCGCCGCAC CAACTGGGGC GGCATCGCCG ACGTCAAGTC GGTGGCCAAC GCGTTTGACC TCACCATGAC GCAACGAGGC CTCGAGCCGC ATACCGACAA CCCCTATCGC GATCCGATCC CCGGCTATAT CTGGCTGCAC TGCCTGAGCA ACGCCGCCGA CGGGGGCGAC AGCACGCTGA CCGATGGTTT CATGGCGGCA CAGCGTCTCA AGGCCGAGGC GCCCGAGGAT TTCGCATGCC TGACGCGTCT CTCGCCACGC TTCCGCTACA CCGACGCCAC CACCGACCTG GAAAGCGAGG GACCGCTGAT CGAACTCGAC AGCCGAGGAC GTCTGGCGCG CGTGCGCTAC TCCAATCGCA CCGAGCGCAT CGCGGCCCAC GACGCGGCGC TGCTCGAGCG TTACTACGCC GCGCGTCAGC GGTTCTATCG CCTGATCACC GACGAGGCAT TGACCGTGCA TCTCAAGCTC GGGCCGGGCG ACATGCTGAT CATGGACAAC TATCGGCTGC TGCACGGCCG CACCGCGTAC CAGCTCGAAG GGGGCGTGCG TCACCTGCGC CAGGGCTATG TGGATCGCGA CAGTACCGCC AGCCGGCGCC GCGTGCTCGG CGCCCAGCTC GCCGGAAACG CGCGGCCTGG CGCATCGCAT ACCGCTCAAG GAGTCAACCC ATGA
|
Protein sequence | MSWPASQAVT NPCQQWRAPF MFISADVELK DEGRRLILHA AGQRREFAAL WLRERAPDDT TLDTRTGQRL IEAAQLPLTL CAETASCEAD SLHVRFSDGH ATAYALNDLL LDTDADHAEV EPGLRLWDAG LDALPQATFA SALEDDGALL AMLEDLHRYG FVKVSGVPCE ADGMQPLIDR IGPLRRTNWG GIADVKSVAN AFDLTMTQRG LEPHTDNPYR DPIPGYIWLH CLSNAADGGD STLTDGFMAA QRLKAEAPED FACLTRLSPR FRYTDATTDL ESEGPLIELD SRGRLARVRY SNRTERIAAH DAALLERYYA ARQRFYRLIT DEALTVHLKL GPGDMLIMDN YRLLHGRTAY QLEGGVRHLR QGYVDRDSTA SRRRVLGAQL AGNARPGASH TAQGVNP
|
| |