Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2770 |
Symbol | |
ID | 4028909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3101158 |
End bp | 3102384 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637967978 |
Product | gamma-butyrobetaine,2-oxoglutarate dioxygenase |
Protein accession | YP_574816 |
Protein GI | 92114888 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | [TIGR02409] gamma-butyrobetaine hydroxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.708275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTCCC AGAGCCAATC CCCCGCCGTC GAGATGAGCG AGCTGTTCCC TTACGCACAA GGGCCCGCAC TGCGCACCAG TCAGGCCCAC GACAAGGCGC TGGAGATCAT CTGGGAAAAC GCCGACAGCG CGCGCTTCAG CTACCGCTGG CTGCGCGACC ACTGCGCCTG CCCCGAGTGC CGTCATCCCA TGACGCGCGA GCGTCTTTAC ATGCCGCTCG AGGACGAAAA CTTCCTGAAC GCCCGGCCCG AAGCCAGCGT CGAGGACGGC GTGCTGGTGC TGCGCTGGGC CGACGGTCAC GAGTCGCGCT TCGACGCCGG CTGGCTGCAC CAGCGCCGTC CCGAATCGCG CATCGATGAC GGCGTTCCCC GTGCCGAAGC CTGGCGCGAA GGCTTCACCC CGGCACATGT GCCGCATGCC GAGATGATGG GCGGCCACGA GGGCCGGCGC GAGTGGCTGA CCGCCCTGTT GCGCGACGGC CTGGTCCTGC TCGACGACGG TCCGCGCGAG CTCGAGGAGG TCGTGCGCAT CGCCGAACTG TTCGGGCCGA TGCGGGCCAC CAACTTCGGC GCACGTTTCG ACGTCCAGTC CAAGCCGAAC CCCAACAATG CCGCTTACAC GGCCATCGGC CTCGAGCTGC ACACCGACCT TCCCAACTGG CGCCATCCGC CGGACATCCA GCTGCTGTAT TGCCTGGAGA ATGAAGCCGA AGGCGGCGAA TCGCTGTTCG CCGATGGCTT CGCCGTGGCC GAGGCGCTGC GTCATGAAGC GCCGGAGCTC TTCCTCCGTC TGCGCGATAC GCCCATCGAT TTCCGTTTCC AGGACGAAGA CAGCGACATC GCCGTGCGCG CGCCGGTCAT CGAGGTCGAT GACACCGGCC GCATTCGTGA AGTCCGCTTC AACAACTGGA TTCGCGACAC CCTGCGCCTG CCGCCGGAAG AAGCCGACGC CTGGTACGAG GCTTACCTGG TCTTCTGGCA GCGCCTGCGC GAGCCGCGCT TCCGCGTCGA CTTCGCGCTC GAACCCGGCC AGATGGTCGC CTTCGACAAC CGTCGTGTAC TGCACGGGCG TGGCGCCTTC GACCCCAATA CCGGTCGTCG CCACCTCCAG GGCACCTATC TGGACATCGA TCACCTCGAA TCGCATCTAC GCGTGCTGGC GCGCCACGCC ACTTCCGCCG CGCCCGACAC CCCGACAACA TCCACCAATG AAACGGGAGT TTCCTAA
|
Protein sequence | MQSQSQSPAV EMSELFPYAQ GPALRTSQAH DKALEIIWEN ADSARFSYRW LRDHCACPEC RHPMTRERLY MPLEDENFLN ARPEASVEDG VLVLRWADGH ESRFDAGWLH QRRPESRIDD GVPRAEAWRE GFTPAHVPHA EMMGGHEGRR EWLTALLRDG LVLLDDGPRE LEEVVRIAEL FGPMRATNFG ARFDVQSKPN PNNAAYTAIG LELHTDLPNW RHPPDIQLLY CLENEAEGGE SLFADGFAVA EALRHEAPEL FLRLRDTPID FRFQDEDSDI AVRAPVIEVD DTGRIREVRF NNWIRDTLRL PPEEADAWYE AYLVFWQRLR EPRFRVDFAL EPGQMVAFDN RRVLHGRGAF DPNTGRRHLQ GTYLDIDHLE SHLRVLARHA TSAAPDTPTT STNETGVS
|
| |