Gene Csal_0540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0540 
Symbol 
ID4027679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp598395 
End bp599618 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content68% 
IMG OID637965708 
Productgamma-butyrobetaine,2-oxoglutarate dioxygenase 
Protein accessionYP_572601 
Protein GI92112673 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTGGC CCGCGAGCCA AGCGGTTACG AATCCATGCC AGCAGTGGAG AGCACCGTTC 
ATGTTCATTT CCGCCGACGT CGAACTCAAG GACGAGGGGC GCCGCCTGAT CCTGCATGCC
GCCGGTCAGC GTCGCGAATT CGCCGCCCTC TGGCTGCGCG AGCGGGCGCC GGACGACACC
ACGCTCGACA CCCGCACCGG ACAGCGCCTG ATCGAGGCCG CGCAACTGCC CCTGACGTTG
TGCGCCGAGA CCGCAAGCTG CGAGGCAGAC TCCCTGCACG TACGCTTCAG CGACGGTCAC
GCCACGGCCT ATGCCCTGAA CGACCTGCTC CTCGACACCG ACGCCGATCA CGCCGAGGTC
GAGCCCGGCC TGCGTCTGTG GGACGCCGGT CTCGACGCGC TGCCCCAGGC GACCTTCGCT
TCGGCGCTCG AGGACGACGG TGCCCTGCTG GCCATGCTCG AGGACCTGCA CCGCTACGGC
TTCGTCAAGG TCAGCGGCGT GCCCTGCGAG GCAGACGGCA TGCAGCCGTT GATCGACCGT
ATCGGCCCGT TGCGCCGCAC CAACTGGGGC GGCATCGCCG ACGTCAAGTC GGTGGCCAAC
GCGTTTGACC TCACCATGAC GCAACGAGGC CTCGAGCCGC ATACCGACAA CCCCTATCGC
GATCCGATCC CCGGCTATAT CTGGCTGCAC TGCCTGAGCA ACGCCGCCGA CGGGGGCGAC
AGCACGCTGA CCGATGGTTT CATGGCGGCA CAGCGTCTCA AGGCCGAGGC GCCCGAGGAT
TTCGCATGCC TGACGCGTCT CTCGCCACGC TTCCGCTACA CCGACGCCAC CACCGACCTG
GAAAGCGAGG GACCGCTGAT CGAACTCGAC AGCCGAGGAC GTCTGGCGCG CGTGCGCTAC
TCCAATCGCA CCGAGCGCAT CGCGGCCCAC GACGCGGCGC TGCTCGAGCG TTACTACGCC
GCGCGTCAGC GGTTCTATCG CCTGATCACC GACGAGGCAT TGACCGTGCA TCTCAAGCTC
GGGCCGGGCG ACATGCTGAT CATGGACAAC TATCGGCTGC TGCACGGCCG CACCGCGTAC
CAGCTCGAAG GGGGCGTGCG TCACCTGCGC CAGGGCTATG TGGATCGCGA CAGTACCGCC
AGCCGGCGCC GCGTGCTCGG CGCCCAGCTC GCCGGAAACG CGCGGCCTGG CGCATCGCAT
ACCGCTCAAG GAGTCAACCC ATGA
 
Protein sequence
MSWPASQAVT NPCQQWRAPF MFISADVELK DEGRRLILHA AGQRREFAAL WLRERAPDDT 
TLDTRTGQRL IEAAQLPLTL CAETASCEAD SLHVRFSDGH ATAYALNDLL LDTDADHAEV
EPGLRLWDAG LDALPQATFA SALEDDGALL AMLEDLHRYG FVKVSGVPCE ADGMQPLIDR
IGPLRRTNWG GIADVKSVAN AFDLTMTQRG LEPHTDNPYR DPIPGYIWLH CLSNAADGGD
STLTDGFMAA QRLKAEAPED FACLTRLSPR FRYTDATTDL ESEGPLIELD SRGRLARVRY
SNRTERIAAH DAALLERYYA ARQRFYRLIT DEALTVHLKL GPGDMLIMDN YRLLHGRTAY
QLEGGVRHLR QGYVDRDSTA SRRRVLGAQL AGNARPGASH TAQGVNP