Gene Csal_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3003 
Symbol 
ID4028969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3342319 
End bp3343317 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content60% 
IMG OID637968209 
Productectoine hydroxylase 
Protein accessionYP_575046 
Protein GI92115118 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG5285] Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin 
TIGRFAM ID[TIGR02408] ectoine hydroxylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTGT TTGTCGGCGC CGACCTCTCC GACTACGTTT TCTCGGGGAT CGGCGGCAAT 
ACCGTTCCCA GCAGACTAAT GGAGGAGTTT GCAATGAAAG AGACACAAGA CCTGTTTCCG
ACGCGCCTGG AACGCAAACT GGGAATGTTC GAGCGCATCG ATCCGGTCGT ACACAGTGAA
GGCGATCAGC GCAAGGGGCC GCTCAGCGAA GCCGAGCTCG ACGAGTTCGA CCGCAAGGGG
TTCCTGTCTT TCGAGGGGTT CTTCGACGAG GACGAAATGG AAGCGTTCCT CCAGGAGCTC
CGCGACTACG AGAGCGATGA AGACCTCAAG CTCTCGGAAG GCACCATTCT CGAGCCCGGC
AAGCAGGAAA TCCGTTCGAT CTTCGGCATC CACGAGGTGT CAGAACGTTT CAGTCGTCTG
ACGCGCGATC CACGCCTATT GGCCATGGTG CAACAGATCC TCGGTGGCGA TGCCTACATT
CACCAATCGC GGATCAACTA CAAGCCGGGC TTCAAGGGCA AGGGCTTCGA CTGGCATTCG
GATTTCGAGA CCTGGCACAG CGAGGACGGC ATGCCGCGCA TGCGCTCGGT GAGCTGCTCG
ATCATTCTCA CCGAAAACGG CGAGTTCAAC GGTCCGCTGA TGCTGGTGCC CGGTTCGCAC
CATTATTTCG TGCCCTGCGT GGGGCGTACG CCGGAGGACA ACTACAAGGA GTCGCTGAAG
AGTCAAGACA TCGGCGTGCC GGACGATGCC AGCCTGCGCG ACCTGATGCT CAAGGGCGAT
ATCGAAGCCC CCAAGGGTCC CGTCGGGTCG CTGGTGATGT TCGAGTGCAA CACCCTGCAC
GGCTCCAACA TCAACATGTC GTGCTGGCCG CGCAGCAACC TGTTCTTCGT CTACAACAGT
GTCGAGAACA CGCTGCACGA CCCGTATTGC GGCAACCGTC CGCGGCCCGA GTTCCTCGCC
AACCGCAAGG ACTGGCGGCC GCTGACACCG GCCGAGTAA
 
Protein sequence
MAVFVGADLS DYVFSGIGGN TVPSRLMEEF AMKETQDLFP TRLERKLGMF ERIDPVVHSE 
GDQRKGPLSE AELDEFDRKG FLSFEGFFDE DEMEAFLQEL RDYESDEDLK LSEGTILEPG
KQEIRSIFGI HEVSERFSRL TRDPRLLAMV QQILGGDAYI HQSRINYKPG FKGKGFDWHS
DFETWHSEDG MPRMRSVSCS IILTENGEFN GPLMLVPGSH HYFVPCVGRT PEDNYKESLK
SQDIGVPDDA SLRDLMLKGD IEAPKGPVGS LVMFECNTLH GSNINMSCWP RSNLFFVYNS
VENTLHDPYC GNRPRPEFLA NRKDWRPLTP AE