Gene Csal_0093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0093 
Symbol 
ID4026015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp116845 
End bp118059 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content67% 
IMG OID637965244 
Producthypothetical protein 
Protein accessionYP_572156 
Protein GI92112228 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3616] Predicted amino acid aldolase or racemase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGCAC CCGGGAACTG GGAAATGGAG AAGGGCATGC CGCCGTCGCG TCCGGCGCAC 
CTGCTGCATG ATGTGCCGCT GCCGGCGGCG GCGGTCTTCG AGGCGCCGCT GACCCACAAC
CTGGCGTGGA TGCAGCGCTT CGCCGAGGGG CATGGCGCCA AGCTGGCGCC GCACGGGAAG
ACCACCATGG CCCCGGCATT GTTCAGGCGA CAGCTCGAGG CGGGCGCCTG GGGAATCACG
CTGGCGACGG CGGTGCAGAC GGTGACGGCG CATGCCCATG GCGTCGACCG TGTGCTGATG
GCCAACCAGC TGGTCGGCCG GCCGAACATG ACGCTGGTGG CGGATGCCAT CGAGGCGGGG
CTGGAGTACT ACTGCGTGGT GGACGGCGTC GATAACGTGC GCGATCTAGG GGCGTTCTTC
GCCGACAGGG AGCTCACGCT GAACGTGCTG ATCGAGCTGG GCGTCGATGG CGGACGCTGC
GGCTGCCGCA ACGCCGCGCA GGTCGATGCG CTGGTGGCAG AGATCGCCAG GCAGCCCGCG
CTGGCCCTCG TCGGCATCGA AGGTTACGAG GGGATGATCG CCGGCGGCGA TGAAGCCGCT
GCCGTGCGTG CCTACGGCGA GCGGTTGGTC GAGACCGTCC GCACGTTGCA GGCCAGTGAT
GTTCTGCAAC GCGAGGCGCC GATCGTCACC GCTTCCGGCT CCAAGTGGTT CGACCTGATC
GCGGAGACGT TCGACAGGGC GGAGCTGCGC GAGCACTACA CGCCGGTGCT GAGGCCGGGC
TGCTACGTGG TGCACGATCA CAAGCTCTAT GCCGGTGCGA TGGAGGCGAT CAAGGCGCGC
GATCCCGGCC TGGAGGGCGA GCTGCGCCCG GCGCTGGAAG TCTTTGCTCA TGTGCAGTCG
CTGCCCGAAC CGGGCCTGGC GATCATTGCG CTGGGCAAGC GCGATATCGG GCACGAGCCT
GATCTGCCGT TGCCGCTACG CCGCTATCCA CGGGAGGCGG GAGGTACGGT GAGTGTGGAC
GTCAGCGGCT GGCGAACGAC GCACATCATG GATCAGCATG CGTTTCTCGA GATTCCCGAG
CACGCCGATA TCGCGGTAGG CGATGTGCTG GCCTTCGGCA CGTCCCATCC CTGCCTGACG
TTCGACAAGT GGCGGCGCGT ACTATGCGTC GATGAAGCGC TGGCAGTGAA GGAAGTGATG
ACGACGCATT TCTGA
 
Protein sequence
MVAPGNWEME KGMPPSRPAH LLHDVPLPAA AVFEAPLTHN LAWMQRFAEG HGAKLAPHGK 
TTMAPALFRR QLEAGAWGIT LATAVQTVTA HAHGVDRVLM ANQLVGRPNM TLVADAIEAG
LEYYCVVDGV DNVRDLGAFF ADRELTLNVL IELGVDGGRC GCRNAAQVDA LVAEIARQPA
LALVGIEGYE GMIAGGDEAA AVRAYGERLV ETVRTLQASD VLQREAPIVT ASGSKWFDLI
AETFDRAELR EHYTPVLRPG CYVVHDHKLY AGAMEAIKAR DPGLEGELRP ALEVFAHVQS
LPEPGLAIIA LGKRDIGHEP DLPLPLRRYP REAGGTVSVD VSGWRTTHIM DQHAFLEIPE
HADIAVGDVL AFGTSHPCLT FDKWRRVLCV DEALAVKEVM TTHF