Gene Csal_3154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3154 
Symbol 
ID4028621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3518134 
End bp3520362 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content66% 
IMG OID637968368 
Productmalate synthase G 
Protein accessionYP_575197 
Protein GI92115269 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01345] malate synthase G 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.147151 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACC GTACGACACG CCATCGTTTG CAGGTGGCCA GCCAGCTCGA CCGTTTCATC 
AACGACGAAG CGTTGCCGGG AACGGGCGTC GACCCCGAGG CCTTCTGGGC CGGGTTCGAT
GCCCTGGTCC ACGAGCTCGT GCCCACCAAT CGTGACCTGC TGGCCGAGCG CGAGCGACTG
CAGGACGAAC TGGACGCCTG GCACAAGGCG AACCCCGGGC CGGTGCGGGA CATGGCCGCC
TATCGTGCCT TCCTGAAGGA CATCGGTTAT CTGGTCGAGG CGCCGGCCAA GGTGCAGGCC
ACCACCGCCA ATGTCGATGA CGAGATCGCC GTGCAGGCGG GGCCCCAGCT GGTGGTGCCG
GTAAGCAACG CGCGTTATGC CCTCAATGCC GCCAATGCGC GCTGGGGCAG CCTTTACGAC
GCGCTCTACG GCACCGACGT GATTCCCGAG ACGGACGGCG CCGAAAAAAG CCAGGGCTAC
AACCCCAAGC GTGGCGAAAA GGTCATCGCC TACGCACGCG GTGTGCTCGA CCAGGCAGCC
CCCCTGGCCG AGGGCACGCA TGCCGAGGCG AGCGCTTATG CTCTGCGTGA CGGCAAGCTG
GTGGTGACCC TGCAGGGCGG CGGTGAGACG GGGCTGGCCG ATCCTGCGCA ACTGGTCGGT
TACCGAGGCG AGGCACAGGC GCCCACGGCG GTGCTGCTCG CCAATCATGG CCTGCATCTG
GAAGTCCAGT TCGACGCCAC TCACCCCATC GGCAAGACCG ATCCGGCCCA CGTCAAGGAC
GTGCTGGTCG AGGCGGCGGT GAGCACCATC ATGGACTGCG AGGACTCCGT GGCGGCCGTG
GACGCCGACG ACAAGACGCT GGTCTATCGC AACTGGCTGG GGCTGATGAA GGGCGACCTC
GAGGAGCGCT TCGACAAGGG CGGCAAGACG GTGACGCGCG CCCTCAACCC GGACCGCGAC
TACACGGTCC CCGGCGGTGG CGAGTTGCGC CTGCCCGGAC GCTCTTTGCT GTTCGTGCGC
AATGTCGGCC ATCTGATGAC CACGCCGGCC GTGCTCGATG GCGATGGCAA CGAGATTCCC
GAGGGAATGC TGGATGGCGT GGTCACCAGT CTGCTGGCGA TTCACGACCT CAAGAAAGGC
GACGGCGCTG CCCCGTCCGC CACGGCACCG GAGGCCAAGC GTAATAGCCG GACGGGATCG
GTCTATATCG TCAAGCCCAA GATGCACGGC CCCAGGGAAG TGGCCTTCGC CAACAGCCTG
TTCATGCGCA TCGAGGACAT GCTGGGCCTG CCGCGCGATA CCCTCAAGAT GGGCATCATG
GACGAGGAGC GGCGTACCTC GATCAACCTC GATGCGTGCA TTCACGAGGC GGCGTCCCGC
GTGGCGTTCA TCAACACCGG CTTCCTCGAC CGTACCGGCG ACGAGATGCA CACCGCCATG
GAAGCCGGCC CCATGCTGCG CAAGGGGGAG ATGAAGGGCA CCAAGTGGAT CGCGGCCTAC
GAAAAGAACA ATGTCCAGAC CGGCTTGGCG TGCGGCCTGC GCGGCCGGGC GCAGATCGGC
AAGGGCATGT GGGCCATGCC GGAGCTGATG GCAGCGATGC TCGAGCAGAA GATCGGCCAT
CCCCAGGCCG GTGCGACCAC GGCCTGGGTG CCGTCACCCA CCGCCGCCGT GCTGCATGCC
CTGCACTACC ATCAGGTCGA TGTCGCGACG ATTCAGCGTG AACTGGAGGC CAAGCCCGGT
GGCGACTTTC TCGACGATCT GCTCACCGTG CCGGTGGTCG AAAGCGCGGC TTCGGGCGCT
AACAAGAGCC CGAGCTGGTC CGACGACGAG ATTCAGCAGG AGCTGGATAA CAACTGTCAG
GGCATCCTCG GCTATGTGGT GCGCTGGGTC GAGCATGGCG TGGGCTGCTC CAAGGTGCCG
GACATCCACG ACGTGGGGCT GATGGAGGAT CGCGCGACGT TGCGTATCTC CAGTCAGCAC
ATCGCCAACT GGCTGCATCA TGGCATCGTC AGCGAGGCAC GGGTACGCGA GACGCTGGAG
CGCATGGCCA AGGTGGTCGA CGACCAGAAT GCCCACGATC CCGACTATAC GCCGATGACC
TCGCACCTGG CGGAGTCCTG CGCCTTTCAG GCGGCATCCG ACCTGATCTT CAAGGGGCGT
GAGCAGCCCG CCGGCTACAC CGAGCCGCTG CTGCACCACT GGCGCGCGGT GCACAAGGCG
AAATCATAA
 
Protein sequence
MTDRTTRHRL QVASQLDRFI NDEALPGTGV DPEAFWAGFD ALVHELVPTN RDLLAERERL 
QDELDAWHKA NPGPVRDMAA YRAFLKDIGY LVEAPAKVQA TTANVDDEIA VQAGPQLVVP
VSNARYALNA ANARWGSLYD ALYGTDVIPE TDGAEKSQGY NPKRGEKVIA YARGVLDQAA
PLAEGTHAEA SAYALRDGKL VVTLQGGGET GLADPAQLVG YRGEAQAPTA VLLANHGLHL
EVQFDATHPI GKTDPAHVKD VLVEAAVSTI MDCEDSVAAV DADDKTLVYR NWLGLMKGDL
EERFDKGGKT VTRALNPDRD YTVPGGGELR LPGRSLLFVR NVGHLMTTPA VLDGDGNEIP
EGMLDGVVTS LLAIHDLKKG DGAAPSATAP EAKRNSRTGS VYIVKPKMHG PREVAFANSL
FMRIEDMLGL PRDTLKMGIM DEERRTSINL DACIHEAASR VAFINTGFLD RTGDEMHTAM
EAGPMLRKGE MKGTKWIAAY EKNNVQTGLA CGLRGRAQIG KGMWAMPELM AAMLEQKIGH
PQAGATTAWV PSPTAAVLHA LHYHQVDVAT IQRELEAKPG GDFLDDLLTV PVVESAASGA
NKSPSWSDDE IQQELDNNCQ GILGYVVRWV EHGVGCSKVP DIHDVGLMED RATLRISSQH
IANWLHHGIV SEARVRETLE RMAKVVDDQN AHDPDYTPMT SHLAESCAFQ AASDLIFKGR
EQPAGYTEPL LHHWRAVHKA KS