Gene Csal_2466 details

Gene Information

Locus tag: Csal_2466
Symbol:
ID: 4026604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2773186
End bp: 2774271
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 65%
IMG OID: 637967673
Product: chorismate synthase
Protein accession: YP_574512
Protein GI: 92114584
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
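
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2773186
end_bp = 2774271
gene_length_bp = 1086
protein_length_aa = 361

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp, codons, expected_protein_length)  # prints: 1086 362 361
```

Under those assumptions the three fields agree: 1086 bp is 362 codons, and dropping the stop codon leaves 361 residues.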


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.358807
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGGTA ATACATTTGG TAAGTTGTTC ACCGTCACCA CCTTCGGCGA GAGCCACGGC 
GAGGCGCTGG GGGCCATCGT CGATGGCTGC CCGCCGGGCG TGGCGCTCGA GGCCTCGGAC
TTGCAGCATG ATCTTGATCG GCGCCGCCCG GGAACGTCGC GGCATACCAC CCAGCGCCGT
GAGCCCGATC AGGTGCGCAT TCTTTCCGGG GTGTTCGAGG GCGTCACCAC CGGGACGCCC
ATCGGCCTTC TGATCGAGAA TACCGATCAG CGCTCCAAGG ACTACTCGAA GATCAAGGAC
CAGTTCCGGC CCGCCCACGC CGATTACACC TACCATCACA AGTATGGCAT TCGCGATTAC
CGCGGAGGCG GGCGCTCGAG CGCGCGGGAG ACCGCGATGC GCGTCGCCGC CGGCGCCATT
GCACGCAAGT TTCTGGCCTC GCAGGGCATT CGCGTGCGCG GTTACATGAG TCAGTTGGGC
CCCATCGACA TCGCCTTCAA GCAATGGGAG GCCGTCGACA CCAATCCCTT CTTCTGCCCC
GATCCGGACA AGCTTCCCGA GCTCGAAGCC TTCATGGATC AGTTGCGGCG CGACCAGGAC
AGCGTCGGCG CGCGCATCAC GGTGGTCGCC GACGGCGTGC CGGTAGGGCT CGGTGAACCG
GTCTTCGACC GCCTGGATGC CGACCTGGCG CATGCCTTGA TGAGCATCAA CGCGGTCAAG
GGCGTGGAAA TCGGGGACGG TTTCGCATCG GTTGCCCAGC GGGGCAGCGA GCATCGCGAC
GAAATGACGC CGCAAGGCTT TCTCTCCAAC CACGCCGGGG GAGTGCTGGG CGGCATTTCC
TCGGGGCAGC CCCTGATTGC GCATCTGGCA CTCAAGCCGA CCTCGAGCAT CACCCAGCCC
GGGCGCTCGA TCGATGTGCA CGGGGAGGCA GTCGAGGTCG TCACCAAGGG ACGCCACGAC
CCTTGTGTCG GCATCCGGGC CACGCCGATC GCCGAGGCGA TGATGGCGCT GACGCTCATG
GATCATTACC TGCGTCACCG GGCGCAGAAC GCCGATGTCG AGGTGAGCAC GCCGCGTCTT
GGCTGA
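
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any per-base statistic such as GC content is computed. The following is a minimal sketch of that step. The helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used as sample input; pasting the full sequence into the same function would give the value for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace, then uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first line of the gene sequence, copied from this record.
first_line = "ATGTCCGGTA ATACATTTGG TAAGTTGTTC ACCGTCACCA CCTTCGGCGA GAGCCACGGC"
seq = clean_sequence(first_line)

print(len(seq))
print(f"GC fraction of this line: {gc_fraction(seq):.3f}")
```

The value for a single line will generally differ from the whole-gene GC content reported in the record.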
 
Protein sequence
MSGNTFGKLF TVTTFGESHG EALGAIVDGC PPGVALEASD LQHDLDRRRP GTSRHTTQRR 
EPDQVRILSG VFEGVTTGTP IGLLIENTDQ RSKDYSKIKD QFRPAHADYT YHHKYGIRDY
RGGGRSSARE TAMRVAAGAI ARKFLASQGI RVRGYMSQLG PIDIAFKQWE AVDTNPFFCP
DPDKLPELEA FMDQLRRDQD SVGARITVVA DGVPVGLGEP VFDRLDADLA HALMSINAVK
GVEIGDGFAS VAQRGSEHRD EMTPQGFLSN HAGGVLGGIS SGQPLIAHLA LKPTSSITQP
GRSIDVHGEA VEVVTKGRHD PCVGIRATPI AEAMMALTLM DHYLRHRAQN ADVEVSTPRL
G
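
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acids to the 64 codons as the standard genetic code (to my understanding the two tables differ only in which codons may act as start codons). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: these assignments apply to translation table 11 as well as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
gene_start = "ATGTCCGGTA ATACATTTGG TAAGTTGTTC"
print(translate(gene_start))  # prints: MSGNTFGKLF
```

The output matches the first block of the protein sequence, and translating the final six bases, GGCTGA, gives the terminal G followed by a stop codon.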