Gene Csal_1605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1605 
Symbol 
ID4027567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1825523 
End bp1826542 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content65% 
IMG OID637966794 
Productaminodeoxychorismate lyase 
Protein accessionYP_573657 
Protein GI92113729 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0288124 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAACAAG GTAGCGACAT GCGAGTGGTC AAGATTTTAT TGGGCGGGGC GGCGCTGGTG 
GGCGTGGCGG CATTCGGCGC CTACCAGTAT TGGCAATCCC GGCTGGCGGC CCCGATCGCC
CTCGAGGCGC CCACGATCTA CGAGGTGCCG CGTGGCGCGG GGTTCCAGCA AATCCTGGGC
GAACTCGAGT CGCAAGGCAT CATCGAGGCC GCCTGGCCGT ACCGCGTGCT GGCGAAACTC
TCGCCGGAAG CGGTGAACGG CCTGCGCTCC GGCGAGTTCG AGCTCACCCC GGGCATGAGC
GGTCGCGAGA TGGTGGCATG GCTCTCCAGC GACAATATCG TCACCTATCG CCTCACCATT
CCCGAGGGAT GGACGTTCGC GCAGATGCGT CGCGCACTGG CCGAGGCGCC CAAGCTCGAG
CATCGCACGC AGGACATGAG CGATGCGGAG GTCATGGCGG CGCTGGGGCA TGAGGACGAG
CATCCCGAAG GCCGCTTTTT CCCCGATACG TACCGCTATC ACAAGGGAAT GACGGATCTG
GCGCTGCTCG AACGCGCCTA TGCGCGCATG GACAACATGC TGCGCGACGC CTGGGCGGGA
CGCAGCGACG ATCTGCCGCT CGAGACGCCT TACGAAGCCC TCATCCTGGC GTCGTTGATC
GAGCGCGAAA CGGGCGTGCC GAATGAGCGT CGGCGGATCG CCGGCGTCTT CGTGCGGCGT
CTCGAGCGTG GCATGCGCCT GCAGACCGAT CCCACGGTCA TCTACGGCAT GGGCGAGGAC
TACGATGGCA ACATCACGCG CGATGACCTG CGTCGCGAAA CGCCCTACAA CACCTACGTG
ATCGACGGCC TGCCGCCCAC GCCGATCGCC ATGCCCGGCG AAGCTTCCCT GGAAGCTGCC
GTGGACCCCG CCCCCGGGGA CGCCCTGTAT TTCGTGTCCC GGGGCGACGG ATCGCACTAT
TTTTCCAGTA CGCTGGCCGA ACACAATGCC GCGGTACGCC GCTATATCCT CAACCGCTGA
 
Protein sequence
MKQGSDMRVV KILLGGAALV GVAAFGAYQY WQSRLAAPIA LEAPTIYEVP RGAGFQQILG 
ELESQGIIEA AWPYRVLAKL SPEAVNGLRS GEFELTPGMS GREMVAWLSS DNIVTYRLTI
PEGWTFAQMR RALAEAPKLE HRTQDMSDAE VMAALGHEDE HPEGRFFPDT YRYHKGMTDL
ALLERAYARM DNMLRDAWAG RSDDLPLETP YEALILASLI ERETGVPNER RRIAGVFVRR
LERGMRLQTD PTVIYGMGED YDGNITRDDL RRETPYNTYV IDGLPPTPIA MPGEASLEAA
VDPAPGDALY FVSRGDGSHY FSSTLAEHNA AVRRYILNR