Gene Csal_2692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2692 
Symbol 
ID4028181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3017139 
End bp3019028 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content59% 
IMG OID637967900 
Productaminodeoxychorismate lyase apoprotein / aminodeoxychorismate synthase, subunit I 
Protein accessionYP_574738 
Protein GI92114810 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACAACC TCGAAGAACA GCAGGACGAT AATATATTCA TACTGCTCGA AAATACACGC 
TGTTCGAACG GCAACCGTAC CTCTTTATTG TTCGAGAACC CGGTTTTCGA GGTTATATGC
TATCGCAACG ACGCGTTGCG CGCCGCCTTG CGGGAGATCG ACGAGCTGCG TGGACAGGGC
TATTACCTCA GCGGTTACCT CGCCTACGAA GCCGGTTATG CGCTTTCCGA CAAGCAGGAT
TTCGCCTTTT GCCGGCGCCC TTCGAGCGAC ACGCCACTGG TGCATTTCTA TGCGTTTCGG
GACGTACGGC GTTTGTCCCA GCAGCAAGCG AGCCGGTTCC TCGAGTCACG GACTCCCGAT
GCCACGCCGT CGGCCATTCG CCACCTGGCA CTCAACGAAA CTCGCGACCG CTATCTCAAA
AACATCGAAA AGATAAAGTC CTACATTCGT GAAGGCGATA CTTATCAAAT CAACTACACA
CTGAAGTATC GTCTCGAATA TCAGGGATCG CCGATCACCT TGTATAGAAA ACTTCGTCAT
CGACAAAAAG TCGAATTCGG CGGCTTCCTG AACTTTCCGG AATATTCAGT CCTTTCTCTG
TCGCCGGAGC TGTTCCTGCG CAAACAAGGC ACCGCGCTGG AATCCAAGCC CATGAAAGGC
ACTTTCCCGC GCGGCGTCAC GCCGCAGGAA GATGCCGGCA TTCTCGACAC CATGCGCCAT
GATGCCAAGA CACGCTCGGA AAACGTGATG ATCGTCGATT TGCTGCGCAA CGACATCAGT
CGTATTGCCT CACCAGGGTC GGTGGCCGTC AAGAACCTGT TCGAGATACA GACATTCGAG
ACGCTGCACC AGATGATTTC CACGGTGACC GGCAGTATCG CCTCCGATGC CAGGATCGAG
CATGTCTTCC GCGAACTGTT TCCGTGCGGT TCGATCACCG GAGCCCCCAA GATACGCACG
ATGCAGATCA TCGAGGAGCT GGAACGCGAG CCACGCGGCG TCTATACCGG CGCGATCGGG
TATCTCACGC CGCACAACGA CTTCTGCTTC AACGTTCCCA TTCGCACCTG CATCGCACAT
GCCGACGGTA CGGCTGAGAT GGGCGTCGGC GGCGGTGTGC TCTTCGAGTC CGATGCCGAG
GCGGAGTATG CGGAGTGCCT GCTCAAGGCA CGCTTCCTGA CGGGACTCAA TCAGGACCTG
CAACTGATCG AGACGATGCG CTATTCCAAC GCCGAGGCAC GCATCGAGCA CCTCGAGGAA
CATCTGCAGC GCCTGGCGCG TTCGGCGCAC GATCTGCAGT TCGTCTTCGA TGGGCCACGC
GTGCGTGACG CCCTCGGCGA GGCCATCGCG GATCTTCGTC ACGATGCCAA GGTGCGCCTG
TTGATGGCAC ACGACGGTCA GCTCGAGGTG ACCACGGCTC CGCTGCCGGC CATGCCCGAG
AGTACGCAGA CCGCCCGCCT GGGGATCAGC GACCAGCGTA TCGACCGACG CGATTTCCTG
CTGCAGTACA AGACGACGGA GCGTTCGCTG TACGAGCAGG CTTACCAGCA CCACCGCGAG
GCCGGCGACT ACGACGTCGC TTTTCTCAAC GCGGAAGGAC GCCTGACCGA GGCGAGTCGC
CACAACCTGT TCATCGAAAA GGACGGCCTG TTGCTGACGC CGCCGCTCGA GGAAGGCGTA
TTGCCCGGCA TCGCCCGACG CATGCTCATC GAAACGAGTT GCGAGCGTTG CTGCGAGCGC
CCGCTGACCC CGCAGGATCT GCTGGAGGCC GATGCCATCT GGTTGACCAA TGCCGTGCGT
GGCGTCGTGC CGGTCACGCT CGGCAAGCAG GCTCGCCAAA CCCTCATCGC CGTGGCCGGT
CAGGAGGCTG CACATGCTTT GCTTGATTGA
 
Protein sequence
MHNLEEQQDD NIFILLENTR CSNGNRTSLL FENPVFEVIC YRNDALRAAL REIDELRGQG 
YYLSGYLAYE AGYALSDKQD FAFCRRPSSD TPLVHFYAFR DVRRLSQQQA SRFLESRTPD
ATPSAIRHLA LNETRDRYLK NIEKIKSYIR EGDTYQINYT LKYRLEYQGS PITLYRKLRH
RQKVEFGGFL NFPEYSVLSL SPELFLRKQG TALESKPMKG TFPRGVTPQE DAGILDTMRH
DAKTRSENVM IVDLLRNDIS RIASPGSVAV KNLFEIQTFE TLHQMISTVT GSIASDARIE
HVFRELFPCG SITGAPKIRT MQIIEELERE PRGVYTGAIG YLTPHNDFCF NVPIRTCIAH
ADGTAEMGVG GGVLFESDAE AEYAECLLKA RFLTGLNQDL QLIETMRYSN AEARIEHLEE
HLQRLARSAH DLQFVFDGPR VRDALGEAIA DLRHDAKVRL LMAHDGQLEV TTAPLPAMPE
STQTARLGIS DQRIDRRDFL LQYKTTERSL YEQAYQHHRE AGDYDVAFLN AEGRLTEASR
HNLFIEKDGL LLTPPLEEGV LPGIARRMLI ETSCERCCER PLTPQDLLEA DAIWLTNAVR
GVVPVTLGKQ ARQTLIAVAG QEAAHALLD