Gene Clim_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2047 
Symbol 
ID6355025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2256123 
End bp2257544 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content55% 
IMG OID642669643 
Productisochorismate synthase 
Protein accessionYP_001944055 
Protein GI189347526 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1169] Isochorismate synthase 
TIGRFAM ID[TIGR00543] isochorismate synthases 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.489046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTATGC GTGAACAGCA GAACATCATC ATACCTGAAC AGGAACCCCT GCCGATTGAC 
CGGGCTGTAG CCGCCCTGCG GAAGGCGATA CAAGCTTATG ATCCCTCCGC AGTCAACAGA
AGTCCGGCAC TGAGTATCTT CCGTCAGCGG GTTCTGCCGG CTGACCCCCT TATCTGGCTT
TTCCGGCAGA GAGTCTTTCC CAGGGTATTC TGGATGAACC GGGAGAAAGA CTGCACCCTC
GCGGGCATCG GTTCTGCCGA CTGGATACGG CACGAAGCGG AAGGCTCGAA CAGCGACAGC
TTCGACCTCC TTGTGCGCAC ACTCTCGGAA AAAGATCCTG CCGTCCGTTA TATTGGCGGA
TTCCGTTTCA ATAATATGGA AAGTCAGGAC GAAACGTGGA GTGCCTTCCC GTCATTCTCC
TTCGTGCTCC CCCTCGTCCT GTATGCAGAA GAGAGAGACG GCTCCTGGTT GAGCTGCCAC
CTCTTCGTGA AAGAGGGCGA AGATAGCGGC AGAAAAAAAA CCGTGCTGCT GCAAACGCTC
GAAGCCCTCG ATCTCAAAGC TGACGCCGCT ATCCCGGAAA TGCCGCTCCT GAAACAGGCC
TCCTGTATTC CTGACCGGAA ACTCTGGGTT GAAGGGTGCC GTAAAGCCCT GGGACTGTTT
GCATCCGGCG AAATGGACAA AATCATGCTG GCCCGAAGAA CCGTTCTCGA GTTCGGCAGC
AGTTTTTCTC CGCTGCTCTA CCTGATCCGC TACCCTTATC CCCGGAATGC GACATTCCGG
TTCTGTTATG AGCCGATGGA AAATCATGCG TTCATCAGCT TTACGCCCGA ACGCCTCTAC
CGGCGGGACG GGCAGATGAT TCTCACCGAA GCCCTGGCGG GAACCTGTCT GAAAGAGAGC
ATGAACGGCA ACGATTTTCA CGCCTCGGAA ATACTGCTCA ACTCTGAAAA AGATATCAGG
GAACACGGCT TTGTAAAAGA AGCCATATTC AGGGCGCTGC AGCCGGTTTC AAGCTCCTTT
GAAATGGAGC AGAATCTCCG GGTACTGCAG CTGAACCGCC TGGCCCATCT CTATACCTGC
TGTAAGGCAA CCCTGAAACC GGAGTACAGC AGCGACAGTA CCGTCCTCTC GGTACTGCAC
CCGACCCCTG CTGTCGGCGG CGTGCCAAAA AACGAAGCGA TGCAGCATAT TCTGGATCTC
GAACCGTTCT GCCGGGGCTG GTATGCCGCT CCCGTCGGAT GGATCAGCCG CGACAGCGCC
GAGTTTGCTG TCGGTATCCG TTCAGCTCTG GTTTCTGAGG AGTTCACGAA CCTTTACTCC
GGAGCCGGTC TGGTCGAAGG TTCGGATCCC GACCTCGAGT GGGATGAGAT AGAACAGAAA
ATCGGCGACC TTATGGCTAT TGCGAGGGGT TCCCATGAAT AA
 
Protein sequence
MLMREQQNII IPEQEPLPID RAVAALRKAI QAYDPSAVNR SPALSIFRQR VLPADPLIWL 
FRQRVFPRVF WMNREKDCTL AGIGSADWIR HEAEGSNSDS FDLLVRTLSE KDPAVRYIGG
FRFNNMESQD ETWSAFPSFS FVLPLVLYAE ERDGSWLSCH LFVKEGEDSG RKKTVLLQTL
EALDLKADAA IPEMPLLKQA SCIPDRKLWV EGCRKALGLF ASGEMDKIML ARRTVLEFGS
SFSPLLYLIR YPYPRNATFR FCYEPMENHA FISFTPERLY RRDGQMILTE ALAGTCLKES
MNGNDFHASE ILLNSEKDIR EHGFVKEAIF RALQPVSSSF EMEQNLRVLQ LNRLAHLYTC
CKATLKPEYS SDSTVLSVLH PTPAVGGVPK NEAMQHILDL EPFCRGWYAA PVGWISRDSA
EFAVGIRSAL VSEEFTNLYS GAGLVEGSDP DLEWDEIEQK IGDLMAIARG SHE