Gene Cpha266_2088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2088 
Symbol 
ID4570436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2420529 
End bp2421962 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content52% 
IMG OID639766670 
Productisochorismate synthases 
Protein accessionYP_912524 
Protein GI119357880 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1169] Isochorismate synthase 
TIGRFAM ID[TIGR00543] isochorismate synthases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000640643 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGACA AGCGACATAC CATTATTGCT GACAACAACC CGCTTCCTCT GCAAAAGGCC 
GTTCAAAGCC TTCTGCGCGA GGTTGAACGG TTAAAAGAGA GAACGACAGA TCGATTCGAA
GAAAGCCCGT CCGGCTCGCT TCTGACCATC AGCCAGCCTC TTCTTCCGCT CGATCCGCTT
GACTGGCTTA ACCGCCAGCA TCTCTTCCCC AAACTGTACT GGATGAACCG CGAAAAAAGT
TTTTCGGTCG CAGGAATAGG TACGGCTGAT TGCATTGAAC AAAACACGCC GGGCACAAAC
GCATCGAGTT TTGCCGAGCT TACCCGGACA ATAGCCACAA AAGATCCCGA CGCGCGGTAT
TTCGGAGCAT TCCGGTTCAA CAATATGGAG GAACAAAGCG AGCCGTGGCA CTCATTCTCC
TCCTATGCTT TTGTTCTGCC CCTTGTCGGG ATAACGTTTG AACAGGAACG GTACGTACTC
TTCTGCAATC TTTGGCTGGA ACCGGGGGAG GCGCCTGACA TAAAAATCCG GAGCATTTGT
GATGCTCTCG AAAACATGTC GACCACGCAG TCGGATTGCG ATAGCGACCG AAATATTCCC
GCACTGGTGC GTATCTCCCG CAATCCGGAT GTACAAAGCT GGACCAGACA GTGCGAACGG
GCACTGCGAA CATTCGAGGC AGGCGACATG GATAAAATCA TGCTGGCCCG ACAAACCATT
CTTGAATTTT CGGAAAGTTT TTCGCCGCTG CTTTTTCTCA TCAACTATCC TTATCCGAAA
AACTCGACCT ACCGGTTTTA CTTTGAGCCG AAAAAAAACC ATGCGTTTTT CAGTTTCACT
CCTGAACGCC TCTATCGCAG GGATGGCGTC ACGTTGCAGA CCGAAGCCCT TGCGGGAACC
AGTCTGAAAG AGAATCTCAC CGGTGACGAC AACCTTGCTT CCGAAGTCCT TCTGAACTCC
GAAAAAGATA TCAGGGAACA CAAATTCGTC AAAGACAGCA TCTACGGGGA GCTGTTTCCG
GTTTGCAGCG AGATTCAGAT GGATGAACAG GTCCATGTGC TTCAACTGAA CCGTCTGGCT
CATCTTTATA CCCGATGCAG CGCAACGCTC AAGCCGGAGT TCAGCAATGA CAGTACCTTG
CTCACCCGCC TCCACCCTAC GCCTGCCGTT GGAGGAGTTC CGAGGGATGA GGCGCTTCGG
CATATTCTCG ATATTGAACC CTTCAACCGC GGGTGGTATG CCGGCCCTGC CGGATGGATA
AGCAGCAATG CCGCTGAATT CTGTGTCGGC ATCCGATCCG GAGTCGTTGT CGAAGCGATG
ACCTTTCTCT ACTCCGGTGC CGGTCTGGTC AAGGGATCAG ACCCCGTCTC GGAATGGGAT
GAAATCGAGC AGAAAATCGG AGATCTCCTG ACCACAGCAA ACGGCGATAC ATGA
 
Protein sequence
MSDKRHTIIA DNNPLPLQKA VQSLLREVER LKERTTDRFE ESPSGSLLTI SQPLLPLDPL 
DWLNRQHLFP KLYWMNREKS FSVAGIGTAD CIEQNTPGTN ASSFAELTRT IATKDPDARY
FGAFRFNNME EQSEPWHSFS SYAFVLPLVG ITFEQERYVL FCNLWLEPGE APDIKIRSIC
DALENMSTTQ SDCDSDRNIP ALVRISRNPD VQSWTRQCER ALRTFEAGDM DKIMLARQTI
LEFSESFSPL LFLINYPYPK NSTYRFYFEP KKNHAFFSFT PERLYRRDGV TLQTEALAGT
SLKENLTGDD NLASEVLLNS EKDIREHKFV KDSIYGELFP VCSEIQMDEQ VHVLQLNRLA
HLYTRCSATL KPEFSNDSTL LTRLHPTPAV GGVPRDEALR HILDIEPFNR GWYAGPAGWI
SSNAAEFCVG IRSGVVVEAM TFLYSGAGLV KGSDPVSEWD EIEQKIGDLL TTANGDT