Gene Cwoe_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3036 
Symbol 
ID8733482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3242230 
End bp3244101 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content73% 
IMG OID646503651 
Productaminodeoxychorismate lyase 
Protein accessionYP_003394830 
Protein GI284044490 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0256494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.08592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAC GCGGTGGAGA TCGCACGCCT GAGGAGCGCG AGCGCGCACG TCGCGAGCGC 
GAGCGGCGCC GCTACGAGCG CGCCGGCGAG CCCGTGCCCG AGCACCTGCT CGAGCCCGAG
GAGCGCGCGG CCGCCGCGCC GCCTGAGCCG GTCGCACCGG AGCCTGAGCC GTTCGAAGCG
GAGCCTGAGC CGTTCGAAGC GGAGCCCGAT CCGGTCGACG CGGAGCCGTT CGCACACGAG
CCGGCGCCGG TCGAGCCGGA GCCCGTCGCC TACGAGCCGC CGCCGGTCGA GCCCGTCGGT
CCGGAGCCGG CCGCCTATGA GCCGCCGCTC GAGCCGGTCG AGCCGGAGCC GGCCGCGCCC
GTCGAGCCGC CCGCGGCGGA GCGGGGACAG TCGCACGATC CCCAGGAGAC GGTCCAGTGG
GACGTCAGCC AGGCGTGGGC CGACGAGCAC CCGCCGCACG CCGACGTCGC CGCTCCCGCG
GCCGCAGCGC ACGCGACCCA GGCGCACGAT CACGAGCACG CCGCCGCGGT CGAGCAGCCG
CCGCAGGCCG AGCCGCCGCC CGCGCACGAC CCGCAGGCGA CCCAGGCGCA CGACGTGCAG
GCCGACTGGG GCGCGGAGTG GACCGACGAG CACGCGGCTG TCGAGCAGCC GTCGGAGGAG
CATGAGGCGC CACTCGGCAC CAAGCGCATC AGCGGCAAGG ACAGGATGCA CCTGCCGCAC
ATCCACCGGC CGACGCGCGG CGAGCGGACC GGGAAGGGTC GCGGCGTGCG CGTGAAGCGG
CCGGGGGAGA CGTTCGGCGG CGGCCCGCGC ACGCGGCGCA GCGTCGGCGG GCGGATCTTC
GCAGGCTTGT TCGTGCTGCT CGGGATCGCG CTCGTGTGGT TCCTCGTCTC GCTGTTCCAG
CCGTTCGGCG GCGGGGGCGA CGGCAGCGGC AGAGTCGCCG TCACGATCCC GGAGGGCGCG
AGCGCCGGTG ACATCGGCAA GCTGCTGGCT AACAGAGGCG TCGTCGACTC CGGCTTCTTC
TTCGGCCTGC GGGCGACCGT CTCCGGCGAG CGCAGCAACC TCAAGTCCGG CAGATACACG
CTCAGAGAGG ACATGAGCTA CGGCGCGGCG CTCGACGCGC TGACGTCTGA GCCGGAGGTC
AGAAGAGTCG CGACCGTCAG CGTCTCGATA CCGGAGGGCC GCAGCCGTCG CGAGACGGCG
AGAATCGCCA GGCAGTCCGG CCTCAGGGGC GACTACTTCA CCGCCTCGCG CAGATCGCGC
CAGCTCGACC CGCGCAGATA CGGCGCGCCG GCCGGCGCGA CGCTGGAGGG CTTCCTGTTT
CCGGCGACAT ACGAGCTGAG ACGCGGCGCG AGAGTCCAGC GGCTCGTCGA CGACCAGCTG
AGAGCGTTCA AGCAGAACTT CGCCGGGATC AACCTCAGAT TCGCCAGAAG CAAGCAGCTG
ACCGCCTACG ACGTGCTGAC GATCGCCTCG ATGGTCGAGC GCGAGGTCAG CGTCGCGAGA
GAGCGGCCGC TCGTCGCCGC CGTGATCTAC AACCGCCTGC GCGACTCGAT CCCGCTTGGG
ATCGACGCGA CGCTGCGGTT CGAGCAGAAC GACTGGGTCA ACCCGCTGCG CCAGTCGGTG
CTCGACGCCG ACACGCCGTA CAACACCCGC CGCAAGCTCG GCCTGCCGCC CGGGCCGATC
GGCAGCCCCG GCCTCGCGTC GATCAGAGCG GCGGCGAACC CGGCCAGAAG CGACGCGCTC
TACTACGTCG TCAGACCGGG GACGTGCGGC GAGCACGCGT TCGCGCCGTC CTACGAGCAG
CACCTGCAGA ACGTCCAGCG CTACGAGCAG GCGCGGCAGG CCGCCGGCGG GAGATCGCCG
ACGAGATGCT GA
 
Protein sequence
MSRRGGDRTP EERERARRER ERRRYERAGE PVPEHLLEPE ERAAAAPPEP VAPEPEPFEA 
EPEPFEAEPD PVDAEPFAHE PAPVEPEPVA YEPPPVEPVG PEPAAYEPPL EPVEPEPAAP
VEPPAAERGQ SHDPQETVQW DVSQAWADEH PPHADVAAPA AAAHATQAHD HEHAAAVEQP
PQAEPPPAHD PQATQAHDVQ ADWGAEWTDE HAAVEQPSEE HEAPLGTKRI SGKDRMHLPH
IHRPTRGERT GKGRGVRVKR PGETFGGGPR TRRSVGGRIF AGLFVLLGIA LVWFLVSLFQ
PFGGGGDGSG RVAVTIPEGA SAGDIGKLLA NRGVVDSGFF FGLRATVSGE RSNLKSGRYT
LREDMSYGAA LDALTSEPEV RRVATVSVSI PEGRSRRETA RIARQSGLRG DYFTASRRSR
QLDPRRYGAP AGATLEGFLF PATYELRRGA RVQRLVDDQL RAFKQNFAGI NLRFARSKQL
TAYDVLTIAS MVEREVSVAR ERPLVAAVIY NRLRDSIPLG IDATLRFEQN DWVNPLRQSV
LDADTPYNTR RKLGLPPGPI GSPGLASIRA AANPARSDAL YYVVRPGTCG EHAFAPSYEQ
HLQNVQRYEQ ARQAAGGRSP TRC