Gene Cwoe_3736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3736 
Symbol 
ID8734191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3968627 
End bp3969967 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content74% 
IMG OID646504358 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_003395528 
Protein GI284045188 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0799866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGCCGA ATCGCTCGGC CGGAGGCCTC GGATTCGCCG CCGCGGTCCG CTTCGACCCG 
CCCGCGAGCG GGCTGCGCGG GTCGCTGCGG GTCCCGCCGG ACAAGTCGAT CTCGCATCGC
GCGGCGCTGT TCGCGGCGAT GACGCCCGAG CCGGTCAGCG TCACCAACTA CCTCGACGCG
GCCGACACGA ACTCGACGCT CGCCGCCGTC GAGCAGATCG GCGCGCTGGT CCAGCGGCGC
GGTGCCGGCG AGCTGTTGAT CCGCGGCTGC GGCCTGCGCG ACGCGCACGA GAGCGACGGC
CCGATCGACG TCGGCAACGC CGGCACGCTG ATGCGGCTGC TGCCCGGCTG GCTGGCGACG
CAGCCCGGCC GCTCGTGGAC GTTCGACGGC GACAGCTCGA TCCGCAGACG CCCGATCGAC
CGGATCGCCG ACCCGCTGCG GCTGATGGGC GCGCGGATCG ACGCGACCGA CGAGCGCTTC
CCGCCGTTCA CGCTCCACGG CGCCGACCTG ACCGGGATCG AGTACCCGAT GCCAGTCGCC
TCCGCGCAGG TCAAGTCGTG CGTGCTGATC GCGGGCATGA CGACCGCCGG CGGCACGACC
GTGATCGAGC CGGCGCCGAG CCGCGACCAC ACCGAGCGGA TGCTGGCGGC CGCCGGCGCG
CCGGTCGAGC GCGACGGGAA CCGCGTGACG GTGCGCCACG TCGACGAGCT GGGGCTCGAC
GCGATCGCGG TGCCCGGCGA CCTCTCCAGC GCGGCCTTCT GGGTCGCCGC GGCGGTGCTC
GTGCCCGGCT CGCGGATCGT GCTGGAGGAC GTCAACGTCA ACTGGACGCG CACCGGCTTC
CTGCGGATCG TCGAGCGGAT GGGCGGCATC GTGCTCGGCG ACCTGGAGGA GCATGGCGCC
TTCACGCCGG GCGAGCCGAT CTCCGAGCTG GACGTCGCGC ACGGCCCGCT GAGCGCGACG
ACGGTCGAGG CCGAGGAGGT CCCGCTGGCG ATCGACGAGC TGCCGCTCGT CGCGCTGCTC
GGCTGCTTCG CCGACGGCGA GACGGTCGTG CGCGGCGCCG CTGAGCTGCG CGTGAAGGAG
TCCGACCGGA TCCAGACGGT CGTGGACGGG CTGAACGGGC TCGGCGCCGA CATCGAGGGG
ACCGACGACG GCTTCGTCGT GCGCGGCGGG ACCGGGCTCC GCGGCGGGCG GATCTCCTCG
CACGGCGACC ATCGCCTCGC GATGCTCGGC GCGGTCGCCG GGCTGGCGTC GCGCGAGGGT
GTCGAGGTCG AGGGGATGGA CGCCGCCGCG GTCTCCTACC CAGGCTTCGT CGCCGACCTC
GACACGCTGC TCGCGCGCTA G
 
Protein sequence
MRPNRSAGGL GFAAAVRFDP PASGLRGSLR VPPDKSISHR AALFAAMTPE PVSVTNYLDA 
ADTNSTLAAV EQIGALVQRR GAGELLIRGC GLRDAHESDG PIDVGNAGTL MRLLPGWLAT
QPGRSWTFDG DSSIRRRPID RIADPLRLMG ARIDATDERF PPFTLHGADL TGIEYPMPVA
SAQVKSCVLI AGMTTAGGTT VIEPAPSRDH TERMLAAAGA PVERDGNRVT VRHVDELGLD
AIAVPGDLSS AAFWVAAAVL VPGSRIVLED VNVNWTRTGF LRIVERMGGI VLGDLEEHGA
FTPGEPISEL DVAHGPLSAT TVEAEEVPLA IDELPLVALL GCFADGETVV RGAAELRVKE
SDRIQTVVDG LNGLGADIEG TDDGFVVRGG TGLRGGRISS HGDHRLAMLG AVAGLASREG
VEVEGMDAAA VSYPGFVADL DTLLAR