Gene Cwoe_0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0441 
Symbol 
ID8730869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp450417 
End bp451466 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content75% 
IMG OID646501055 
Product4-hydroxy-2-oxovalerate aldolase 
Protein accessionYP_003392252 
Protein GI284041912 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.216235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00115827 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCGCCG CGGCTCAGGC GCCCGCGCGG CTCGACGTGC GCGTCACCGA CACCTGCCTG 
CGCGACGGCT CGCACGCCGT CGGCCACCGC TTCACGCGCG AGCAGGTGCG CGACGTCGTC
GCCGCGCTCG ACGCGGCCGG CGTGCCGGTG CTGGAGGTCA CGCACGGCGA CGGGCTCGGC
GGCAGCTCGT ACAACTACGG CTTCTCCGGC ACGCCCGAGC GCGAGCTGAT CGCGACCGCG
GTCGCGACGG CGAGACGCGC GCGGATCGCC GCGCTGATGC TGCCCGGCGT CGGCACCGCG
GACGACATCC GCGCCGTCGC CGACCTCGGC GTCGAGGTGA TCCGCGTCGC GACCCACTGC
ACCGAGGCCG ACGTCGCGAT CCAGCACTTC GGGCTCGCGC GCGAGCTGGG GCTGGAGACG
GTCGGCTTCC TGATGATGTC CCACTCGCAG CCGCCCGACG TGCTCGCCGC GCAGGCGCGC
GTGATGGCCG ACGCCGGCTG CCAGTGCGTC TATGTCGTCG ACTCGGCTGG CGCGCTCGTG
CTGGAGCAGG TCGCCGAGCG GGTCGAGGCG GTCGGCGCCG AGCTGGGCGA CGACGCGCAG
GTCGGCTTCC ACGGCCACGA GAACCTCGGC CTCGCGATCG CGAACACGGT CGCCGCCGTC
CGCGCCGGCG CGGCGCAGGT CGACGGCTGC ACGCGGCGGC TCGGCGCCGG CGCCGGCAAC
ACGCCGACGG AGGCGCTGGC AGCCGTCTGC GAGAAGCTCG GGATCGAGAC CGGCCTCGAC
GTGCTCGCGC TTGCCGACGC GGCGGAGGAG GTCGTGCGCC CGGCGATGGC GGCCGAGTGC
ACGCTCGACC GCGGCGCGCT GCTGCTCGGC TACGCCGGCG TCTACTCGTC GTTCCTCAAG
CACGCCGAGC GCTCCGCCGA GCGCTACGGC GTGTCGACCG CTCAGATCCT GCTGGCGTGC
GGCGAGCGCC GGCTCGTCGG CGGGCAGGAG GACCAGATCA TCGCGATCGC CGCGGACCTC
GCGGCGGCGC GCAAGGAGGA GGCCGCATGA
 
Protein sequence
MSAAAQAPAR LDVRVTDTCL RDGSHAVGHR FTREQVRDVV AALDAAGVPV LEVTHGDGLG 
GSSYNYGFSG TPERELIATA VATARRARIA ALMLPGVGTA DDIRAVADLG VEVIRVATHC
TEADVAIQHF GLARELGLET VGFLMMSHSQ PPDVLAAQAR VMADAGCQCV YVVDSAGALV
LEQVAERVEA VGAELGDDAQ VGFHGHENLG LAIANTVAAV RAGAAQVDGC TRRLGAGAGN
TPTEALAAVC EKLGIETGLD VLALADAAEE VVRPAMAAEC TLDRGALLLG YAGVYSSFLK
HAERSAERYG VSTAQILLAC GERRLVGGQE DQIIAIAADL AAARKEEAA