Gene Cwoe_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4003 
Symbol 
ID8734461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4252194 
End bp4253345 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content66% 
IMG OID646504628 
ProductRieske (2Fe-2S) iron-sulphur domain protein 
Protein accessionYP_003395795 
Protein GI284045455 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGCGA CTGAGATCCC GAGCAGCGTC CTGCGTCCGG AGGAGCTCGA GAACCTCGGC 
AAGCCGGTCA GCGAGGCCCG GGGGATGCCG CCGCGGGTCT ACACCGACCC GGAGATCTTC
GAGGCGGAGA AGGACAAGAT CTTCAACCGC GAGTGGATGG CGGTGCTGCA CGAGTCGACG
GTCAGAAACC CCGGCGACTA CCGTGTCGTC GAGCTGCTGG GGCAGTCGTT CCTGATCGTG
CGCGGCACCG ACGGCGAGCT GCGCGGCTTT CACAACATCT GCCGTCACCG GGGTGCGAAG
GTCGCCGTCG GCGAGGGCAA CTGCTCCAAG TTCCGCTGCC CGTACCACAC GTGGACATAC
GACCTCAGAG GCTCGCTGAT CGGCGCGCCG ACGATGGCCC ACGTCGTCAG AGAGGGCATC
GGTCTCGTCG ACCTCCGCAT CGACACGTGG CTGGGCTTCG TCTTCATCAA CATCGACGGG
CAGGCCGAGC CGCTGGCCAG CAAGGTGTCG AGACTCGACG ACGTGCTCGC GCCCTGGCTG
GACGTCGACC TCGAGGTCGT CTACGAGCTG CCGTACCCGG GCAACTGGAA CTGGAAGCTG
ACGTACGAGA ACACGATCGA GGGCTACCAC GTGATCGGCA CGCACCTCGA CAGCGCGCAG
CCGATAGCGC CCGGCGAGCT GACCTTCACC TCGACCTCCG ACGACGACTT CGAGACGTTC
ACGGACTTCC GCATGCCGTA TGCGGCGGGT ATGACCATGC GCGACGAGAC GGGCGGCTCG
GTGCCGCTCG ACGGCGTCCC GAGCTGGGTC GACGAGGAGG CGCGCTTCTA CGTCGTCTGG
CCCAACTTCC TCTTCTCGCT CGCGCCGGAG AACCTGACGG GCTACATCGT CCTGCCGGGC
AAGGGGCCGG GGGAGGTCAC GTTCGTGTGG TGCAACGTGG CGCGGCCCGA GACGCTCCAG
ATGCCGAACT TCGCCGAGTA CAAGGCCGCG CAGGAGCTGT GGTCGACGAC GGTCCAGACC
GAGGACCAGT ACCCCTGCGA GACGATGTGG GAGAACATGC ACAGCGACTC GTTCATCCCC
GGCCCGTACG CCGAGGGCGA GCGCGCCGTG TACCACTTCA ACCAGTGGTA CATGAAGCGG
ATGTCGAGCT GA
 
Protein sequence
MSATEIPSSV LRPEELENLG KPVSEARGMP PRVYTDPEIF EAEKDKIFNR EWMAVLHEST 
VRNPGDYRVV ELLGQSFLIV RGTDGELRGF HNICRHRGAK VAVGEGNCSK FRCPYHTWTY
DLRGSLIGAP TMAHVVREGI GLVDLRIDTW LGFVFINIDG QAEPLASKVS RLDDVLAPWL
DVDLEVVYEL PYPGNWNWKL TYENTIEGYH VIGTHLDSAQ PIAPGELTFT STSDDDFETF
TDFRMPYAAG MTMRDETGGS VPLDGVPSWV DEEARFYVVW PNFLFSLAPE NLTGYIVLPG
KGPGEVTFVW CNVARPETLQ MPNFAEYKAA QELWSTTVQT EDQYPCETMW ENMHSDSFIP
GPYAEGERAV YHFNQWYMKR MSS