Gene Cwoe_5550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5550 
Symbol 
ID8736025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5943365 
End bp5944627 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content77% 
IMG OID646506180 
Productputative RNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003397330 
Protein GI284046990 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCGC GGCGCACCGT CGAGGCGGTT TGGCGGATCG AGTCGCCGCG GCTGATCGCC 
TCGCTGGCGC GGCTCGTCCG CGACGTCGGG CTGGCCGAGG AGCTGGCCCA GGACGCGTTC
GTGCTCGCGC TCGAGCAGTG GCCGCGCGAC GGCGTCCCGC CGAACCCCGG CGGCTGGCTG
ATGGCGACGG CCAAGCACAA GGCGATCGAC CGGATCCGCC GCGAGCGCAC GCGCGACGAC
AAGTACGCGC AGCTCGCGGT CGACCTCGCG GCGTCGGGGG CGGGGAGCGC GCCCGCTCCC
GACGCCGCAG CCGTCGAGGC GGTCGAGGAC GACCTGCTGT CGCTCGTCTT CGTCGCCTGC
CACCCGGTCC TCCCGCGCGA GGCGCGCGTG GCGCTGACGC TGCGGCTGCT CGGCGGTCTC
AGCACCGACG AGATCGCGCG CGCGTTCCTC GTCCCGTCGG CGACGCTCGG GCAGCGGATC
TCGCGCGCCA AGCGCACGCT GGCGGAGGCG CAGGTGCCGT TCGAGCTGCC GGGAGAGGAC
GAGCTGCCGG CGCGCCTGGC GGCGGTCCTG GAGGTCGTCT ACCTCGTCTT CAACGAGGGC
TACGCGGCCA GCTCGACGGA GGCGTGGGTC CGCCGCGACC TCGCCGAGGA GGCGATGCGG
CTCGGGCGCG TGCTCGCCGG GCTGCTGCCG AGAGAGCCCG AGGTGCATGG CCTGAACGCG
CTGATGGAGC TGCAGGCGTC GCGCTTCGGC GCACGCGTCG GCCCCGACGG CGAGCCGATC
CTGCTGCCCG ATCAGGACCG CAGGCGCTGG GACCGGATGC TCGCCGCCCG CGGCCTGGCG
GCGCTCGACC GCGCGACGCA GCTGCGGCGG CCGCTCGGCC CGTACACCGT GCAGGCGGCG
ATCGCCGCCT GCCACGCCCG CGCGCCGAGC TTCGCCGCGA CCGACTGGGA GGCGATCGTC
GCGCTCTACG ACGCGCTCGG CCAGCTCGCG CCGTCGCCGG TGATCGACGT CAACCGCGCT
GTCGCGGTGC TGCACGCGGA CGGTCCGGAG GCCGCGCTCG CAGCGCTGGA GCGCGTGCGG
GAGGACCCGC GCCTGGCCCG CTACCACCTG CTCGGCGCGG TCCGCGCCGA CGTCCTGCTG
CGGCTCGGCC GCGACGACGA CGCCGTCGAG GAGCTGCGGC GCGCTGCCGC CCTCGCCCCG
ACGGAGCGCG AACGCCGGCT GCTGCTCGAC CGCGCCGCGC GCGCCGCCGA GGCCACGGGG
TGA
 
Protein sequence
MDPRRTVEAV WRIESPRLIA SLARLVRDVG LAEELAQDAF VLALEQWPRD GVPPNPGGWL 
MATAKHKAID RIRRERTRDD KYAQLAVDLA ASGAGSAPAP DAAAVEAVED DLLSLVFVAC
HPVLPREARV ALTLRLLGGL STDEIARAFL VPSATLGQRI SRAKRTLAEA QVPFELPGED
ELPARLAAVL EVVYLVFNEG YAASSTEAWV RRDLAEEAMR LGRVLAGLLP REPEVHGLNA
LMELQASRFG ARVGPDGEPI LLPDQDRRRW DRMLAARGLA ALDRATQLRR PLGPYTVQAA
IAACHARAPS FAATDWEAIV ALYDALGQLA PSPVIDVNRA VAVLHADGPE AALAALERVR
EDPRLARYHL LGAVRADVLL RLGRDDDAVE ELRRAAALAP TERERRLLLD RAARAAEATG