Gene Cwoe_1551 details


Gene Information

Locus tag: Cwoe_1551
Symbol:
ID: 8731991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 1639945
End bp: 1641543
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 77%
IMG OID: 646502169
Product: RNA polymerase, sigma-24 subunit, ECF subfamily
Protein accession: YP_003393354
Protein GI: 284043014
COG category: [K] Transcription
COG ID: [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
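
The length fields above follow from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1639945
end_bp = 1641543

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # matches the Gene Length field (1599 bp)
print(protein_length, "aa")   # matches the Protein Length field (532 aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.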


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.357255
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.29633
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAGCGT CTTCGATCCC AGCCACGGCA GGTGTTCCCC GGCAGAAGAG CCGGGCCTCC 
GTCGGCCCCC TGCTGCGGCT GCGCTCCGAC GAGCAGCTCG TCAAGCTCTT TCGCCAGGGC
AACGAAGAGG CGTTCCGCGC GATCCACGAT CGCTACCGCG CACGCCTGTT CGCCTACACG
CGCCAGATGC TGTCCGGCTC GAGGCAGGAC GCCGAGGACG CGCTGCAGGA CGTCTTCGTG
CGCGCGTACG GCGCGCTGCG CGCCAACGAC CGCGAGGTCT CGCTGCGCGC CTGGCTCTAC
CGCGTCGCGC ACAACCGCTG CATCGACGAG CTGCGACGGC CCGCGCCGCC GCCGCCTGAG
ATGTTCGAGC AGATCCGGCC GCCGGCGAAC GACCCGATCG CCGAGACCGA GCAGCGCGAG
TCGCTGCGGC GCCTGGTCGA GGACGTCAGG CGGCTGCCGG AGCAGCAGCG CTCCGCCCTC
CTGATGCGCG AGATCGTCGG CATGTCGTAC GCCGACCTCG CCGCGGCGCT CGACGTGACC
GTCCCGGCCG TCAAGTCGCT GCTGGTGCGC GCCCGCATGG GCCTCGCGCA GGCCGCGGAG
GCGCGCGACA CCGCCTGCGT CGAGATCCGC GAGGAGCTGG TCGGCGCGCA CGACCGCGGG
GTCCGCGCCA GCGGCCTCGC GCGGCGCCAC ATGCACGACT GTGCCGGCTG CCGCGCGTAC
AAGAGAGAGC TGAAGTCGAT GCGCGAGCGC TTCGCCGCGC TGACGCCGGC GCTCGGGCCG
TTCGCGCTGG TCGCGAAGCT GCTCGGCATC GGCGGCGGCG GCGCGGCCGC GGGCGGCACC
GCAGCCGGCG GCGGGGCCGC CGCCGGCGGT GCGGCGGCGG TCGGCTACGG CGCGGCCGTC
GGCGGCACCG TCAGCGCGGG CCACGTCGCG GCCGTCGTCG CGGCGGCGGT CGTCGGCGCC
GGCGGCGCGG TCGAGGTCAA GCGCACGCTG AACCCGCCCC AGCAGTCCGC CAAGGGCGCG
GCGATCGTGC AGGTCGAGAA GCCGCGTGAC CGGCCGACCT TCGCGGCCGC GGTCGCGGCG
GACACGGCGC CGGCCGTCGC GGCGACGCCC GCCTCGTCGG CCTCCGCCTC GACAGACGCG
CGTGGGACGG CGGAACCGGC GAAGGTCAAG GACGCGCGGC CGAGAGCCAC GGCGCCGGCG
CGCGTCGTCG CGACCCCGCC CACGACCGCT CCGATCACGA CCGGCAACGG CAACGGCGGC
GCTGAGGCGC CGGCCGACGA GACGCTCGAG GAGCCGGTCG TCGTCGACCC CGTCACGACG
CCGCCGGCGG AGCCGACGAC CGGGAGCGGC GGCACTACGG GGACCGGCGG GACGACCGGC
AGCGGTGGCA CGACCGGCAC CGGCGGGACC ACCACCGGCG GCACGACGAC CGCGCCACCG
ACCGGCGGCA CGGCCGGGAC GGGCTCCGGC GCACCGCCGA CGACCGCGCC GACGACGCCG
ACGACCGGGT CGACGCCCAC GACGCAGCCG CCCGCGACGC CGTCGTCGAC CGGCGGCGCC
GGCACGACGC CGAGCAACAC GAACCCGCCG GCCCGCTGA
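
The gene sequence is printed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction that can be compared against the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Return the fraction of G and C bases in a block-formatted DNA sequence."""
    dna = clean_sequence(blocked)
    if not dna:
        raise ValueError("empty sequence")
    gc_count = dna.count("G") + dna.count("C")
    return gc_count / len(dna)


# Usage with the first two blocks of the gene sequence above.
example = "GTGGAAGCGT CTTCGATCCC"
print(len(clean_sequence(example)))   # 20 bases after cleaning
print(gc_fraction(example))           # 12 of the 20 bases are G or C
```

Passing the full gene sequence to `gc_fraction` in the same way gives a value to check against the reported GC content.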
 
Protein sequence
MEASSIPATA GVPRQKSRAS VGPLLRLRSD EQLVKLFRQG NEEAFRAIHD RYRARLFAYT 
RQMLSGSRQD AEDALQDVFV RAYGALRAND REVSLRAWLY RVAHNRCIDE LRRPAPPPPE
MFEQIRPPAN DPIAETEQRE SLRRLVEDVR RLPEQQRSAL LMREIVGMSY ADLAAALDVT
VPAVKSLLVR ARMGLAQAAE ARDTACVEIR EELVGAHDRG VRASGLARRH MHDCAGCRAY
KRELKSMRER FAALTPALGP FALVAKLLGI GGGGAAAGGT AAGGGAAAGG AAAVGYGAAV
GGTVSAGHVA AVVAAAVVGA GGAVEVKRTL NPPQQSAKGA AIVQVEKPRD RPTFAAAVAA
DTAPAVAATP ASSASASTDA RGTAEPAKVK DARPRATAPA RVVATPPTTA PITTGNGNGG
AEAPADETLE EPVVVDPVTT PPAEPTTGSG GTTGTGGTTG SGGTTGTGGT TTGGTTTAPP
TGGTAGTGSG APPTTAPTTP TTGSTPTTQP PATPSSTGGA GTTPSNTNPP AR
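
The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as start codons. It assumes the sequence is given in the coding orientation and that the first codon is a valid start codon, which is why the first residue is written as M even though the gene begins with GTG. The name `translate_cds` is a hypothetical helper.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(blocked_dna: str) -> str:
    """Translate a block-formatted coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is translated as M.
    """
    dna = "".join(blocked_dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[index:index + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


# Usage with the first three blocks of the gene sequence.
print(translate_cds("GTGGAAGCGT CTTCGATCCC AGCCACGGCA"))  # MEASSIPATA
```

The output for these 30 bases matches the first block of the protein sequence above.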