Gene Cwoe_2872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2872 
Symbol 
ID8733316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3066462 
End bp3068051 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content69% 
IMG OID646503485 
ProductRNA binding S1 domain protein 
Protein accessionYP_003394666 
Protein GI284044326 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.569342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACTA CCGACGTCCA GCCCACCGCC ACGATCGTGG AGGGCTCCGA TGGCCTTCTG 
CTCGAGATCG ACGGCCAGAT CGTCCCGAAC TACGACGCGA CGCTCACTCC ATTCGAGGAG
GGCGACGTCG TCACCGGCCA CGTCGTCCGC ATCGACAAGG ACGAAGTCCT CGTCGACATC
GGCTACAAGT CCGAAGGCGT CATCCCGTCC AACGAGCTGT CGATCCGCAA GTCGGTCGAC
CCGAAGGACG AAGTCGAGAT GGGCGAGGAG GTCGACGCGC TCGTCCTCAC GAAGGAGGAC
CAGGACGGCC GGCTGATCCT CTCGAAGAAG CGTGCGCGCT TCGAGAAGGC GTGGCGTCGC
ATCGAGGCTG CCGCCGAGTC CGGCGAGCCC GTCGACGGCA CCGTCATCGA GGTCGTCAAG
GGCGGCCTGA TCATCGACCT CGGGGTCCGC GGCTTCCTGC CCGCCTCGCT CGTCGACATC
CGCCGCGTGC CGCACCTGGA CGAGTACCTC GGTCAGACGA TCGAGTGCAG AGTCATCGAG
CTGAACCGTT CAAGGAACAA TGTCGTCCTC TCGCGCCGCG CGGTGCTGGA GGAGCAGCGC
AGAGAGGACC GCGAGCGGAT CCTCGACCGC CTGCAGCCGG GCATGATCGT CGAGGGCACG
ATCTCGAACA TCGTCGACTT CGGCGCGTTC GTCGACCTCG ACGGGATCGA CGGCCTGATC
CACATCTCCG AGCTGTCGTG GTCGCACGTC AACCACCCGA GCGAGATCCT CTCGATCGGC
GACACGGTGA GCGTGAAGGT GCTCGACATC GACCGCGACC GCCAGCGCAT CTCCCTCGGT
CTCAAGCAGA CCCAGGAGGA CCCGTGGCAG CGCGTCGTCG ACACCTACAA CGTCGGTGAC
GAGCTCGAAG GCAGAGTCAC GAAGGTCGTC ACGTTCGGCG CGTTCGTCGA GATCCTCGAC
GGCGTCGAGG GCCTCGTGCA CATCTCGGAG CTGGCGCAGC ACCACGTCGA GAACCCGCGC
GAGATCATCC AGCCCAACGA CGAGGTGAAG GTCAAGATCC TCGAGATCGA CTCCGAGCGC
CGCCGGCTCT CGCTCTCGAT CAAGCGGGTC GAGGGCCAGA TCCTGCCCGT CCGCAGCCTG
GAGGGCGAGG ACGGAGCGGA GGGCGCGGTC GCCGAGGACA CCGGCTACGA CAACGTGCCG
GAGCTGGGCC TGTCGGAGGA CGTCTTCGCC GAGGGTGCGA CGCCGGAGGC TCCGGCCGCC
GACGCCGCGC CGGACTCGCC TGAGGTCGTC GCCGAGGTGC AGGAGGCCGA GGCCGCAGCC
GAGGAGGCCG CGATCGCCGA GGAGGCGCCC GTCGCCGCCG AGGATCCCGA GGTCGAGGTC
GCCGCACCGG AGGCCGCTGC CGAGCCGGAG GCCGCCGCTG AGCCGGAGGC CGCTGCCGAG
CCGGAGGCTG CCGCTGAGCC GGAGGCTGCC GCTGAGCCGG AGGCTGCCGC TGAGCCGGAG
GCCGCTGAGG CCGACGCCGC CGCCGAGCCG GACGCGCCCG CCGAGCCGGA GGCTGAGGCC
GCCGGCGACG ACGAGACGCC GCAGGCGTAG
 
Protein sequence
MSTTDVQPTA TIVEGSDGLL LEIDGQIVPN YDATLTPFEE GDVVTGHVVR IDKDEVLVDI 
GYKSEGVIPS NELSIRKSVD PKDEVEMGEE VDALVLTKED QDGRLILSKK RARFEKAWRR
IEAAAESGEP VDGTVIEVVK GGLIIDLGVR GFLPASLVDI RRVPHLDEYL GQTIECRVIE
LNRSRNNVVL SRRAVLEEQR REDRERILDR LQPGMIVEGT ISNIVDFGAF VDLDGIDGLI
HISELSWSHV NHPSEILSIG DTVSVKVLDI DRDRQRISLG LKQTQEDPWQ RVVDTYNVGD
ELEGRVTKVV TFGAFVEILD GVEGLVHISE LAQHHVENPR EIIQPNDEVK VKILEIDSER
RRLSLSIKRV EGQILPVRSL EGEDGAEGAV AEDTGYDNVP ELGLSEDVFA EGATPEAPAA
DAAPDSPEVV AEVQEAEAAA EEAAIAEEAP VAAEDPEVEV AAPEAAAEPE AAAEPEAAAE
PEAAAEPEAA AEPEAAAEPE AAEADAAAEP DAPAEPEAEA AGDDETPQA