Gene Cwoe_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0166 
Symbol 
ID8730594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp162096 
End bp163763 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content74% 
IMG OID646500780 
ProductProtein of unknown function DUF1800 
Protein accessionYP_003391977 
Protein GI284041637 
COG category[S] Function unknown 
COG ID[COG5267] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.450807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGAT CCTCCAAGCC ATCGAAGCAG CGCCGCCCGC GCAGAAAGCC GTGCCGCAGA 
CCGGCGCGCG GCAAGCGCGG CAAGCAGCCG GCCAGCTGCC GCAGAGCGCC GGCGAAGCGC
AGAGCGCCGG CGAGCAGAGC GCCGGCGCCG AAGCGCCCGC CGCGCAGAGC GCCCGACGCC
GAGTCGCCGC ATCCCGCGCT GCCGACGATC CCGAACGGCA CGCCGGTCCC GGGGCTGCCG
CCGGAGGCGG CGCCGCCCAC GCCGCCGCCC GGCGAGCAGC CGCCGGGCAC GCCTCCGCCG
GAGCAGCCGC CGTCCGTCCC GCGGCCGCCC GGCCTGCTCC GCTACGACGG CCCGTTCGGC
GTCGCGCAGG CGACGCGGCT GCAGTGGCGC GCCGGCTTCG GGCCACGTCG GGGCGAGGCG
GCGCAGCTGG CCGCGCTCGG GCTCGACGGC GCGGTCACGG CGCTCACGCG CGCCAGCGGT
CCCGCCCAGC TCGTCGGCCC GGCGCCGTAC ACCGAGTTCG ACGACAGACT CGAGCCCGGC
GTCAGATTCG AGCACGACCA CCTCGCCTGG CTCGACCGGA TGGTGCGCTC GACGCAGCCG
TTCGTCGAGC GGATGGCACT GCTGTGGCAC GACTGGTTCG GCATCAGCGA CGACAACGTC
GGCCAATACA CGCTGCTGGC GAGACACGTC GAGCTCTTTC GCACGCACGG CCGCGGCTCG
TTCCGCGACC TCGCGCGGAT GGTGATGGAG GACGGCGCGA TGCTCGTCCG CCTCAACGGC
GTCGGCAACC AGCGCCACCG CCCGAACGAG AACTTCGCGC GCGAGCTGAT GGAGCTGTTC
ACGCTCGGCC CCGACCGTGG TGCGTACGCC GAGCGCGACA TCCGCGAGGC CGCGCGCGGT
CTGACCGGCT ACGAGGGCGA CTTCCGGCAG CCGAACGGCT GGCAGAGCTT CTGGTTCGAC
CCGCAGTGGC ACGACACGAC GAACAAGGAG ATCTTCGGCA GAGTCGCGCC GTACGATCCC
GACGACGTCG TCGACGCCTG CATCACGCAC CGCCTGCACC CGTCGTTCTT CGTGCTGAAG
CTGTGGTCGG CGTTCGTCGC GCAGCCGCCG CCGCCCGACC AGCGGGTCCT GCTCGAACGG
CTCTACGTCG AGCGCGGGCA CGCGGTCATG CCGGTCGTCG AGGCGATCCT CACGCACCCC
GCGGTCTACG AGGGGCCGCC GCTCGTCAAG CCGCCGGTCG TCTATGCCGC CGGGCTGATG
CGCGCGCTCG GCGCCGGGAT CGAGGCGATC GGCTGGACCG GCTACACCGC CAACGCCGGC
CAGCGGCTCC ACTACCCGCC GACGATCGCC GGCTGGCGCG AGGACCGCTG GCTCGACACC
TCGACGTTGC ACGCCCGCTG GAAGCTCGCC GACCTCGCGC TCAAGGGCCG TCACCTGACC
GCCGAGAGCC CGTACCCGGC GGACGAGACA CCGCAGCAGG CGGTCGCGAT GGCGCTGCGC
TTCTGGGACG ACCCGCCGCT CCACGGCGAC ACCGTCGCGG TGCTGGTCGA GACCGCCGAG
CGGATCGGCC GCCTCTCGAC CGGCGTCAGC GCGAGCGGGC GCAACGCGCT GCGCCAGACG
ATCCTCCGCC ATCTCGCGAT CACGTCCCCC GACGCACAGG TCTGCTGA
 
Protein sequence
MRRSSKPSKQ RRPRRKPCRR PARGKRGKQP ASCRRAPAKR RAPASRAPAP KRPPRRAPDA 
ESPHPALPTI PNGTPVPGLP PEAAPPTPPP GEQPPGTPPP EQPPSVPRPP GLLRYDGPFG
VAQATRLQWR AGFGPRRGEA AQLAALGLDG AVTALTRASG PAQLVGPAPY TEFDDRLEPG
VRFEHDHLAW LDRMVRSTQP FVERMALLWH DWFGISDDNV GQYTLLARHV ELFRTHGRGS
FRDLARMVME DGAMLVRLNG VGNQRHRPNE NFARELMELF TLGPDRGAYA ERDIREAARG
LTGYEGDFRQ PNGWQSFWFD PQWHDTTNKE IFGRVAPYDP DDVVDACITH RLHPSFFVLK
LWSAFVAQPP PPDQRVLLER LYVERGHAVM PVVEAILTHP AVYEGPPLVK PPVVYAAGLM
RALGAGIEAI GWTGYTANAG QRLHYPPTIA GWREDRWLDT STLHARWKLA DLALKGRHLT
AESPYPADET PQQAVAMALR FWDDPPLHGD TVAVLVETAE RIGRLSTGVS ASGRNALRQT
ILRHLAITSP DAQVC