Gene Cwoe_4652 details

Gene Information

Locus tag: Cwoe_4652
Symbol:
ID: 8735118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4953952
End bp: 4955142
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 71%
IMG OID: 646505281
Product: glycosyl transferase group 1
Protein accession: YP_003396440
Protein GI: 284046100
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
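
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information record above.
start_bp = 4953952
end_bp = 4955142
gene_length_bp = 1191
protein_length_aa = 396

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 1191 bp gives 397 codons, of which 396 encode amino acids.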


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.34008
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.779648
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCCCG TTCGCATCCT CATCCTCTCC TGGGAGTACC CGCCGCTGAT CGAGGGCGGG 
CTCGCCCGTC ACGTGCGCAA GCTGTCCGAG CAGCTGGTGG CGCGCGACGT CGAGGTGCAC
GTGCTGACGC GCGGCGACGC GCGCGGGCGG CTGGAGGAAG AGATGGACGG CGTCCACGTC
CATCGCGTCA GCGAGCCGAA GAAGCCGGCC GACCTCGACG AGTTCGTCAC CTGGATCGAG
CAGATGAACG GTGACATGGT GACGGCCGGC CTCGAATTGG GCGCGCGGAT GACGTTCGAC
CTCGTGCACG GCCACGACTG GCTCGTGGCG GCGGCCGGCG ACGAGCTGGC GCGCAGGCTG
CGCTGCCCGT GGGTCGTGAC GATCCACGCG ACCGAGTACG GCCGCCACCA GGGCTGGGTC
GACAAGCACC CGCAGTCGCA CATCCACGGC GTCGAGACGT GGATGGCGAA CAACGCCGAC
GCGGTCATCA CCTGCTCGCA CTACATGCGC GACCACGTCA GCGACATCTA CGGGCTCGAC
GACCAGCGCG TCGCCGTGAT CCCGAACGGG ATCGACCCGC TCGACCTGCA GCCGGTCGAG
GACCTCGACG CGCTGCGTGC CCGCTTCGCG CAACCGAGCG AGCGGCTTGT GCTGCTGGTC
GGCCGGCTCG TCTACGAGAA GGGCTTCCAG ATCGCGCTCG AAGCGCTCCC GGGGGTGATC
GAGCGGCTCG GCGACGTGCG CTTCCTCGTC GCCGGCTCGG GGACGGCCGA GACCGAGCTG
CGCGAGCAGG CGTCCGCCTT GGGCCTGCTC GACCACGGCA CCTTCCTCGG CTGGATCGGC
GACGACGTGC TGCACTCGCT CTACCGGATC GCCGACCTGA CGGTCGTGCC GAGCATCTAC
GAGCCGTTCG GGCTCGTGGC GCTGGAGGCG ATGGCGTCCG GCTGCCCGAC GATCGTCGCC
GACACCGGCG GTCTGCGCGA GGTCGTGCCG AACGAGCACG TCGGCCTGCG CTTCCGCTCG
CGCGACCCCG ACTCGCTCGC GTCGATGATC GAGCGGGTGC TGTCCGACGA GCCGCTGCGC
GAGCAGCTGA TCGCCGAGGC GAGCGAGCAC GTCCTCAGCT TCGACTGGGC CGACATCGCC
CGCCAGACCG CAGAGGTCTA CGGCGAGCTG CGGCGCGGCC TCGCCGTCTA G
 
Protein sequence
MAPVRILILS WEYPPLIEGG LARHVRKLSE QLVARDVEVH VLTRGDARGR LEEEMDGVHV 
HRVSEPKKPA DLDEFVTWIE QMNGDMVTAG LELGARMTFD LVHGHDWLVA AAGDELARRL
RCPWVVTIHA TEYGRHQGWV DKHPQSHIHG VETWMANNAD AVITCSHYMR DHVSDIYGLD
DQRVAVIPNG IDPLDLQPVE DLDALRARFA QPSERLVLLV GRLVYEKGFQ IALEALPGVI
ERLGDVRFLV AGSGTAETEL REQASALGLL DHGTFLGWIG DDVLHSLYRI ADLTVVPSIY
EPFGLVALEA MASGCPTIVA DTGGLREVVP NEHVGLRFRS RDPDSLASMI ERVLSDEPLR
EQLIAEASEH VLSFDWADIA RQTAEVYGEL RRGLAV
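
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that layout and translate nucleotides to amino acids. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and are not part of any database tool.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering;
# '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate in frame 1, ignoring any trailing partial codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCGCCCG TTCGCATCCT CATCCTCTCC"
# Last line of the gene sequence; it starts in frame because the
# preceding 19 lines hold 60 bases each (1140, a multiple of 3).
last_line = "CGCCAGACCG CAGAGGTCTA CGGCGAGCTG CGGCGCGGCC TCGCCGTCTA G"

print(translate(first_blocks))  # MAPVRILILS
print(translate(last_line))     # RQTAEVYGELRRGLAV*
```

Both fragments agree with the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content reported in the record.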