Gene Cwoe_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0789 
Symbol 
ID8731219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp826416 
End bp828338 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content75% 
IMG OID646501402 
ProductPHP domain protein 
Protein accessionYP_003392597 
Protein GI284042257 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0438] Glycosyltransferase
[COG0613] Predicted metal-dependent phosphoesterases (PHP family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0255666 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGAGC GCTTCGCGAT CGCCCAGGTC ACGCCGTACG CCTGGGAAGT GCCGCATGAC 
GTGAACCGTG CGGTCGCACG CGTGGCGGAC GAGCTGGCCG CGCGCGGCCA CCGCGTCCTG
ATCGTCGCCC CATCGCAGTC GGCCGCGCTC GTGCGCGAGT CGCGCAAGGC GATCCACGCG
GCGCGGCAGG ACCCCGCCGC GCTGCTGGAG GGGACCGGGG ACGGCTTCCC GCGCGTACTC
GGCGTCGGCG AGGTGCTGCC GTTCTCCCCT TCCCGCCGCA AGGCCGTCTC GCTGCCGCTC
GACGTCGCCC GCACGATCGA GCAGGCGCTC GCGATCGCGC CGCTCGACCT CGTCCACCTG
CACGAGCCGT TCGCGCCGAG CGCCGCCAGC GCCGCGCTGC GCCACTCGCG CGCGCTCAAC
GTCGGCTCCT TCCACGCACC GACCGAGCGG ATCCTGTCGA CGCAGGTGGC ACGCCGCGTC
GTCGAGCTGT TCCTCGGCCG GCTCGACGCG CGCATCGCCA GCTACGACGC GACGCGCGAC
CTGCTCGCCC GCTGGTTCCC CGCCGACTAC CGCGTGATCC TGCCGGGCGC CGACCCGCCG
CGCGCGGTGG CCGACGGCGA GACGGCGCGC GGCGACGGCC CGCCCGAGCT GGTGCTCGTC
TCCGACGAGG AGCGGCCGGC CGTGCGGACG TTCCTGCGCG CGCTGCGCCA GCTCCCGCTC
GACGTGCCGT GGCACGCGAC CGTGTGGTCC CCGCGCGGGA TCCAGCCCGC CGGTGCGCTG
CGCCGCGAGC TGCGCGACCG CGTCGCCTTC GCCGGCCCCG AGGACGTCGA CGAGGCGACG
CTGCTGGCAC GCGCTGACGT CGCCGTGCGG GCCTCCTCAG GCGCCGCGCC GGCGCCGAGC
GGCCTCCTCG GCGCGGTCGC CGCCGGCGCC GTCCCCGTCG TCGCGCGGCT GCCCGTCTAC
GAGGAGCTGG TCGGCGAGGG CGATCGCGGA CTGCTGTTCG AGCCCGGGGA CGTCGAGACG
CTGGCGGCGC AGCTCGGCCG GCTCGTGCGC GAGCCGGCGC TGCGCGAGCG GCTGCGGGCG
GGCGCGGAAG GGCTGCGCGA GCACCTCAGC TGGTCGCGCG TCGTCGACCA ACTGGAGCAG
GTCTACGGTG GCCTCGTCGC CAAGCGGCAC GACGGCCGCG GCGACCCCGT GCTGCGCGCG
AAGGTCGCGC AGCGGCGGCT GATCGACGTC GACCTCCACA TGCACACCGA CCACTCCGGC
GACTGCGCGA CGCCCGTGGA GGTGCTGCTC GCGACCGCCA AGGCGAGAGG GCTCGGCGCG
ATCGCCGTCA CCGACCACAA CGAGATATCG GGCGCGCTGG AGGCGCAGGC GAAGGCGAGC
GGCATCAAGG TGATCGTCGG CGAGGAGGTC AAGACCAAGG ACCAGGGCGA GGTGATCGGC
CTGTTCCTGA CGGAGCTGAT CCCGCGCGGC CTCTCGCTCG CGGCGACGAT CGCCGAGATC
AAGCGCCAGG GCGGCGTCGT CTACGTCCCG CACCCGTTCG ACCGGATGCA CGCGGTGCCC
GACTACGAGC ATCTGCTGGC AGTGCTCGAC GACGTCGACG CGATCGAGAT CTTCAACCCC
CGCATCGCGA TCCAGGAGTT CAACGAGGAG GCCGTCCGCT TCGCCGCGAA GTACCGCATC
CCGGCGGGCG CCGGCTCCGA CGCGCACGTC GCGCAGGGGC TCGGCTCGGT GCGGATCCGG
ATGCCCGACT TCGACGGCCC GCAGGAGTTC ATGGAGTCGC TGCGGGAGGC CGACGTGATC
CGCACGCCCG CCAGCCTGCT GTACGTCCAG GCGCTGAAGT TCCTGCAGAC GAAGGCGACG
CCGGCCCCGG CGCGCAAGGC CGCACGCGAC CGGCGCGTGC GGCGGGCGGT TCGCAAGTCC
TGA
 
Protein sequence
MSERFAIAQV TPYAWEVPHD VNRAVARVAD ELAARGHRVL IVAPSQSAAL VRESRKAIHA 
ARQDPAALLE GTGDGFPRVL GVGEVLPFSP SRRKAVSLPL DVARTIEQAL AIAPLDLVHL
HEPFAPSAAS AALRHSRALN VGSFHAPTER ILSTQVARRV VELFLGRLDA RIASYDATRD
LLARWFPADY RVILPGADPP RAVADGETAR GDGPPELVLV SDEERPAVRT FLRALRQLPL
DVPWHATVWS PRGIQPAGAL RRELRDRVAF AGPEDVDEAT LLARADVAVR ASSGAAPAPS
GLLGAVAAGA VPVVARLPVY EELVGEGDRG LLFEPGDVET LAAQLGRLVR EPALRERLRA
GAEGLREHLS WSRVVDQLEQ VYGGLVAKRH DGRGDPVLRA KVAQRRLIDV DLHMHTDHSG
DCATPVEVLL ATAKARGLGA IAVTDHNEIS GALEAQAKAS GIKVIVGEEV KTKDQGEVIG
LFLTELIPRG LSLAATIAEI KRQGGVVYVP HPFDRMHAVP DYEHLLAVLD DVDAIEIFNP
RIAIQEFNEE AVRFAAKYRI PAGAGSDAHV AQGLGSVRIR MPDFDGPQEF MESLREADVI
RTPASLLYVQ ALKFLQTKAT PAPARKAARD RRVRRAVRKS