Gene Cwoe_4004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4004 
Symbol 
ID8734462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4253397 
End bp4254596 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID646504629 
ProductABC-type sugar transport system periplasmic component-like protein 
Protein accessionYP_003395796 
Protein GI284045456 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGGA CGGTGACACT GGCGGTCGCC TGCGTGCTGG CGCTCGGCCT CGCGGCCTGC 
GGCAGCAGCA GCGGCGGCGG CGACGACGCG AGCACCGGGA CGACGGGCGG CGGCGCCCCC
GCGACGGCCG CCGGGGCCGA CGCGCAGGCG GCGATCGACA GAGCGCTGAG AGGCTCCTTC
GCGCTGCCGC CCGGTGGCGG GCCCAGAGCG CAGAGAGACA AGAAGATCTG GTTCATCCCC
GTCTCATCCG AGACGTACGA CTACCGCAGA CCCGGATCGC TGTTCGACGC GGCCGGCAAG
CTCGGCTGGG ACGTCACGCA GTTCGACGGC AGATACAGCC CCGACACGAT CGTCAGCGGC
ATCCGTCAGG CGATCGCCGA CAGAGCCGAC GGGGTGATCC TCTACATCGT CGACTGCCCG
AACGTGAAGG CGGCGCTGCA GGAGGCGAGA GCGGCCGGCG TCAAGATCGT CGCGGCTCAG
GGCTTCGACT GCAGCGACGT GAGAGCGAGC GAGCCGGCGC TGTTCGACGC GACCGTCCGC
TACGCCGGCT CCGGCGACGC GTCCAAGCCG ATGCCGTTCC CCGACTTCAT ACGCGATGAG
TGGGGCACCG CGCAGGCGCT CGCGGTCATC AACGGCACCG GCGGCAAGGC GAAGCTGATC
GACGTCTACG AGTCGGACCT GCTCGTCACC GTCGAGCAGG ACAAGGGCGT GCGCGCCGGG
CTCAGAAGGT ACTGCCCGGA CTGCGAGGTC GTCGACACCG TCGAGTTCAC CGGCGCCGAG
ATCGGCGCGC CGCTGCAGCA GAAGATCGAG CAGTCGCTGA CGCGCCACCC GGAGGCCAAC
GCGATCATCT CGCCGTACGA CTCGGTCACC AAGATCACGG CCGCCGCCGT CCGCGCGTCC
GGGCGCACCG GCCAGATCTA CAGCGTCGGA TCCGAGGGCG ACGCCGAGGT GATGGACGCC
GTCCGCGACG GCGGCAAGGG CGTCGACGCC GGCGTCGGGC TGGCACCCGA CTGGGAGCTG
CACGGGATGC TCGACGCGAT GAACCGCCTG CTCGGCGGCG AGAGAGAGGG CGACGGCTTC
CCGACCGGCA ACGGCACGCA GATCATCGAC AGAGAGCACA ACCTGCCCGA GCCCGGCGAG
CGCTACGCGC CGCCGCTCGA CTTCCGCAGA GCGTACTACG ACGCCTGGGG AGTGAATTGA
 
Protein sequence
MSRTVTLAVA CVLALGLAAC GSSSGGGDDA STGTTGGGAP ATAAGADAQA AIDRALRGSF 
ALPPGGGPRA QRDKKIWFIP VSSETYDYRR PGSLFDAAGK LGWDVTQFDG RYSPDTIVSG
IRQAIADRAD GVILYIVDCP NVKAALQEAR AAGVKIVAAQ GFDCSDVRAS EPALFDATVR
YAGSGDASKP MPFPDFIRDE WGTAQALAVI NGTGGKAKLI DVYESDLLVT VEQDKGVRAG
LRRYCPDCEV VDTVEFTGAE IGAPLQQKIE QSLTRHPEAN AIISPYDSVT KITAAAVRAS
GRTGQIYSVG SEGDAEVMDA VRDGGKGVDA GVGLAPDWEL HGMLDAMNRL LGGEREGDGF
PTGNGTQIID REHNLPEPGE RYAPPLDFRR AYYDAWGVN