Gene Cwoe_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4454 
Symbol 
ID8734916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4750222 
End bp4751622 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content68% 
IMG OID646505080 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003396243 
Protein GI284045903 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGCTGA GCAAATCATT CCGTGGCCGG GCAGCGGCCG GCCTCGCGGT GCTGGGTGCG 
TGCGCCCTGG CCGCCTGCGG CGGTCCCGGC ACGAGCGCAT CGACGTCGAC GGACTCCTCG
GCGTCGGCCG ACACGAGCGT TCCGAGACAG GACCTCACCG TCACGATGTG GGACCAGGAG
GCGCAGAACC CCATCAACCC CGGGCTCGAG GCCGCGATCT CCCAGTTCGA GAAGCTGCAT
CCGAACATCT CGATCAAGCG GGTGGCGCCG CCCGGGATCG GCAACGAGTC GACCGTCGAG
TCCACGACGA AGACGAAGCT GGCCGTGTCG GGGTCGAATC CCCCGACCCT CGTCGAGGGC
GACACGCTCC CCGGAGGGGT GCTCGCGCCG CTCATCCAGG CAGGGCTCAT GCGCGACATG
GGACCGTACG TCGACGCCTA CGGCTGGGAC CGGCGGTTCC GCTCGATCGG CGGCCTGACG
CTCGACCGCT ACTCCAACAA CACCAGACGC TGGGGAGAGG GCCCGGTGTA CGGGGTCCCG
TGGTACGGCT CGGAGATCGG CGTCTTCTAC AACCGCAGAC TGCTGGAGCA GATCGGCGTC
CCGCTGCCGA AGACCTTCGC GGACTTCGAG GCGTCGCTGG CTGCCGCCAA GCGCGCGGGC
ATCACGCCCA TCGTGCTCGG CAACTCCGAT CGCGACGGCG GCGGCTGGCT GTTCAACGAG
CTGCTCGCCG ACTTCCTCGG CGCACCCGAG ATGCTCAAAT GGGTCCTCAA CAAGCCGGGA
GCGACGATCA CCGGGCCTGA CGGGGTGCAA GCGGCTCAGA CGATGAAGGA CTGGGCCGAC
GCGGGCTACT TCACGAAGGG CTACGAAGGC ATCAGCCTCA CGCAGCAGGC GGCCCAGTTC
GGTCGCGGCG AGGGCCTGTA CATGATCGCC CTCCACGTGC TGGCACCCAC CAAGGACCCT
GACGGCTTCG GCTTCTTCTA CGTGCCCGAC CCCAACGACG CCGGCAGAAC GCCCGCAACC
GTGAACAACG CGATGTGGAG CTTCGGCATC TCGGCGAACG CGCCCCAGGA AGAGGCGAAC
GCCGCCGCTG CGTTCCTCGA CTACCTGAGC TCGCCGCAGA TGCAGCGCCA ACTCGCGCTC
AGACATGGGT CGCTCCCGGT CGTGCCGGTG GACGGGCATG TCGATCGCGG TGCGGTCTAC
GACGACCTGC TGCAGGCGTG GAACGAACAG CGGGCGTCCG GGACGGCGCT GCCCTTCCTC
GGCAGCGGCA CGCCGCTCAA CTACACCTAC ACGACCTCGA TCCAGGGCGT GCAGGACCTG
CTGGCCGGAC GCGTGAGCCC TGCGGAGCTG CTGGACAAGA TCGAGGAGCA GCAGCAGGCG
TATGTCAGAG CCGGCAAGTG A
 
Protein sequence
MVLSKSFRGR AAAGLAVLGA CALAACGGPG TSASTSTDSS ASADTSVPRQ DLTVTMWDQE 
AQNPINPGLE AAISQFEKLH PNISIKRVAP PGIGNESTVE STTKTKLAVS GSNPPTLVEG
DTLPGGVLAP LIQAGLMRDM GPYVDAYGWD RRFRSIGGLT LDRYSNNTRR WGEGPVYGVP
WYGSEIGVFY NRRLLEQIGV PLPKTFADFE ASLAAAKRAG ITPIVLGNSD RDGGGWLFNE
LLADFLGAPE MLKWVLNKPG ATITGPDGVQ AAQTMKDWAD AGYFTKGYEG ISLTQQAAQF
GRGEGLYMIA LHVLAPTKDP DGFGFFYVPD PNDAGRTPAT VNNAMWSFGI SANAPQEEAN
AAAAFLDYLS SPQMQRQLAL RHGSLPVVPV DGHVDRGAVY DDLLQAWNEQ RASGTALPFL
GSGTPLNYTY TTSIQGVQDL LAGRVSPAEL LDKIEEQQQA YVRAGK