Gene Cwoe_3983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3983 
Symbol 
ID8734441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4226841 
End bp4228211 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content70% 
IMG OID646504608 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003395775 
Protein GI284045435 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCAG ATCGTCAGAC GTCGCGCATG GACGCCGCCG GGGCCGCGCC CGGCGGAGGT 
CCGCGCGTCT CGCGACGCGC CTTCGCGCGG ATCGCCGCCT CCGGGCTCGC GGTCTCCGTC
GCCGCACCGC TGCTCGCCGC CTGCGGCGAC GACGGGGAGA GCAGCGCGGA CGGCAAGGTC
ACGCTGAAGT TCTGGAAGTA CGAGGACCCG GCCACGAAGT CCGTGCTCGA GCAGCTCGTC
GCGAAGTACA ACCGCGAGCA GCAGAACGTC AAGGTCGTGA TGCAGACCTT CCCGTTCGAC
CAGTACCTGG CCGAGAAGAT CACGACCGCG CTGTCGGCGG GCTCCGGGCC CGACGTCTTC
TGGGTCAGCG CAGCGACGCT GCTGAACTTC GCGCCGAAGC AGCTGCTGCT GCCGCTCGGC
GACACCTTCA CGCCGAAGGA GCAGAGCGAC TTCCTGCCGC AGAGCTTGAG AGGCATCACG
CTCAGAGGCG ACGTCTACGG CGTCCCGCAC GAGATGGGCG TCCAGGGCCT GCTCTACGAC
CAGCGCCTGA TGGAGAGACT GCGGCTCGAG CCCCCGAAGA CGTGGGACGA GCTGAAGGAG
GTCGCGGCCA AGATCAAGAC CGACACGCGC TGGGGCATCA TGCTGCCGAC CGCCCCCGAC
GTGTTCCAGA ACTTCATCTG GTGGCCGTTC CTGTGGATGG GCGGCGGCGA GGTCGTCAGC
GCCGACTACA GCCACGCGAC GATCGCCGAG CCTGCCGGCG TGCAGGCACT GGCGCTGTGG
GGCGACCTCG TGCGGGACGG GCTCGCGGCG CCGAAGTCGT CCGGTCCCTT CGGCGAGGAG
CTGGCGCAGG GCAAGGCCGG GATGGCGGCG CTCGGGATGT GGGTCGTCGG CAACTACCGC
ACGACGTACC CGAACGTCGC GCTCGGCGCG GCGCCGCTGC CGACGCCGAC CGCCGGCGGG
AGATCGCTCG CGGCGTTCGG CGGCTGGTAC ACCGCCGTCA GCGCGGCGAC GAAGCACGCC
GAGGAGGCGC GCAGATTCGC CGTCTGGCTG TTCGGCGAGA ACCCGGCCAA CGCCGTCGAG
CTGACGAAGG CGATGACGGT GCTCTCGCCG CGCAGATCGG TCACCGCGAC GCTCGAGACG
CTGCCCGCGT TCAGAAAGGC GCCGATCCCG GAGTTCACGC GGATCTGGCC GAGCACGCGT
GCGGAGCCGG CGTACCCGCC GGAGATCCAG ACCGCCGTCA CGAACGCGCT GCAGGCGGTC
ATGTTCAGCA AGGCGGAGCC CGAGCGGGCG GCGGAGGACG CCGCGAAGGC GATCGACAGC
TACCTCGCGA GTCCCGACGG GAGCCTGCTC AAGGAGCTCA TGGGCTCGTG A
 
Protein sequence
MSPDRQTSRM DAAGAAPGGG PRVSRRAFAR IAASGLAVSV AAPLLAACGD DGESSADGKV 
TLKFWKYEDP ATKSVLEQLV AKYNREQQNV KVVMQTFPFD QYLAEKITTA LSAGSGPDVF
WVSAATLLNF APKQLLLPLG DTFTPKEQSD FLPQSLRGIT LRGDVYGVPH EMGVQGLLYD
QRLMERLRLE PPKTWDELKE VAAKIKTDTR WGIMLPTAPD VFQNFIWWPF LWMGGGEVVS
ADYSHATIAE PAGVQALALW GDLVRDGLAA PKSSGPFGEE LAQGKAGMAA LGMWVVGNYR
TTYPNVALGA APLPTPTAGG RSLAAFGGWY TAVSAATKHA EEARRFAVWL FGENPANAVE
LTKAMTVLSP RRSVTATLET LPAFRKAPIP EFTRIWPSTR AEPAYPPEIQ TAVTNALQAV
MFSKAEPERA AEDAAKAIDS YLASPDGSLL KELMGS