Gene Cwoe_5706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5706 
Symbol 
ID8736182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp6107929 
End bp6109587 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content71% 
IMG OID646506333 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003397482 
Protein GI284047142 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.47815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGAGCA TTCGGAAGCG GCCGCTCGCG GCCATCGCGG CATGCGCGCT GCTGGCCGCC 
GGCACGACCG CCTGCGGCGG CGCGACGAGA TCGTCGTCCT CGACCCAAGG CGGCGGCACG
ACCGCCGGCG GCGGCGTCGC GCTGAGCGAC GGCACGCCGG CGCCGGCGGG CGACGTCGAC
AAGGTCACGT GGGCGGTCTA CGCCGAGCCC GCGTCGCTCG ACTGGGCGTT CGCGAACGAC
TTCCCGCCGC TGGAGATCGG CGCGAACGTC TGCGAGAGCC TGTCCGCGGT CACGCCCGAG
ATGGAGATCG TCCCCGCGCT CGCGACCGGC TGGAGAGCGC CGAACCCGAA GACGCTCGTC
TACACGTTGC GCGACGGCGT CAGATTCCAC AGCGGCGCGC CGCTGACCTC CGAGGACGTC
GCCTACAGCC TCGGCCGCAA CCTCGACAGA AGCGTCGGCT CCTACTACGC CGGCGCCTAC
GCGAACGTGA GCAAGATCGA GGCGACCGGC CCGAACGAGG TGACGATCAA GCTCGAGCGT
CCCGACGCGC AGCTGCCGAG CGCGCTCGCG ACGCCCGCCG GCCGGATCGA GAGCAAGGCG
TTCCTTGAGC AGAGAGGCAG AAGCTACGGC ACGCCCGAGG GCGGCATCGA CTGCACCGGG
CCGTTCAGCT TCGGCAGCTG GACGAAGGGC CAGTCGGTCA CGCTGGAGCG TTTCGATGGC
TACTGGGACG CCGAGCGCGT GCCGAAGATC GCGCAGCTGG AGGTCGACTT CATCGCCGAC
CCGGCCGCGC GCGTCAACGC GCTCGCCTCC GGCACGATCG ACGGCACCTA CCAGGTGCCG
ACCTCGGGCT TCAAGAGACT CGGCTCCTCG CCGACCGGGA CGCTCTCGTT CGGCCGCGCC
GCCGGCAGCT ACGTCGCGAT GGTGACGAGC CTCGACGGGC CGCTCAGAGA CGTCCGCATC
CGCAGAGCGC TGTCGCTCGC GATCAACCGC GACGGGATCA TCTCCAGCGT CCTCGACGGC
GCCGCCGAGC CGCTGAAGGC GCCGGTCGCG CCGGGCGCGT GGGGCTACGC GAAGGAGACG
TACAGAGCGG CCTACGACGC GCTGCCGGAG CCGTCCGGCT CGGTCGAGGA GGCGAAGAGA
CTGGTGCAGG AGGCGGGCGC TCCGAGCGAG CCGATCACCG TCGCGATCAC GCGCGACCGC
GAGGAGATGC CGACGATCGC GGCCGAGATG CAGCGCGCGG CGAGAGAGAT CGGGCTGCAG
CTGGAGATCA AGAGCCTCGC CGGCAACAGC TACAACGCGC TCTACTCCGA CGCGAAGGCG
CGCGACGGCG TCGACATGCT GTTCTCGCAG TGGGTGCCGG ACTTCCCCGA CCCGCTCCAG
CTCTATCAGT ACATGCGTGG CGACAACTTC TACGGCTACG CGAGATGGGA GGACCCCGAC
TTCATGCGCC TCACCTCCGA GGCGGCCGGC ACGGCCGACG AGGAGCAGCG CGCCAGACTG
ATCGCGGAGG CGCAGGAGCG CGCCGTAGAG GCGCAGATCT GGATCCCGCT CTACACGCCG
TACAACCCGG TCTTCCTCAA CAAGCGGATC ACCGGCGCGC CGACGAGCGC GGTCAACCTG
ACCTACTCGT GGGCGGCCGA CCTGGGAGCG ACGGGGTAG
 
Protein sequence
MESIRKRPLA AIAACALLAA GTTACGGATR SSSSTQGGGT TAGGGVALSD GTPAPAGDVD 
KVTWAVYAEP ASLDWAFAND FPPLEIGANV CESLSAVTPE MEIVPALATG WRAPNPKTLV
YTLRDGVRFH SGAPLTSEDV AYSLGRNLDR SVGSYYAGAY ANVSKIEATG PNEVTIKLER
PDAQLPSALA TPAGRIESKA FLEQRGRSYG TPEGGIDCTG PFSFGSWTKG QSVTLERFDG
YWDAERVPKI AQLEVDFIAD PAARVNALAS GTIDGTYQVP TSGFKRLGSS PTGTLSFGRA
AGSYVAMVTS LDGPLRDVRI RRALSLAINR DGIISSVLDG AAEPLKAPVA PGAWGYAKET
YRAAYDALPE PSGSVEEAKR LVQEAGAPSE PITVAITRDR EEMPTIAAEM QRAAREIGLQ
LEIKSLAGNS YNALYSDAKA RDGVDMLFSQ WVPDFPDPLQ LYQYMRGDNF YGYARWEDPD
FMRLTSEAAG TADEEQRARL IAEAQERAVE AQIWIPLYTP YNPVFLNKRI TGAPTSAVNL
TYSWAADLGA TG