Gene Cwoe_2105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2105 
Symbol 
ID8732548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2206848 
End bp2208464 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content74% 
IMG OID646502723 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003393905 
Protein GI284043565 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0251373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGAGA GAGACCGCCG AACCTTGAAT CCGCACGCCA TCGACCGACG TGACTTCCTG 
CGCGCCGGCG CCGTGCTGGC CGCCGGCGCC GCCGGCGCCG CGGCGCTCGG CGCTGCGGGG
TGCGGCGGCC GCGGGGCGGG GGCGAGCGGC GCCGTGGCGA TCTCCTCTGA CGCGCTCGGG
CCGCGCGGCG GGACCGCGCG GCTGCTGCTC GGCGGCGGCG GGCCGCGGCT CGTGCTCGAC
CCCGCGACGC AGGTCAACGA GCCGGACGCG ATCGTCGACG GGCTGCTCTA CGACGGACTC
GTGCGCCTGC ACGACGACTG GCGGGTCGAG CCGCGGCTCG CGACCCGCTG GGAGTCCGAC
GCGGCGCAGC GCGTGTGGCG CTTCGAGCTG CGCGACGGGG TCACCTTCCA CGACGGCCGG
CCGCTGACGG CGAAGGACGT CGTCTACAGC CTCCGCCGTC TGCTGGACGA GCGGCTCGGC
TCCGCGGTCT ACCCGCGGCT CAACGGTGAG CTGAGGCCCG ACGGCGTCCG CGCCGCCGGC
TCCGGCGCCG TCGAGCTGCG GCTCACGCAG CCGGACGCCT TCCTGCCGGT CGCGCTCGGC
GCCCGCCACT GCAAGATCGT CCCGGCCGGC ACGACTGACT TCTCCCGCGC GATCGGGACG
GGACCGTTCC GCCTGCGCTC GCTCGACCAG TCGAAGCTCA GCTTCGAGCT GGAGCGCAAC
CCGGGCTTCT GGCAGGAGGG GCTGCCGCGC CTGGACCGGA TCGAGGGGAT GCTCGCCAAC
GACCAGGCGT CGCTCGTGCA GTCGGTCGCG TCCGGCCGCT TCCACTTCGG CGGCTTCATC
GACCCCTCGC TCGCGTCGAG CGCCGAGGCG AGCGGCGACG CGCGGCTGCT CGCGCACCGC
TCCGCGCTCT TCAACGACCT CGTCGCGGCG GCCGACTCCG AGCCGTTCAC GAACCCCGAC
GTGCGGACGG CGCTGAAGCT CGCGATCGAC CGTGAGCAGA TCCTGAGCCT CGCCTACAAG
GGCCACGGCA GCATCGCCCA CGACGTGCCG GTGCGGACCG CGGACCCGTT CTTCGCCGAG
GGGCTCGCGC ACCGCACTCG CGACGTCGAC GAGGCGCGTC GGCTGCTGCG CCGGGCCGGC
TACCCGAACG GCATCGACCT CGAGCTGCTG ACCGCTCCCG CCGGCGCCGC AATGGTCGAC
ATGGCGGTCG TGGCGAAGGA GAGCCTCGCC GAGGCCGGCA TCCGCGTCTC GGTCCAGCAG
CGACCGGCCG GCACCTACTA CGACGCCGTC TGGTTGAAGG AGGCGTTCTA CGTCGACACG
TGGGTGCTGC GCCACCCGCT CGACGCGATG GCCGTGATGT TCGAGAGCTC CGCCCCGTGG
AACGAGGCGA GACTGCGCTC GCCGCGGCTC GACGAGCTGC TGCGCGAGGC GCGCAGCACC
GGCGAGCGGT CCGAGCAGGC GCAACTGCTC GGCGCGGCCC AGACGCTCGT CGCCGACCAG
GCCGGCTTCG TCTGCCCGGC GTGGCTGGAC GAGCTGTACG TCGCCAAGCC CGAGCTGGCC
GGGGTCGGCT TCAACGCGAC CGACCTCGTC GACTTCCAGC GAGCGTCGCT GGGCTGA
 
Protein sequence
MSERDRRTLN PHAIDRRDFL RAGAVLAAGA AGAAALGAAG CGGRGAGASG AVAISSDALG 
PRGGTARLLL GGGGPRLVLD PATQVNEPDA IVDGLLYDGL VRLHDDWRVE PRLATRWESD
AAQRVWRFEL RDGVTFHDGR PLTAKDVVYS LRRLLDERLG SAVYPRLNGE LRPDGVRAAG
SGAVELRLTQ PDAFLPVALG ARHCKIVPAG TTDFSRAIGT GPFRLRSLDQ SKLSFELERN
PGFWQEGLPR LDRIEGMLAN DQASLVQSVA SGRFHFGGFI DPSLASSAEA SGDARLLAHR
SALFNDLVAA ADSEPFTNPD VRTALKLAID REQILSLAYK GHGSIAHDVP VRTADPFFAE
GLAHRTRDVD EARRLLRRAG YPNGIDLELL TAPAGAAMVD MAVVAKESLA EAGIRVSVQQ
RPAGTYYDAV WLKEAFYVDT WVLRHPLDAM AVMFESSAPW NEARLRSPRL DELLREARST
GERSEQAQLL GAAQTLVADQ AGFVCPAWLD ELYVAKPELA GVGFNATDLV DFQRASLG