Gene Cwoe_2236 details

Gene Information

Locus tag: Cwoe_2236
Symbol:
ID: 8732679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2352865
End bp: 2354508
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 70%
IMG OID: 646502854
Product: extracellular solute-binding protein family 5
Protein accession: YP_003394036
Protein GI: 284043696
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
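
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the record states the gene is not spliced).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields.
START_BP = 2352865
END_BP = 2354508

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1644
print(protein_length_aa(length))  # 547
```

Both results agree with the Gene Length and Protein Length fields listed in the record.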


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0669965
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.364829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGCAC ACTCGTTCCG GGCCGTCGCG GGCGCCGTCG CAGTCGGCGT CGCGCTGCTT 
GGCGCCGGCT GCGGCCGATC CAGCGAGGAC GCCGGCGGCA CGACGGCCGG CGTGAGCAGA
TCGGCTCCGC TCTCGGCCAC GACGCCCGCC GGGTCCGCCG AGGTCGACAG CGCGACGTGG
GCGATGTACC GCGACACGAT CACGGTCGAC CCGATCTTCG CCGGTGACTA TCCCGAGCGT
CAGGTCGTCG CGCTGATGTG CGAGTCGCTG CTGCGCCAGC AGCCGGACGG CACGACGGCG
CCGGGCCTGG CGAGACTCTC CTACCGCGAC CCGAGAACGG TCGTGCTCAC GCTCGCCGAC
GGCGTCAGAT TCTGGGACGG CAGCCCGCTG ACCCCGGCTG ACGTCGTCTA CAGCATCGAC
CGCAACCGCG ACCCGAGAGT CGGCGGGTAC TGGGCGAGCA ACTTCGGTAG CGTCGACACT
GTCACCGTGA CTGGCGAGCA TGAGGTGACA CTCAAGCTCA GACGGCCGGA CTACTGGCTC
GAGGGCGTGC TCTCGTTCAT GGCGGGAGTG GTGGTGAAGA AGTCGTACGC GCAGGAGAGA
GGCAAGGACT ACGGCACGCC GAGCGGCGGC GCGATGTGCA CCGGCTCCTA CAGACTCGGC
GCCTGGAGAA CCGGCGGCGC GGTCCAGCTC GTGCGCAACG ACGACTACTG GAACTCCGGC
GTGAGACCGC ACGTGCGAGA GCTCAGCTTC AAGGGAGTGC CCGACCACTC CGCCCTCACC
GCCGGCCTGC TGACCGGCGA GATCGACGGC ACCTACCCGC TCGGGCTCTC GACGCTCGAC
CAGCTGCGCC AGAGCGACGC GGTCGAGGTC TACGAGGGGC CGTCGTACAT GGTCGGCGCG
ATGATCCTCA ACCTCGACGG CCCGCTCGGC GACGTGCGCG TGCGCCAGGC GCTGTCGTTG
GCGCTCGACC GCCAGGGCAT CGTCGCGACG ACCTTCAAGG GCACCGCCGA GCCCTCGCGT
GCGCTCGCCA GCCCCGGCAC CTGGGGCTAC GCGAAGGACG TCTTCAGCGC CGCGTGGGAC
GCGCTCCCCG CGCCTGAGCC CGACCTCGAC GCGGCCAGAA GACTGGTCGA GGAGGCCGGC
GCGAGCGGCA GAGAGATCAC GATCGCGACG TCGAGCGAGC TGCAGAACAT CGACACCGAC
GCGAACGCCT ACCGCACCGC GGCCGAGGCG ATCGGGCTGA GAGTCAAGCT GAAGTCGAGC
CCGGCGGCCG TCTACTCGAA CCTCTTCGTC GACGCCGACG CGCGCAAGCA GGTCGACGCG
TTCGCGACGA TGAACTACGC CAACTGGGGC GACCCGGCGT CGCTCTACGC GCCGCTGACG
TTCGCCGACG GCAGCCAGAA CTACTACGGC TACAGATCCT CCGCCGCGAG CGCCAAGCTG
GAGCAGGCGC GCGCCACCGC CGATCCGCAG GAGCGCGCGC GGCTCGTCAC CGAGGCGCAG
CAGACGATCA CCGAGGAGCT GCCCTGGATC GCGACCGTCT CGCCGCACAC GGTGCTCGTG
ATGAGCTCGA AGCTGACCGG CGCGCCGGCC TCCTCGGTCT ACCTGTCGTC CCCCTGGGCC
GACACGCTCG GCGGGAGAGG GTAG
 
Protein sequence
MRAHSFRAVA GAVAVGVALL GAGCGRSSED AGGTTAGVSR SAPLSATTPA GSAEVDSATW 
AMYRDTITVD PIFAGDYPER QVVALMCESL LRQQPDGTTA PGLARLSYRD PRTVVLTLAD
GVRFWDGSPL TPADVVYSID RNRDPRVGGY WASNFGSVDT VTVTGEHEVT LKLRRPDYWL
EGVLSFMAGV VVKKSYAQER GKDYGTPSGG AMCTGSYRLG AWRTGGAVQL VRNDDYWNSG
VRPHVRELSF KGVPDHSALT AGLLTGEIDG TYPLGLSTLD QLRQSDAVEV YEGPSYMVGA
MILNLDGPLG DVRVRQALSL ALDRQGIVAT TFKGTAEPSR ALASPGTWGY AKDVFSAAWD
ALPAPEPDLD AARRLVEEAG ASGREITIAT SSELQNIDTD ANAYRTAAEA IGLRVKLKSS
PAAVYSNLFV DADARKQVDA FATMNYANWG DPASLYAPLT FADGSQNYYG YRSSAASAKL
EQARATADPQ ERARLVTEAQ QTITEELPWI ATVSPHTVLV MSSKLTGAPA SSVYLSSPWA
DTLGGRG
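
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. The code below is an illustration only, with hypothetical helper names. It builds the codon table from the amino-acid string NCBI uses for the standard code; translation table 11 assigns the same amino acids to each codon and differs in its permitted start codons. The sketch does not handle alternative start codons (for example GTG or TTG read as M at the first position), which is not an issue here because the gene begins with ATG. It also strips the spaces and line breaks used in the 10-base block layout.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout, standard code;
# table 11 uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines of the block layout and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAGGGCAC ACTCGTTCCG GGCCGTCGCG"))  # MRAHSFRAVA
# Final line of the gene sequence, which ends with the TAG stop codon.
print(translate("GACACGCTCG GCGGGAGAGG GTAG"))        # DTLGGRG
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the GC content field.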