Gene Cwoe_2781 details

Gene Information

Locus tag: Cwoe_2781
Symbol:
ID: 8733224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2968186
End bp: 2969745
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 70%
IMG OID: 646503393
Product: extracellular solute-binding protein family 5
Protein accession: YP_003394575
Protein GI: 284044235
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
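
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue; the variable names are illustrative only and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2968186
end_bp = 2969745

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1560 519
```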


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.187703
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0364487
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGAGGT TGAGCAAGTC GGCCACCGCC CTGGCGGTGC TGGTCGCGGC GTGGTCTTTG 
GCCGCATGCG GAGGCGGGGC GAAGGTAGGG GACTCGACGT CGGACGGCGG TGGCGGGGCG
TCGACCGGCG CGTCGTCGAG CGGAGGGACG TTGACGGTCG GCCTCGACTC CGATCCGGCG
TCGCTCGACC CGACCGGCGA CACCGGCTAC GCGGGCTCGC TCGTGACGCC GCAGATCTTC
GAGACGCTCG TCGTCGCGGA CGACGACGGC ACGATCGGCC CGGGCCTCGC CGAGAGATGG
ACGGTCTCGA GAGACGGCCG GACCTACACG CTGACGCTGC GCAAGGGCGT GAGATTCCAC
GACGGCACGC CGCTCGACGC GAGAGCGGTG GTCGCGAGCC TCAGACGCAG CGCCGGAAGA
GCGTCGCCGT GGGCGGCCGA CCTCGCTCCG ATCACAGCGA TGAAGGCGAC CGGCGAGGAC
ACCGTCGTGC TGACGCTCGA CAGACCGAAC GCGCCGCTGC TGTCGACGCT CGCCGACAAG
CCGGGCATGA TCGCCTCGCC GACGGCGGTC GAGCAGGCGG GCAGACGGTT CGGGTCGCAG
CCGGTCGGAA CCGGTCCGTT CGCGTTCGAC CACTGGACGC GCAACCAGGA GCTGATGCTG
AGACGGAACC CCGACTACTG GGACGCCGGC AAGCCGAAGC TCGACGCCGT CGTCTTCAAG
CCGCTGCCGG ATCCGACGCA GAAGGTCACC AACCTCGTCG CCGGCCAGGT GCAGACCGTC
GACTACGTGC CGCCGGAGCT GATATCTCGC GTCGAGGGCG CGTCGAACCT CGAGCTGGAG
CAGGGCCCCG GACCGTACAA CTCGGTCGTC TACGTGCCGA TGAACGCGGC GCGGCCGCCG
CTCGACGACG CGAACGTCCG CCAGGCCGTC TCGCTCGCGA TCGACCGCGA CTCGATCGTC
AGAAACGTCG CCTTCGGAGC CGGCACGCCC GCGCGCTCGA TGCTCTCGCC GACCTCGTGG
GGCTACAGCG ACGAGATTCC GGCGATTCCG TACGACCCTG CCAGAGCGAG AACGCTGCTG
GGCGGGAGAG AGGTGAAGCT CGAGCTGCAG GTGCCGCCGA CCTACACGCA GGCCGCGCAG
GTGATGAAGC AGAACCTGGC CGAGGCCGGG ATCGACGTGA CGCTGCGGCG GATGGACTGG
GGCCAGCTGA TCGACGGCTT CTACAAGGGC GACTTCGACA TGCAGGTGCA GGACCTGCTC
GGGATGCAGC GCTCCGACCC CGACGGCGCG CTCAGCAGCT TCTACGCGCC GGACGGCTCC
AACAACGGCG CCGGCTTCTC CGATCCGCAG ATCACCGCGC TGCTCGACAG AGCCCGCTCG
GGCGGCGACG AGGCGCAGCG CAGACCCGAG TACGTCGAGA TCCAGCAGCT CGCGCAGGAG
CAGAGCCCGT ACGCGCCGGT GTACATCCCC AACCAGGTGC GGGCGTGGGA CAGCAAGGTG
CAGGGACTCG GCCTCAGCAA CGACGGCGTC CTGCACCTGA CCGACGTCAC GATCGGCTGA
 
Protein sequence
MVRLSKSATA LAVLVAAWSL AACGGGAKVG DSTSDGGGGA STGASSSGGT LTVGLDSDPA 
SLDPTGDTGY AGSLVTPQIF ETLVVADDDG TIGPGLAERW TVSRDGRTYT LTLRKGVRFH
DGTPLDARAV VASLRRSAGR ASPWAADLAP ITAMKATGED TVVLTLDRPN APLLSTLADK
PGMIASPTAV EQAGRRFGSQ PVGTGPFAFD HWTRNQELML RRNPDYWDAG KPKLDAVVFK
PLPDPTQKVT NLVAGQVQTV DYVPPELISR VEGASNLELE QGPGPYNSVV YVPMNAARPP
LDDANVRQAV SLAIDRDSIV RNVAFGAGTP ARSMLSPTSW GYSDEIPAIP YDPARARTLL
GGREVKLELQ VPPTYTQAAQ VMKQNLAEAG IDVTLRRMDW GQLIDGFYKG DFDMQVQDLL
GMQRSDPDGA LSSFYAPDGS NNGAGFSDPQ ITALLDRARS GGDEAQRRPE YVEIQQLAQE
QSPYAPVYIP NQVRAWDSKV QGLGLSNDGV LHLTDVTIG
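
The relationship between the gene sequence, the GC content, and the protein sequence can be checked with a small script. The Python sketch below is one possible way to do this, not the method used to produce this record. It assumes the input is a complete coding sequence read 5' to 3', uses the standard codon assignments (which translation table 11 shares with table 1), and treats the listed alternative initiation codons as methionine when they appear in the first position. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: initiation codons recognised under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # an initiation codon is read as methionine
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence above.
prefix = "GTGGTGAGGT TGAGCAAGTC GGCCACCGCC"
print(translate(prefix))  # MVRLSKSATA, the first ten residues of the protein
```

Passing the full gene sequence to `gc_content` and `translate` allows the reported GC content and protein sequence to be compared against the computed values.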