Gene Cwoe_5018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5018 
Symbol 
ID8735484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5349206 
End bp5350786 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content69% 
IMG OID646505645 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003396804 
Protein GI284046464 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0327585 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACACC TCACCCGGCG TGAGTTCTCG GAGACCGGCA TCAGATACGG CGCGGCAGCG 
GGCCTGCTCG GAGGCGGCCT CGCGACCGTC CTCGCCGGAT GCGGCGGCGA CTCGTCCAGC
GACGGGTCGA CGAGCGCGGG CACGACGCCC GCCGACGGCT CCGGCGGGGG GAGCGGGGGG
ACGATCCGGA TCGGCAACGC GGAGCCGCCG ACGTCGGCCC AATGGGACCC GCACGCGGCG
TTCGGCCTCG CGGACTACCA GACCTGGTCG CTCGTCTACG ACACGCTGCT CGCGTACGAC
AGCGCCGGAG AGCTCGTCGG CCAGCTGGCG AAGTCGTGGA AGCGGCTGTC GCCGACGCGC
CTGCGGATCG TGATCCACAG AGACGTCCAC TTCAGCGACG GCAGCCCGCT CGGCGCCGAG
GACGTGAAGG CGTCGATCGA GCGCATCTCC GCGCCGCAGT CGGAGCTGGC GCTCGCGTCG
AAGCTGCCCG AAGGCGCGAA GGTCGAGGTC CGCGGCGAGC ACGAGCTGGA CATCGTCACG
CCCGAGCCGT TCGGCCCGCT GGAGGGCGCG CTCGTCGTCG TCTCGATCGT CTCGCGCAGA
GACGCGGCGA GACCGGAGGC GTTCAAGCGC CGCCCGCTCG GCAGCGGTCC GTACACGTTC
GTCGAGTACC GCAACAACAG CATCAGACTG AAGGCGAACC CCAGATACTG GCGCGGCAAG
CCGGGCTCCG ACGGCGTCGT GCTGTCCTAC GTCCAGGACC CGAGCGCGCG CATGAACGCG
CTGCTGACCG GCCAGATCGA CATCTACACG CGCGCCGACT CGATCGTGCT CGACGAGGTC
AGAGGCAACG ACGACTTCTA CGTCAACGAC ACCAGTCCGG CGTCGAACTT CTTCTACATC
CCGCAGTTCG ACACGGCGCT CAGAGACGTC CGCGTGCGGC AGGCGATCGC CTACGCGATC
CCGCGTCAGC AGATCGCCGA GAGCATCATG AGAATCTGCC CGCCGGCGCT CTCCTCGCTG
CCCGCGGCGT CGAAGGGCTT CAGACCGATG GAGCCGAGAT TCGACCTCGA CCTGGAGCGC
GCGAGATCGC TGCTGAAGGA GGCCGGCCAC GACGGCGGCC TGTCGATCAC GCTCGCCTCG
GCCAGCGTCT TCGCCCACCA GGAGCAGGTC GACCAGCTCG TGAAGGCGTC GCTGGAGCAG
GTCGGCATCA CCGTCGACAT CAAGAAGCTG GAGAGCGGCA CGTTCCGCTC GAACTTCTCG
CAGTACGCGC TGTCGATGAA CGCGCTCGAC ACGCCGGGCG ACCCGAACTT CATCTTCTCG
TTCTTCCGGC CGTCGATCGC CAGAGAGGTC CTGAAGTGGG ACTCGGCCGA CTTCATGCCG
CTGGTCGAGG CGCAGCGCCG CACGATCGGC GCCAGACGGC AGGCGACGAT CGACGCCGCC
GCGAGATACC TGTGGGAGAA CCAGATCCTC GTCTACCTCA CCGACGACAT CTGGTACACG
GTCGTCAACA GACGCGTCAG CGGCTACGAG CGCTCGACCG TCGAGGGCGA GCCGCTGCTG
TGGAGAGCGA AGGCGGCGTA G
 
Protein sequence
MKHLTRREFS ETGIRYGAAA GLLGGGLATV LAGCGGDSSS DGSTSAGTTP ADGSGGGSGG 
TIRIGNAEPP TSAQWDPHAA FGLADYQTWS LVYDTLLAYD SAGELVGQLA KSWKRLSPTR
LRIVIHRDVH FSDGSPLGAE DVKASIERIS APQSELALAS KLPEGAKVEV RGEHELDIVT
PEPFGPLEGA LVVVSIVSRR DAARPEAFKR RPLGSGPYTF VEYRNNSIRL KANPRYWRGK
PGSDGVVLSY VQDPSARMNA LLTGQIDIYT RADSIVLDEV RGNDDFYVND TSPASNFFYI
PQFDTALRDV RVRQAIAYAI PRQQIAESIM RICPPALSSL PAASKGFRPM EPRFDLDLER
ARSLLKEAGH DGGLSITLAS ASVFAHQEQV DQLVKASLEQ VGITVDIKKL ESGTFRSNFS
QYALSMNALD TPGDPNFIFS FFRPSIAREV LKWDSADFMP LVEAQRRTIG ARRQATIDAA
ARYLWENQIL VYLTDDIWYT VVNRRVSGYE RSTVEGEPLL WRAKAA