Gene Cwoe_5711 details

Gene Information

Locus tag: Cwoe_5711
Symbol:
ID: 8736187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 6113252
End bp: 6114904
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 70%
IMG OID: 646506338
Product: extracellular solute-binding protein family 5
Protein accession: YP_003397487
Protein GI: 284047147
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
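
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function name `expected_lengths` is illustrative only.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = expected_lengths(6113252, 6114904)
print(gene_len, protein_len)  # 1653 550
```

Under these assumptions the result agrees with the listed gene length of 1653 bp and protein length of 550 aa.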


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.789808
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGTC GTACCACCAC CCTTTCCCTG GCCACCGCGG CCGTCGCCGC GGCAACCCTC 
TTCAGCGCCT GCGGCGGCGC GACCGACAGC GGCTCCGGCG ACGGCGCCAG AGCCGGCCCG
CCGGCCGGCG CCGAGCCGAC CGTCACGACC CCCGCCGCGA GCGGCACGAC CGACCGCGTC
ACGTGGGCGC TGACCGCCGA GCCCGCCTCG CTCGACTGGG TCCTCAACGC CGACTTCTTC
GCCGGCCAGG TGCTCGCGAA CGTCTGCGAC GGGCTCGTCC GCCAGGCGCC CGACTTCTCG
CTCCAGCCCG CGCTCGCCGC GAGATTCTCG AACCCGACGC CGACGACCTG GGTCTACGAG
ATCCGCGACG GCGCGACCTT CCACGGCGGC GCGCCGCTGA CCGCGAACGA CGTCGTCTTC
AGCCTGAAGC GGAACATCGA CCCGAAGGTC GGCTCGTTCT GGGGCCAGGC CTTCGCGAAC
GTCAAGACGA TCGCCAAGAC CGGTCCGAGC GAGGTGACCG TCACGCTCAG ACGACCCGAC
TCGATGTTCA ACGCGTACAT GTCGACGCCC GCCGGGATCA TCGGCAGCGA GAGAACGGTC
AAAGCCGAGG GCAAGTCGTA CGGCACGCCT GAGGGCAGCG TCGACTGCGT CGGCCCCTAC
AAGCTCGCGA AGTGGGAGAA GGGCCAGGAG ATCGCCCTCG CCGCCGACGA CGCCTACTGG
GACACGGCGC TGAAGCCGAA GGCGGGCGAG TTCGTCTTCC AGATCATCCG CGACCCCGCC
GCGCGCACCA ACGCGCTGCT GTCCGGCACG GTCGACGGCT CGTGGTTCGT GCCGCCGTCC
GCGCTCGCGA GACTGAACGG CTCGGCGACC GGCAGAGTCT TCTACGGCCC CTCGACGCAG
GGCTTCAACG CGATCGTCCT GAACACCGAC GGGCCGCTCA GAGACGTCCG CATCCGTCAG
GCGCTGTCGA TGGCGATCGA CCGCGAGGGG ATCGTCAGAT CGGTCCTCGC CGGCGCCGCG
CAGCCGTCGC GGGCGCCCGC CGTCCCCGGC ACCTGGGGCT ACGCGAAGGA GACGTTCAGA
AGCGCGTGGG ACGGCCTCGA GGTCACGCAG CGCGACGTCG ACGCGGCCAG AAGACTGGTG
CAGGAGGCCG GCGCGCCGAA GCAGCCGATC ACGATCTCCG TCACCTCGCG TGACGCCGAG
GTCCCCGTCA TCGGCGCCGC GATCCAGGCG GCCGGCGAGC AGATCGGGCT GAAGGTCGTC
CAGAGACAGA TCCCGCCGGA CCAGTACGAC GCCGTCTACA CGAGCGAGGA CGCGCGCAAG
GGGATCGACC TCTACCTGAC CGCGTGGGGC ACCGACTTCG CCGACCCGCT CCAGATCTAC
GAGTACTTCA AGAGCGGCAA CTTCTACAAC TTCGCCGGCT TCTCGGACCC GAGACTCGAC
GCGCTGCTGA ACGACGCGTC GCGCACGACC GACGAGCAGA GAAAGGCAGA GCTGGTGACC
GGCGCGCAGA GAATCGTCGT CGACGAGCTG CTGTGGATCC CGCTCTACGC GCCCTACAAC
ACGCTCTTCA TGAACAAGCG GATCACGGGG GCGCCGGCGA GCTACGTCCA GCTCCACTAC
CCGTGGGCCG CCGCGATCGG CTCCGCCGGC TGA
 
Protein sequence
MTRRTTTLSL ATAAVAAATL FSACGGATDS GSGDGARAGP PAGAEPTVTT PAASGTTDRV 
TWALTAEPAS LDWVLNADFF AGQVLANVCD GLVRQAPDFS LQPALAARFS NPTPTTWVYE
IRDGATFHGG APLTANDVVF SLKRNIDPKV GSFWGQAFAN VKTIAKTGPS EVTVTLRRPD
SMFNAYMSTP AGIIGSERTV KAEGKSYGTP EGSVDCVGPY KLAKWEKGQE IALAADDAYW
DTALKPKAGE FVFQIIRDPA ARTNALLSGT VDGSWFVPPS ALARLNGSAT GRVFYGPSTQ
GFNAIVLNTD GPLRDVRIRQ ALSMAIDREG IVRSVLAGAA QPSRAPAVPG TWGYAKETFR
SAWDGLEVTQ RDVDAARRLV QEAGAPKQPI TISVTSRDAE VPVIGAAIQA AGEQIGLKVV
QRQIPPDQYD AVYTSEDARK GIDLYLTAWG TDFADPLQIY EYFKSGNFYN FAGFSDPRLD
ALLNDASRTT DEQRKAELVT GAQRIVVDEL LWIPLYAPYN TLFMNKRITG APASYVQLHY
PWAAAIGSAG
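
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table by hand instead of relying on a bioinformatics library, and it uses the standard codon assignments, which translation table 11 shares for sense codons (table 11 differs mainly in the alternative start codons it permits, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break  # stop codon: end of the coding sequence
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence listed above.
print(translate("ATGACCCGTC GTACCACCAC CCTTTCCCTG"))         # MTRRTTTLSL
print(translate("CCGTGGGCCG CCGCGATCGG CTCCGCCGGC TGA"))     # PWAAAIGSAG
```

The two fragments reproduce the first and last ten residues of the protein sequence; passing the full gene sequence to `translate` is the natural way to compare against all 550 residues.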