Gene Cwoe_0130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0130 
Symbol 
ID8730558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp131473 
End bp133101 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content70% 
IMG OID646500744 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003391941 
Protein GI284041601 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.870572 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGACC TGCGGCCCAG CCGACGGGAG CTGCTGCAAG GAGGCGGGGC GGCGGTCGGC 
CTCCTCGGCC TCGGCGGAGT CCTCTCCGCG TGCGGTGGAG GCGAGTCGAC GAGCAGAGCG
GGATCGAGCC AGGCCGCCGC GACCGGCGAG CGCGGCGGCG AGATCACCTG GGGCCTCGCC
GCCGATCCCG TCTTCATGGT CCCGTTCGGT GCGACGCTCG GCGCGACGCG CGAGGCGACC
GAGCCGATCT ACGAGTCGCT GCTGACGTGG GACCGCGACC TCAGAATCGT GCCGGCGCTC
GCCGCGTCCT ACAGCTCGCC GGACGACACG ACGTACGAGT TCGTCCTGCG CGACGGCGCG
AGATTCCACG ACGGCAGACC GGTCCGTGCG AGCGACGTCG TGTACTCGAT CGGCCTCCAG
CGCAGACCGC CGCCGCCCGG CACGCCCGAC ATGGCCGTGC ACGTGCCGGC GATCAAGGAC
GTCGTCGCCG TCGACGAGAG AACGGTCCGC ATCAGCATGG ACGGGCCGGA CGCGCGCCTG
CCCGGCTTCC TCGCCTGGGG CCGTCACTCC TCGATCGTGC CGGAGGGGAT GTACGACGAC
ATCGACCCGC GCACCGAGGC GAACGGGACC GGGCCGTTCA GACTCGTCGG CTACAAGCAG
AACGACCGCG TCACGTTCGA GGCCAACAGG GACTACTGGC AGTCCTCGAT GCCCGGCGTC
GACAGAATGA CGCTGCGCGT GATGACCGAC GAGCAGGGTC GCCTCGCCGC GCTGCGCGCC
GGCGATATCG ACGGCTGCTC GGTCTCGCCC GACCTCTCCG AGGTGCTCGC CGCCGATCCG
AACGTCGAGG TGCTGAAGGG CCTCGTCGCC GCGCACCGCG AGCTGCAGTT CACGATCAAG
ACCGGCGAGA GAAAGCCGTG GCACGACAAG CGCGTGCGAC AGGCGATCAA CCACGCGATC
GACCGCCAGG ACCTGATCGA GCGCGTCTAC GCCGGCAGCG CCGAGTTCTC CGGCGTCGTC
CCGCCCGGCT ACGGCGACTG GCCGCTCGCC GACGAGGAGC TGCGCAGCGA TCTGCTCGCG
CACGACCCCG AGAGAGCGAG AGCGCTGCTC GCTGCCGCCG GCTTCGGCGA CGGCTTCACG
ATCGAGCTGG AGGTCGCCTC GACGACGACC GACCTCGTCA AGACCGCCGA GATCCTGCAG
CAGCAGCTCG CGGACGTCGG CGTCGAGCTG AAGATCCGGC CGATCGAGCT GGCCGCCTTC
GCGAAGAACA ACGGCGAGGG CAGATTCGAC CTCCACCTCA CCTACCGCGG GATGCGCGGT
GACGTCGCCG GCTACGTCAG CGACTTCGAT CCCTCGCAGC CGCTGTACAG AGACGTCTGG
TTCCCCGGCG CGACCGACGT CGACCCCGAG CTCGGCAGGC TGATCAGACA GGGCGCGACG
ACGATCGCCG AGGCCGACCG GCGGCCGATC TACGAGCAGG TCCAGCGGCT CGCGCTGGCC
GAGGCGCTGC ACGTCCCGCT CTGCAACCCC TACAAGTTCC AGGCCGTCTC CAAGCGCGTC
TCCGGGATGT ACGTCGCCTA CACCGACTTC AACCCCGGAC TGGCGACCGC CCGCGTGACC
GCGAGCTGA
 
Protein sequence
MRDLRPSRRE LLQGGGAAVG LLGLGGVLSA CGGGESTSRA GSSQAAATGE RGGEITWGLA 
ADPVFMVPFG ATLGATREAT EPIYESLLTW DRDLRIVPAL AASYSSPDDT TYEFVLRDGA
RFHDGRPVRA SDVVYSIGLQ RRPPPPGTPD MAVHVPAIKD VVAVDERTVR ISMDGPDARL
PGFLAWGRHS SIVPEGMYDD IDPRTEANGT GPFRLVGYKQ NDRVTFEANR DYWQSSMPGV
DRMTLRVMTD EQGRLAALRA GDIDGCSVSP DLSEVLAADP NVEVLKGLVA AHRELQFTIK
TGERKPWHDK RVRQAINHAI DRQDLIERVY AGSAEFSGVV PPGYGDWPLA DEELRSDLLA
HDPERARALL AAAGFGDGFT IELEVASTTT DLVKTAEILQ QQLADVGVEL KIRPIELAAF
AKNNGEGRFD LHLTYRGMRG DVAGYVSDFD PSQPLYRDVW FPGATDVDPE LGRLIRQGAT
TIAEADRRPI YEQVQRLALA EALHVPLCNP YKFQAVSKRV SGMYVAYTDF NPGLATARVT
AS