Gene Cwoe_0567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0567 
Symbol 
ID8730995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp597123 
End bp598223 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content70% 
IMG OID646501180 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003392377 
Protein GI284042037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.149111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAGAG GCATCAGAGC GGGCGCCGCG ATCGCCGTCG CGGCGCTGCT GGCCGCCGGC 
TGCGGCAGCG GCGGAGGCGG CGAGCTGTCG GTCGCCGACC TGGGCAGCGG CCCGCCGAGA
GCGGGCACCG TCGAGCCCGG CGCGCTCGAC GGCAAGAAGC TCACGTTCGT CAGCTACGGC
GGCGACTCGC AGAGAGCGCA GATGGAGGTC CTGAGCGGAT TCGAGCAGGA GTCCGGCGCG
CAGCTGCTGG AGGACTCGCC GCCCGACTAC GCGAAGATCA AGGCGCAGGT CGAGTCCGAC
AACGTCACCT GGGACGTCGT CGTCGTGGAC GGGATCTGGG CAGCCGGCCA GTGCGGCAGG
CTGCTGGAGG ACCTCGACCC GGACGTGATC GACACCTCGC ACCTGCCGAG GGGCGTCGAG
GCGACGAGGT GCGCGATGCC GGGCAACCTC GACGGCAACG TCTTCGCGTA CGACGCGCAG
CGCTTCGCGG ACGACCCGCC GAGCTCGTGG GCGGACTTCT TCGACACCGC GAGATACCCG
GGCAAGCGCG CCGTCGACGC GAGCGACCCG AGCGTCACGC TGGAGATCGC GCTGCTCGCC
GACGGCGTCA GAGCCGACGA CCTCTATCCG ATCGACGTCG ACCGCGCGCT GCGCAAGCTC
GACACGATCC GCGACGACCT CGTCTTCTGG AGCTCGGGCG CCCAGCAGCA GCAGATGATG
ACCTCGCGCC AGATCGCGAT GGGCACGATG TGGTCCGGGC GCGTGTACTT CGCGCTGCAG
GCGGGCGCGC AGTTCGACGT CGTCCACGAC CAGCCGCTGC TGACGACGAC CACCTGGGTC
GTGCCGAGGG GCGCCCGCGA CCCGATCGGC TCGATGGCGA TGATCAACTG GTGGCTCGGC
GCCAGACAGG GCGCGCAGTA CACCGCGCTG ACCTCGTACC CCAGCGTCAA CGCCGACGCG
AGACCGGTGC TCGACGCCGA CGCGAGAAAG GTCGCGGTGA TGGACCCGCC GTTCACCGAC
CAGGTCGTCG TCAGCGACGA GTACTGGTCG AGAAACATCG GCAGGCTCAC CGACGTCTGG
ATCGACTGGG TCAATGGCTA G
 
Protein sequence
MRRGIRAGAA IAVAALLAAG CGSGGGGELS VADLGSGPPR AGTVEPGALD GKKLTFVSYG 
GDSQRAQMEV LSGFEQESGA QLLEDSPPDY AKIKAQVESD NVTWDVVVVD GIWAAGQCGR
LLEDLDPDVI DTSHLPRGVE ATRCAMPGNL DGNVFAYDAQ RFADDPPSSW ADFFDTARYP
GKRAVDASDP SVTLEIALLA DGVRADDLYP IDVDRALRKL DTIRDDLVFW SSGAQQQQMM
TSRQIAMGTM WSGRVYFALQ AGAQFDVVHD QPLLTTTTWV VPRGARDPIG SMAMINWWLG
ARQGAQYTAL TSYPSVNADA RPVLDADARK VAVMDPPFTD QVVVSDEYWS RNIGRLTDVW
IDWVNG