Gene Cwoe_5088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5088 
Symbol 
ID8735554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5438577 
End bp5440238 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content68% 
IMG OID646505713 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003396872 
Protein GI284046532 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.416607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTTC GACGACAGAC CGCTCGCCGC CCGTTCCGCG GCCTCGCGAC CGCCGCGTCG 
GTCGCCCTGA TGGCGGCGAC CCTCGCCGCC TGCGGCAGCA GCGACAGCGG CAACAGCAGC
AGCACCGGCG ACGCCACCAG CAGAGGGGCG AACGTCGACT CGGGTCGCTG GGGAGCGGTC
ATCAACACCG GCGGCAGACC CGTCAGAGGC GGGATCCTGC GCGTCAACCA GGATGACGCG
CCCGCCGGCA TCAGCCCGCT GTACCTGCTG ACCGACCCGA GAAACGACAC GATCCAGGTC
GTCATGCAGG TCTTCGACCA GCTGACCGAG CTGCGGCCCG GCTCGATCGA TCCGCAGCCC
GGGCTGGCGG AGAGCTGGGA GGTCAGCCCC GACGGCAAGA CCTACACGTT CAGATTGCGC
GACGCGAAGT TCTCCGACGG CGCGCCGGTG ACCTCCGGCG ACGTGCGCTA CTCGCTCGAC
CGCGTGCGCG CGAGAGGGTC GTTCTACGTC GACCTCTACG CCAGCATCGC GACGATCGAG
ACGCCGGACC CGTCGACCGT CGTGCTGAGA CTGAAGCAGC CGACGCCGGC GATGCTCTCC
TACCTGTCGT TCGCCGGCGC ATCGGTCGTG CCGGAGAGAC TCGTGCGCGC CGACGAGAGA
GGGTTCAACC GCAGACCGAT CGGCAGCGGC CCGTTCGTCG TCAGAAGCTG GAAGCCCGAC
CAGGCGATCG AGCTGACGCG CAACGACAGC TACTGGCAGA GAGGCCTGCC GTACCTCGAC
GGGGCGGTCC TGACCTCGGT GCCCGACGAC AACACGCGCG TGCTCAACGT GCAGTCGGGC
GAGGCGCAGG TCGCCGACTT CGTGCCGTTC GCGCAGATAG ACGCGATCGA CAAGGCGGGC
AAGGCGAGAG TGCTGATCGG ACCGGGCGCC GACACGACCG CGATCTGGGT CAACAACTCC
AGAAGACCGT ACGACGAGCG CGAGGTCCGC CAGGCGCTGA TGTACGCGAC GCCGGTCGAG
TCGATCATCG ACGTCGTCTT CCACGGCAAG TCGCCGCAGG CGAACACGAT CATCCCGAAG
CTGGAGTACT GGACCGACCA GGCGAGAGCC TATCCCTACG ATCTCGAGAG AGCGAGAGAG
CTGCTGACGA GATCGTCGGT CCCGGACGGC TTCACCGCGA CGATCCAGAT CAACGCCGAC
GACCAGGCGG CGAGCCAGAT CGTGCAGATC CTCGAGCAGG CGTGGGCCAG AATCGGCGTC
AGACTCGTGC GCGACCAGGC CGACGCGGCG ACCGTCGCGG AGAAGTTCTA CGGCGGCAGA
TACGAGCTGA ACCTCGTTCG CCCGGGCGCC TTCACCAGCG ACGTGCCGGT CGACGACCAG
TTCGCCGAGC TGCAGTTCAA CTCGCCCGCG ACGGGCAACC TGTTCACGTT CTCCAGACCG
GCGCAGGCGC GCGACTACGC GCGCAGAGCG GTCGTCGAGA CAGACCAGCA GAAGCGCAGA
GAGCTGTTCG CGCAGATGCA CGTCGCCTCG ATGGAGGAGC TGCCGACGCT GCCGCTCGTC
TACACGCCGA ACCGCGCCGC GGTCGCGAAC GAGGTGCGTG ACTTCAACTA CATGCTGACC
GGCTACTGGC GGCTGGAAAG CGCCTGGCTG GAGCAGCCGT GA
 
Protein sequence
MDLRRQTARR PFRGLATAAS VALMAATLAA CGSSDSGNSS STGDATSRGA NVDSGRWGAV 
INTGGRPVRG GILRVNQDDA PAGISPLYLL TDPRNDTIQV VMQVFDQLTE LRPGSIDPQP
GLAESWEVSP DGKTYTFRLR DAKFSDGAPV TSGDVRYSLD RVRARGSFYV DLYASIATIE
TPDPSTVVLR LKQPTPAMLS YLSFAGASVV PERLVRADER GFNRRPIGSG PFVVRSWKPD
QAIELTRNDS YWQRGLPYLD GAVLTSVPDD NTRVLNVQSG EAQVADFVPF AQIDAIDKAG
KARVLIGPGA DTTAIWVNNS RRPYDEREVR QALMYATPVE SIIDVVFHGK SPQANTIIPK
LEYWTDQARA YPYDLERARE LLTRSSVPDG FTATIQINAD DQAASQIVQI LEQAWARIGV
RLVRDQADAA TVAEKFYGGR YELNLVRPGA FTSDVPVDDQ FAELQFNSPA TGNLFTFSRP
AQARDYARRA VVETDQQKRR ELFAQMHVAS MEELPTLPLV YTPNRAAVAN EVRDFNYMLT
GYWRLESAWL EQP