Gene Cwoe_0576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0576 
Symbol 
ID8731004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp606414 
End bp608015 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content70% 
IMG OID646501189 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003392386 
Protein GI284042046 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGACG GTGTGAGAGG TCTGACGCGG CGTGACGCGA TGCGCGGCGC CGCCGCGGGC 
GCGGCCGTGG TCGGCGCCGG CGGTCTGCTG GCGGCCTGCG GCAGCGGCGG CTCGTCGAGC
GGCACGACGG CGTCCGGCGA GACGACCGCC ACGGCGGGCG GCACGCCGAG AAGCGGCGGC
ACGCTGCGCG TCGGCGGAAC CGGCGGCGGC GCGCGCGACT CGCTCGACCC GAACCGCCAG
CAGACGGCGC TCGACTTCGC CCGCTGCTTC GCGCTCTACG ACCCGCTCGT CGAGCTGACC
GAGCAGTTCA CCTACGAGCT GGCGCTGGCC GAGGAGATCA CGCCGGACGA CGGCAGCGCG
AAGGTGTGGA CGGTCCGGCT GAAGGACGGG ATCGAGTTCC ACGACGGCAA GACCGCCGAC
GCTGAGGACC TGATCTTCAG CATCGGCCGC GTGATCGACC CGAGAGCCCC GGGCGCCGGT
GCGAACGCGC TCAGAGGCGT GACTCTGAAC GGGATGAGAA AGCTCGACGC GCGCACCGTC
CGCTTCACGC TCGAGCAGCC GATCTCGATC TTCGACAAGC GCGTCGGCGG GTACCTCTCG
CCGCTGCTGC CGGTCGGGTA CGACCCGGCG AGACCGGTCG GCGCCGGCCC CTTCAAGCTG
CAGAGCTTCA AGGCCGGCGA CCGCTCCGTG ATGGTCCCGC ACCCGAACTA CTGGGGCGAG
AGAGCGCACG TCGACCAGCT CGACATCATC GGCATCGCCG ACGCGTCGGC GGCGGTCAAC
TCGCTGCTCT CCGGCCAGAT CGACATCCTT CAGGGCCTGC CGCCGGCGCA GGCCGAGGTC
GTCACCTCGG GCGGCGGCAA GCTGCTGGAG ACGAACGACT CCGCATGCTT CATGTTCGGC
ATGCGGATGG ACATGGCGCC GTTCGACGAC GTGCGCGTGC GGCAGGCGAT GCGGCTGATC
GCCGACCGCG ACCAGATGGT CGAGCAGGTG ATGGCCGGCC GCGGCGACGC CGCCAACGAC
CTCTTCGCCC GCTACGACCC CGACTACCTG TCGGACGTCC CGCAGCGCGA GCAGGACCTG
GAGCAGGCGA GAGCGCTGCT GAAGCAGGCC GGCCAGGACG GCATGCGGCT GGAGATCTCG
ACGACCGGCG CCTATCCGGG CCTGCTGGAG TCGGCGCAGG TCTTCGCCGA GCAGGCGAAG
GGCGCCGGTC TCGACGTCAA GGTCAGAAGC ATCGACCCGG ACACCTTCTA CGCCCGCTAC
TACCGCAGAA CGCCGTTCTC GCCGGACCTC GTCTCGCCGC AGCTGTACCT GACGGTCGCG
ACCTCCTACA ACACGCCGGG CGGCCCCTAC GACACCGTCT ACAACAGAGA CCCCGAGTAC
CTCGCGCTCT ACAGAGACGC GCTCGCGGAG CTGGACGAGG CCAAGCGCGG CGAGCTGATC
GAGGCGATGC AGCGGATCGA CCACGAGCGT GGCGGCTACG TCTGCTGGGG CTTCTCGAAG
AGCCTCGACG CCTATCGCGA CGACGTCAAC GGGCTGGTGC CGGGGACGAA GGCGGCGTTC
AGCGTCAACA ACGGGGCGTT CAACCGGCTC TGGCTCAGCT AG
 
Protein sequence
MGDGVRGLTR RDAMRGAAAG AAVVGAGGLL AACGSGGSSS GTTASGETTA TAGGTPRSGG 
TLRVGGTGGG ARDSLDPNRQ QTALDFARCF ALYDPLVELT EQFTYELALA EEITPDDGSA
KVWTVRLKDG IEFHDGKTAD AEDLIFSIGR VIDPRAPGAG ANALRGVTLN GMRKLDARTV
RFTLEQPISI FDKRVGGYLS PLLPVGYDPA RPVGAGPFKL QSFKAGDRSV MVPHPNYWGE
RAHVDQLDII GIADASAAVN SLLSGQIDIL QGLPPAQAEV VTSGGGKLLE TNDSACFMFG
MRMDMAPFDD VRVRQAMRLI ADRDQMVEQV MAGRGDAAND LFARYDPDYL SDVPQREQDL
EQARALLKQA GQDGMRLEIS TTGAYPGLLE SAQVFAEQAK GAGLDVKVRS IDPDTFYARY
YRRTPFSPDL VSPQLYLTVA TSYNTPGGPY DTVYNRDPEY LALYRDALAE LDEAKRGELI
EAMQRIDHER GGYVCWGFSK SLDAYRDDVN GLVPGTKAAF SVNNGAFNRL WLS