Gene Cwoe_5889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5889 
Symbol 
ID8736365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp6300656 
End bp6302443 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content69% 
IMG OID646506515 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003397664 
Protein GI284047324 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGAC TCGTCGTGCG GACGCTCATC GCGAGCCTCC TCTGCATGAC CCTCGCCTCG 
ACGATCGCCG CGTGCGGGTC CGGCGACGAC GACGGTGGCG GCGGGAACGC CCGCACGACC
GCGAGAGCGG ACAGCGGCGC CGGCGGCAGA AGAGGGGGGA CGCTGAGAGT CCTCACCGAG
GAGGACGTCA GCGGGCTCGA CCCGGGCGTG ACCTACTCGA GCGCCGCGTT CAACCTGCTC
TCCGGGACGG TCCGCCCGCT CTACCGTTAC GCGCCCGAGA ACCCGACCGA CATCGAGCCC
GACCTCGCCG CCTCCCAGCC GCAGATCTCG GCGGACGGCA GAACCGTCAC GGTGAGAATC
CGCAGAGGGG TGAGATTCGG CCCGCCGGTG AACCGGGAGG TGACGTCGAG AGACGTCAAG
TACGCGATCG AGCGCGGCTT CAATCCGAGC GTCGGCAACC CCTACGCGCC GACGTACTAC
GGCGACCTCG TCGGCGTCGA CAGAGCTGAC GGCGGCCCGA TCGCCGGGAT CGAGACGCCC
GACGAGCAGA CGATCGTCTT CCGCCTCACG AGACCGACAG GCGGCGTCCT CGCGCAGGCG
ACGACGCTGC CGCTGTCGGC TCCCGTGCCG CAGGAGTACG CGAGCAGATT CGACGACAAG
CCGGAGGGCG AGCTGACCGA CTACGGCAAC TGGCAGATCT CGTCGGGGCC GTACATGTTC
GCCGCCGACG CGAACGGCAG AGCGCTCGGC AACGGGATCG TGCCCGGTCG CAGACTCGAG
CTCGTGCGCA ACCCGAACTG GGACGCCGCG ACCGACGGGC GGCCCGCCTA CCTCGACGGC
ATCGACTGGT CGGTCGGCAA CGAGCCGAAC GTCGCCGGCC GGCAGGTGCT CGACGGCAGC
GGCCTGACGC TCGGCGACAC GCCGACCGCC GAGACGGTCA AGCGCGCGGT CCAGCGCTAC
CCGGAGCAGA TCTTCTTCAG CCCCGGCGCC GGCAACCGCT ACGCCGCGCT CAACACCGCG
ATCCCGCCGT TCGACGATCC CGACCTGCGC AAGGCGGTCG CCGCGCAGCT CGACCGCGAG
CAGATGCGGC TCGTCCGCGG CGGCGCCTCG ATCGGCGACA TAGCGACGCA CCTCCTCTAC
CCCGGCGTCG CCGGCTTCGA GGAGGCCGGC GGGATGAGAG GCCCGGAGCT CGACTTCCTC
GCGAACCCCG CCGGCGACCC GGCGATCGCG AGAAAGTACA TGGCGGCGGC CGGCTACCCC
GACGGCAGAT ACACCGGCAG AGAGACGGTC GAGGTCGTCG GCGTCTCCGG CGACCCGGCC
GACAAGGACT CGCAGCTGGT CGACGAGGCG CTCAGACAGC TCGGCTTCAG AACGAAGCTG
CGGCTCGTCG ACTCGGACAC GATGTACGGC AGATTCTGCG CGTCGCCGAA GGCGAGAGCC
GAGGTCTGCC CGATCCTCGG GTGGATACGC GACTTCGCCG ATCCGCAGAC GGTGCTCGAC
GCCGCCTTCA ACGGCACGAC GATCTCGCAG GAGGACGGCA CCAACTCGAA CTGGCCGCAG
CTGAACGATC CGAGAATCAA TGCGGCGATG GCGAGAGCGG AGTTGGTCGT CGACAAGCAG
GAACGTGCCG AGGCGTGGGC GAACATCGAT CGCATGATCA CCGAGACCGG TGCCGCGATC
CCATGGCTGT GGGACAAGCA GCCGGTCATC TCCTCAAAGG ACGTCCGCTG CGCCAACCAG
CTGTGGAACC AGGGGCACTG CGACTTCGCC TACAGCTCCC TGAGATAG
 
Protein sequence
MPRLVVRTLI ASLLCMTLAS TIAACGSGDD DGGGGNARTT ARADSGAGGR RGGTLRVLTE 
EDVSGLDPGV TYSSAAFNLL SGTVRPLYRY APENPTDIEP DLAASQPQIS ADGRTVTVRI
RRGVRFGPPV NREVTSRDVK YAIERGFNPS VGNPYAPTYY GDLVGVDRAD GGPIAGIETP
DEQTIVFRLT RPTGGVLAQA TTLPLSAPVP QEYASRFDDK PEGELTDYGN WQISSGPYMF
AADANGRALG NGIVPGRRLE LVRNPNWDAA TDGRPAYLDG IDWSVGNEPN VAGRQVLDGS
GLTLGDTPTA ETVKRAVQRY PEQIFFSPGA GNRYAALNTA IPPFDDPDLR KAVAAQLDRE
QMRLVRGGAS IGDIATHLLY PGVAGFEEAG GMRGPELDFL ANPAGDPAIA RKYMAAAGYP
DGRYTGRETV EVVGVSGDPA DKDSQLVDEA LRQLGFRTKL RLVDSDTMYG RFCASPKARA
EVCPILGWIR DFADPQTVLD AAFNGTTISQ EDGTNSNWPQ LNDPRINAAM ARAELVVDKQ
ERAEAWANID RMITETGAAI PWLWDKQPVI SSKDVRCANQ LWNQGHCDFA YSSLR