Gene Cwoe_2323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2323 
Symbol 
ID8732766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2456779 
End bp2458335 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content73% 
IMG OID646502940 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003394122 
Protein GI284043782 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.250322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0484761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACTT CCCGCGTCCT GGCGGCGCTC GCGTGCATCG GCGGTCTCGT CGCCGCCGGC 
TGTGGCTCCG CCACCACCAA CAGCGATCGA CCGCTCTCGA CGCTGCGCGT CCCGTTCCAG
TCGGACGTCC CCGGCGGGGT CGACCCCGAC GTCTTCTACG ACGTCGAAGG GCTGCAGATC
ACCAACTCGG CCTACGAGGG CCTGCTCGGC TACGCCGACG ACGGCAGACT CGTCGGCGAG
CTCGCGACCG ATTGGAGAGC GGGCGCCGAC GGGCGGACGT ACGACTTCAC GCTGCGCCCC
GGCGTGCGCT TCTCGGACGG GACGCCGTTC GACGCGCAGG CGATGAAGGC GAGCTTCGAG
CGCCGCAGAC AGGTCGACGC CGGCCCCGCC TACATGCTCG CCGACGTCGA GCAGGTGGAG
GCCCTCTCCC CGAGACGTCT GCGCGTGCGC CTGAGCAGAC CGAACGCGGC GTTCCTCGAC
CACCTCGCCA GCCCCTACGG GCCGAAGGCC GTCAGCCCGA CCGCGGTGCA GCGCCACGCC
CGCGACGGCG ACCTCGCGAA GGGCTGGCTG CAGACGCACA CGGCCGGCAC GGGCGCGTAC
GAGCTGACCG AGGCGGTGCC GGGGCAGCGC TTCGTGATGC GCGCCTCGCC GACGTGGAGA
CGCAGCAAGC CGACCGTGCG CGAGGTGCAG TTCACCGTCG TCCCGGACGC GGCGACGCAG
GTCACCGAGC TGCGCGGCGG CCAGCTCGAC CTCATCACGC ACGGCCTCAC GACCGCCGAC
GTGCAGGCGC TGCGCGGGGC CGGCGGCGCG AAGGTGACGA CGCGCCCCTC GACCCTGCGC
ATGATGCTCT ACCTCAACAC CGCCGCGGGC ACGCTGCGCG ACGCTGAGGT GCGGCGCGCG
TTCCTGAAGT TCGTCGACCG CGACGCGCTC GTCGACACCG TCTACGGGGA TCTCGCGCGG
GCGAGCGACA GCTTCTACCC GGACGGCACG TCGATCGCGA GAGCGGCGCC GCTCGACGTG
CCGGTCGACC CCGACCAGCT CAGAGCGCTG TCGTCGCGCT TCACGGAGCC GCTCGTGATC
GGGGCGGTGC AGGGAGACGG CCCCGCCGCT GGCCAGATCG CCCAGCTGCT CCAGGGACAG
TTGCAGCAGG CAGGCATCAA GGCGACCACA CGCGACATCC CGCTCGCGCA GGTGTACGAC
CTCACGACGC GCCCCTCCGC GCGCCCGGAC GTCCTGATCG TCACCAACGT CCCTGACGAC
CTCGCGCCCG ACAGCTGGTC GCGCGTCTAC CTGCGCACCG GCGGCTCGGT CAACTGGCTC
TCCTGCTCGG TGCCGGAGGC CGACCGGCTG ATCGACGAGG CGGTCGTCGC TCGCGGCGCC
GCCCGTCAGC GAGAGCTCGG CGTCGAGGCG GCCGCGGCGT GGATGGAGCA CGGCTGCGTG
CTGCCGCTCG CCGAGCTGCA GAACGTGACG GTCTCGCGCA AGGGCGTCGA GAACGTGAAG
GGCGGCCCGG CACGGCCGTT CGCGGTCGAC GTCGACAAGC TGCGGCAGGC GAGATGA
 
Protein sequence
MRTSRVLAAL ACIGGLVAAG CGSATTNSDR PLSTLRVPFQ SDVPGGVDPD VFYDVEGLQI 
TNSAYEGLLG YADDGRLVGE LATDWRAGAD GRTYDFTLRP GVRFSDGTPF DAQAMKASFE
RRRQVDAGPA YMLADVEQVE ALSPRRLRVR LSRPNAAFLD HLASPYGPKA VSPTAVQRHA
RDGDLAKGWL QTHTAGTGAY ELTEAVPGQR FVMRASPTWR RSKPTVREVQ FTVVPDAATQ
VTELRGGQLD LITHGLTTAD VQALRGAGGA KVTTRPSTLR MMLYLNTAAG TLRDAEVRRA
FLKFVDRDAL VDTVYGDLAR ASDSFYPDGT SIARAAPLDV PVDPDQLRAL SSRFTEPLVI
GAVQGDGPAA GQIAQLLQGQ LQQAGIKATT RDIPLAQVYD LTTRPSARPD VLIVTNVPDD
LAPDSWSRVY LRTGGSVNWL SCSVPEADRL IDEAVVARGA ARQRELGVEA AAAWMEHGCV
LPLAELQNVT VSRKGVENVK GGPARPFAVD VDKLRQAR