Gene Cwoe_2811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2811 
Symbol 
ID8733254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2999485 
End bp3001107 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content72% 
IMG OID646503423 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003394605 
Protein GI284044265 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.490323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATCG GCAAGTCTGC ACCTGCGGCG GCGATCGTCG GCGCGCTCTG CGCGTTCGCG 
ATCGCCGGCT GCGGCGGTGA CAGCGACTCG TCGTCGACGA CGGACACGAG CGCGAGCAAG
GGCGGAGGCG GAGGCGCCGC CGCGACAGGC GCGCTCAGAA GCGGCGGCAC GCTGACCGCC
GCGCTGACGG CCGAGCCCGA CTACCTCGAC CCAGCGAGAG CGCAGTCGGG GCAGAGCTGG
CAGGCGCTGG TGCCGATCTG CGAGGGCCTC TACGCGCTCG ACGGCGGCGC GACGAAACCG
CAGCTCGCCG TCGGCGAGCC GGTCCTCCGG GACGGCGGCA AGACGGTCAC GATCAGGCTG
CGCAGAGGCG TCAGGTTCAA CGACGGCACG CCGTTCGACG CCGAAGCGGT AAAGACGTCG
CTGCTGCGCA ACCACAGAAC GTCGGTGCTG TTCCAGGGCA TCCCGCTGGA GCAGGTGCTG
ACGCCGTCGG CAGACACGAT CGTGCTGAAG CTGGCGGCGC CGTACGCGCC GCTGATCTCC
GACCTCGCCG GCCCCGGCGG CATGATCGCC TCGCCCGCGC AGCTGGAGAA GCTGGGCGAC
AGATTCGGCG ACGACCCGGT CTGCGTCGGG CCGTTCGAGT TCGTCAGCCG CAGAGCCGGC
GACGCGATCA CGCTCAGACG CGCGCCCGGG TACTACGACG CCGACCGCGT CAAGCTCGAC
GAGCTGGTCT TCAAGATCAT GCCGGACGAG GACGCGCGCA GCACGAGCCT GCGCTCCGGC
ACGATCGACG TCGCGCTGGA CCCGGCCGAC GCGGAAGGGC TTGCGGGCGA CGACGCGCTG
CGCGTCGCGG AGGTTCCCGG CGCGGGCTGG CACGGCGTCT ACTTCAACGT CGGCAACTCC
GCCGGCATGG GCAAGCCGCT GGCGCCGCGC GACAGCGCGC TCGGCAGATC GGCCGCCGTG
CGCAGAGCGT TCGAGCGCAC GCTCGACCGC GAGGCGCTGC TGTCGCTCGG CCACGACGCC
GGCGCGGACG TCTCGTGCAG CATCATCTCG ACGACCAGCT CGCTGCGCTC CGACGTCCCG
TGCGAGGCGA ACGCCGATCC CGAGGCGGCG CGACAGCTGC TGGAGGAGGC CGGGGTGGAG
ACGCCGGTCA AGGCCCAGCT GAACGTCTCG GCGTCGCCGG ACCTGCTGCG TGAGGCGCAG
GCGATCCAGG CGATGGCGAG AGAGGGCGGG TTCGAGGTCG AGATCGACCA GTGCGACGTC
GCGAGCTGCA TCAAGCGGCT GATCGGCGGC GACTTCGACC TGGCGCTCGG CGGCTTCTCC
GGCTTCCCCG ACCCCGATCC GAGCATCAGC CCGTTCGTCT CGACCAGAGG CGGGTTCAAC
TTCGTCGGCA TGTCGGACCC GGAGCTCGAC AGACTGCTGG AGCAGGCCCG CGCCGCGTCC
GGCGACGAGG CGGAGCGCAG ACAGCTCTAC GCACGCGCTC TGGAGATCGT CAGCGAGCAG
CTGCCGCTGG CGGTCATCGG CAACCCCGGC GTGACCGTCG CGTCGCGCAG CGACGTCGGC
GGCTTCGAGG TCTCCGCGAG CGAGATCGTC GACTTCACCG GCGCCGGGTT CACGCAGGGC
TGA
 
Protein sequence
MLIGKSAPAA AIVGALCAFA IAGCGGDSDS SSTTDTSASK GGGGGAAATG ALRSGGTLTA 
ALTAEPDYLD PARAQSGQSW QALVPICEGL YALDGGATKP QLAVGEPVLR DGGKTVTIRL
RRGVRFNDGT PFDAEAVKTS LLRNHRTSVL FQGIPLEQVL TPSADTIVLK LAAPYAPLIS
DLAGPGGMIA SPAQLEKLGD RFGDDPVCVG PFEFVSRRAG DAITLRRAPG YYDADRVKLD
ELVFKIMPDE DARSTSLRSG TIDVALDPAD AEGLAGDDAL RVAEVPGAGW HGVYFNVGNS
AGMGKPLAPR DSALGRSAAV RRAFERTLDR EALLSLGHDA GADVSCSIIS TTSSLRSDVP
CEANADPEAA RQLLEEAGVE TPVKAQLNVS ASPDLLREAQ AIQAMAREGG FEVEIDQCDV
ASCIKRLIGG DFDLALGGFS GFPDPDPSIS PFVSTRGGFN FVGMSDPELD RLLEQARAAS
GDEAERRQLY ARALEIVSEQ LPLAVIGNPG VTVASRSDVG GFEVSASEIV DFTGAGFTQG