Gene Cwoe_1250 details

Gene Information

Locus tag: Cwoe_1250
Symbol:
ID: 8731688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 1309023
End bp: 1310675
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 69%
IMG OID: 646501867
Product: extracellular solute-binding protein family 5
Protein accession: YP_003393054
Protein GI: 284042714
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
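
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1309023
END_BP = 1310675


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1653 550
```

Under those assumptions the result matches the listed Gene Length (1653 bp) and Protein Length (550 aa).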


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.405882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0838621
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGGAG CCCAGGACGA CCGGGCGCGC GAGGCCCTGG AGATCGCCGG GCGCAAGCTG 
CGTCGCGGGG AGATCTCGCG CCGTGACTTC GCACGCCTGA CCTCGATGCT CGGAATCGTC
GCCGTCGCGC CGACCGCGCT CGCCGCGTGC GGCAGCAAGG CGACGACGGG CGGGGGCGGC
ACGACGGCGT CGGCCGGCGG CGGGGGCGGG GGCAAGGACT CGCTGCGCTT CCTCGTCGGC
GAGTCGTTCT GGGCCAACTG GCATCCGTAC AACCACACCG CGCAGATCGG CTTCAAGATC
CAGCGCAACC TCTTCGACCG GCTCGTCGAG GTGCAGCCCG ACATGAGCCT GAAGCCGGGG
CTGGCGGAGT CGTGGAGACA GATCGACGCG CGCACGTGGG AGTTCAAGCT GCGCGAGGGC
GTGACGTTCC ACGAGGGTCA GGAGCTGACC GCCGAGGACG TGAAGGCGTC GGTCCAGCTC
GCCTCGGGCT TCGACGGCGA CAAGAAGCAG CCGCTCGCGA TGGCGGCGAC GTGGGGCGTC
CCGCACAAGG GCGAGGTCGT CGACAGACTG ACCGTCCGCC TGACCGGCGA GAAGCCGTTC
GGTCCGCTGC TGAACACGCT CGCGATCACC GACATCCTCT CCGCGAAGGA CATCGCGCGA
GGCAGAACGA CGCTCGAGAA TCGGCCCAAC GGCACCGGCG CCTTCAAGCT CGTCGAGGAC
AAGCCGAACG CGAAGACGCT GGAGCGCTTC GACGACTACT ACCGCGGTCC GGCGAAGCTG
AGGACGATGA CGTGGGAGTT CATCCAGGAC TCCCAAACGC GCCTCAACGC GCTGCTCGCG
GGGCAGGCCG ACGTGATCGA CCGCGTCGAG CCCGATCAGC TGCCGTTGAT CGAGAAGAGC
GACGGGGCGT CCGCGATCTC CGTCACCGCG CCCGAGATCC AGTCGATGTG GTTCCGGATG
GACAAGGATC CGTTCGGCTC CAACGCCGGC CTGCGCAGAG CGTTCGCGTG GTCCCTGGAC
CGCGAGTCGA TGGCCGGCCT CGTCGGCGGC AAGGCGACCG TCGGCGACTC GCATCTCGCG
AGTGGGATCG AGTTCCGCTC CGCGCAGGAG CCGATGTACT CGTTCGACCC TGAGCGCGCC
AGAGCGGAGC TGGCGAGAGC CGGGGGTCCG GTGAGCTTCG AGCTGGCCAG CTCGACCGGC
TTCTACCCGA AGTCGAAGGA GATCTGTGAG CTGGCGAAGC AGAACCTCGA CGAGGTCGGC
TTCGACGTCA AGCTGACCCT GATGGAGCTG GCGGCGTGGA TCGACATGCT GTTCGGCAAG
GGCAGACCCG GCGAGGTCTT CTACGGCGGC TGGGGCAATC TCACGAAGGA CCCCGACTTC
GCGCTCGCGA CGCTGCTGCA CTCGCCCGGC GCGTGGACCG GCGCGCACGA CAGAAGAGCC
GACGCGCTGA TCGACGCCGG CAAGACGGCG ACCGAGCCGG CCAGACGCGA GCAGATCTAC
GGCGAGCTGC AGACGTACTT CTGGGACGAG TACGTGCCGT CGGTCCCGGT CCTCTACAGC
GACTTGTCCA ACGGGCTGCG CGAGAACGTG CAGGGGTACG AGGTCTACCC GACCGCCGTG
CAGGAGTTCT GGCCGGTGGA GATCGGCGGG TGA
 
Protein sequence
MAGAQDDRAR EALEIAGRKL RRGEISRRDF ARLTSMLGIV AVAPTALAAC GSKATTGGGG 
TTASAGGGGG GKDSLRFLVG ESFWANWHPY NHTAQIGFKI QRNLFDRLVE VQPDMSLKPG
LAESWRQIDA RTWEFKLREG VTFHEGQELT AEDVKASVQL ASGFDGDKKQ PLAMAATWGV
PHKGEVVDRL TVRLTGEKPF GPLLNTLAIT DILSAKDIAR GRTTLENRPN GTGAFKLVED
KPNAKTLERF DDYYRGPAKL RTMTWEFIQD SQTRLNALLA GQADVIDRVE PDQLPLIEKS
DGASAISVTA PEIQSMWFRM DKDPFGSNAG LRRAFAWSLD RESMAGLVGG KATVGDSHLA
SGIEFRSAQE PMYSFDPERA RAELARAGGP VSFELASSTG FYPKSKEICE LAKQNLDEVG
FDVKLTLMEL AAWIDMLFGK GRPGEVFYGG WGNLTKDPDF ALATLLHSPG AWTGAHDRRA
DALIDAGKTA TEPARREQIY GELQTYFWDE YVPSVPVLYS DLSNGLRENV QGYEVYPTAV
QEFWPVEIGG