Gene Cwoe_1930 details


Gene Information

Locus tag: Cwoe_1930
Symbol:
ID: 8732371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2030379
End bp: 2031878
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 67%
IMG OID: 646502547
Product: extracellular solute-binding protein family 5
Protein accession: YP_003393731
Protein GI: 284043391
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
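
The Start bp, End bp, Gene Length and Protein Length fields above are linked by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2030379, 2031878)
print(length, protein_length_aa(length))  # 1500 499
```

Both results agree with the Gene Length (1500 bp) and Protein Length (499 aa) fields.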


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.15113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTGC TCGTCACGGT GGCGGGCTGC GGAGACGGCT CGTCGTCCTC GACGTCGTCG 
TCGCGTTCCG GGGAGAAGGT GCTGCGCTAC GGGGTGACGT CGAAGCTCGA CACGTACAAC
CCGGCGAGAG ACTCCGTCAC CGGCGCGACC AACATCCGCT ACCTGACGAC GGAGACGATC
CTCGAGAAGG ATCCCGAGAC CGGCCAGTAC GGGCCCGGCC TCGCCACGGA GTTCGGCTTC
GCCGGCAGAG GCAACACCGC GTACGAGTTC ACGCTGCGCG AGGACGCGAG CTTCTCCGAC
GGCACGCCGC TGGACGCCGC GGCAGTGAAG AAGTGGCTGG AGTACTTCTC CAGAGCCGGC
GGGCCGTGGG TCGGTCTCGT CGCGCTCAGA TCGATCGAGA CGCCCGGCAG ATACACGGTA
CGGCTGAACT TCAGAGTGCC GAGCCCCAAC ATCGAGTACT TCCTCGCCGG CGGCAACAAC
TGGGGCTTCG TCTCCAGCCC CAGAGGGGTC GACGACCCCA AGCTGCTGGC GCAGGACATG
ATGGGCGTCG GCCCGTACGT GATGGACCCC GGCGAGAGCG TCGCGGGCGA CCACTACACG
TTCACGCCCA ACGACCGCTA CTACGACCAG TCGAAGGTCA AGTGGGACAA GATCGTCGTG
AAGGTGATCA CCGATCCCTC GACGATGGTC AAGGCGCTGC AGGCGGGCGA GCTCGACGCC
TCCCAGGGCG ACTTCTCGAC CGTCGGCACG GCCGAGAGAG TCCCCGGACT GAAGGTCCAC
TGGGGACAGG GCGGCTGGGA TCCGATCCTG CTGCTGACCA AGCACTCCGA GCCGCTGCGC
GACGTGCGCG TCCGTCAGGC CCTCAACTAC GCGATCGATC GCGAGGCGAT CACCCAGGCG
ATGCTGGGCA AGTACGGCGA GCCGACCTCC GAGTGGGTCA CGACCGACGG CTTCGACCCC
GAGTACCAGG ACTACTACGA GTACGACCCG GAGAAGGCGA AGCGGCTGCT CGAGGCGGCC
GGATACGGGG ACGGGCTGAC GCTGAAGGTG GTCGACCAGG GCTACTACGG CAACCTCGGC
GACCGAATGG TCCAGATCGT CGCCGACTAC ATGAGCAGAG TCGGCGTCAC GTTCCAGGTC
ACCAAGGCGA CGTCCGCCTC CGAGCATCTG GAGAAGGGCT TGTCGGGCGC GTTCGACGCC
TGGCAGTTCG CGGTCGGCAG CGTGCCGACG GCCACCTTCC TGGACTTCTT CGGCGAAGGG
TTCGGCGTCA TCCCCGACCC CGAGCTGGAC GAGATCGCGG CGCGCGCGTC CGTCGCGCCC
AAGGAGGAGA TGCCGGAGAT CTGGAAGGAG TTCTCGCGGC GCACCGTCGA GCAGGCGTCG
ATGCTGAACA TCTTCACGAC GCCGGTCCTG CTCTACGCCC GCGACGACAT CGAGGGCGTC
GTCGCGACCC CCGCGTTCGG CGTGTCGCCG GCGCTGCTGC AGTGGGAGCC CGCGAGGTGA
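
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. As a minimal sketch, the helpers below strip that layout and compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and how the database rounds its figure is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input in the same blocked layout; paste the full gene sequence to check the record.
print(gc_percent("ATGC GGCC"))  # 75.0
```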
 
Protein sequence
MALLVTVAGC GDGSSSSTSS SRSGEKVLRY GVTSKLDTYN PARDSVTGAT NIRYLTTETI 
LEKDPETGQY GPGLATEFGF AGRGNTAYEF TLREDASFSD GTPLDAAAVK KWLEYFSRAG
GPWVGLVALR SIETPGRYTV RLNFRVPSPN IEYFLAGGNN WGFVSSPRGV DDPKLLAQDM
MGVGPYVMDP GESVAGDHYT FTPNDRYYDQ SKVKWDKIVV KVITDPSTMV KALQAGELDA
SQGDFSTVGT AERVPGLKVH WGQGGWDPIL LLTKHSEPLR DVRVRQALNY AIDREAITQA
MLGKYGEPTS EWVTTDGFDP EYQDYYEYDP EKAKRLLEAA GYGDGLTLKV VDQGYYGNLG
DRMVQIVADY MSRVGVTFQV TKATSASEHL EKGLSGAFDA WQFAVGSVPT ATFLDFFGEG
FGVIPDPELD EIAARASVAP KEEMPEIWKE FSRRTVEQAS MLNIFTTPVL LYARDDIEGV
VATPAFGVSP ALLQWEPAR
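
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon table from the compact NCBI-style amino acid string (bases ordered T, C, A, G) and translates up to the first stop codon. It assumes the sequence is given in the coding orientation and starts with ATG, as this one does; the alternative start codons that table 11 permits are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the ten-base display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCGCTGC TCGTCACGGT GGCGGGCTGC"))  # MALLVTVAGC
print(translate("GCGAGGTGA"))  # AR
```

The two outputs match the first ten and last two residues of the protein sequence listed above.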