Gene Cwoe_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2501 
Symbol 
ID8732944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2661842 
End bp2663668 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content68% 
IMG OID646503116 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003394298 
Protein GI284043958 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGATG GTCGTCGACG GCGCTTGTCG CCCGTCGGGA TGATCGGCGC CGCCGTGCTG 
CTGATCGTGC TGGTCGTCGT GGTCGTGTCG GTCGCCGGTG GCGGTGATGA CGACGACAAG
ACGAGCGTGA GCGCAGGCGG CGGGTCCACC ACCGCGCCGG CCAGAAGCTC GGGCGGGGCG
GCGACGCGTG AGGAGACGCT CGTGCTCGGG CAGTACCGCC CGCCCACGGG CAAGATCGGC
AACCCGTACG TGCAGGCGAG CGACGCGCTC GTCTCCGACG GGCTGCACGA GCTGGTCTAC
GAAGCGCTGT TCTACGTGAA TTACCAGACC GGTGAGACCG AGCCGTGGCT CGCGACCGGC
TATGAGTACA GCGACGACAA CAGAACGATC ACGCTGAGAC TGCGTGACGA CGTCAGCTGG
AACGACGGCA AGCCGTTCAG CGCCGACGAC GTCGTCTACA CGATGAGACA GATCCTGGCG
GCCAGAGCGC CGTTCCGCGC GGCCAACATA CAGGGCGCGG TCAGATCGAT CAGAAAGCTG
TCGCCGACCG AGGTCCGGAT CGACCTCAGA GCGCCCAACC CGCGCTTCGT CGACAGCGAG
CTGTCCTCCT ACGTCTACAC CGCGAACTTC ATCCCGCTGC CCAAGCACGT CTTCGAGGGG
CAGAGATTCG AGACGTTCGC CTTCTACGAC CTCGCCAGAG GCCTGCCGCT CGGCACCGGC
CCGTACCGCC TCACCGACGT CACCGCCTCC GCGGCGACGC TGCAGCGCAA CGACGACTGG
TGGGCGGCGA GAGCCGGCGT CGCCGACGTC GTGCCGAAGA AGGTCGTCTA CACGAGCCCC
GGTCCCGAGG ACTCGGCCGT CTCCGGCCTG GAGAGCAGCG CGCTCGACTA CGCCGGCCAG
TCGGTCCCGT CCGTCGCCGG CTTCATCGCC GCGAAGGAGC GCAACCCGCA GCTCGTCAAC
TGGGACGGCG ACCTTGGCTG GCTCGACCCG TGCCCGTACG CGCTGACCGT CAACACGAAG
CGCAGACCGT GGGACGACGC CGAGCTGCGC TGGGCGCTCA ACGCCTCGAT CGACAAGGAG
CAGTTCAGCC GCCTCTTCAA CACCCCCGGC GAGTCGACCC CGGCGCGCAC GACGTACCCC
GAGTACCCGC AGCTGAGCGA GCTGATAGAC GCCAACGAGG ACCTGCTCGC CGAGTACCCG
ACGCTCGACC ACGACCTCGA CAGAGCGGCG CAGATCTTCG AGTCGAAGGG CTACAGAAGA
GAGGGCGGCG TCTGGACCAA GGACGGGCAG AAGCTGTCGC TGAAGCTCAA CCTCTTCTCG
CCGGCCGCGC TCGGCCCGGT CTGGGGCGAT GCGGCGCAGC TGCTCAACCA GCAGTTGAGA
GAGGCCGGGA TCGCCGTCGA GGTCGACCCG GGCGACTTCA ACACGATCGC GGCCAACCGC
GCCGAGGGCA GATTCGACGC GCAGTCGTGG TTCGAGTGCG GCAGCGTCAC CGATCCGTGG
GCGACGCTCA ACCGCTACAC GAACGCGCCG GGCAACGACA ACGCCGGCAG ATGGAGCAAC
GCCGCCTACG ACAGAATCGT CGCGCAGATG GGCGAGCTGC CGCCGGGCGA CGCGCAGATC
AGAGAGCTGT ACGCGCAGGC GATGGAGATC TGGCTCAGAG AGCTGCCGGT GATCCCGCTC
AACCAGCGGC CGACGCCGAT CGTGATGAAT CAGACGTACT GGAGAAACTG GCCGACCGCG
GACAACGGCT ACACGCAGCC CGCGCCGTTC GGGATGAACT TCCACCAGGT CATCACCAGA
CTCCAGTCCG CGAGAGGCGA GCAGTGA
 
Protein sequence
MEDGRRRRLS PVGMIGAAVL LIVLVVVVVS VAGGGDDDDK TSVSAGGGST TAPARSSGGA 
ATREETLVLG QYRPPTGKIG NPYVQASDAL VSDGLHELVY EALFYVNYQT GETEPWLATG
YEYSDDNRTI TLRLRDDVSW NDGKPFSADD VVYTMRQILA ARAPFRAANI QGAVRSIRKL
SPTEVRIDLR APNPRFVDSE LSSYVYTANF IPLPKHVFEG QRFETFAFYD LARGLPLGTG
PYRLTDVTAS AATLQRNDDW WAARAGVADV VPKKVVYTSP GPEDSAVSGL ESSALDYAGQ
SVPSVAGFIA AKERNPQLVN WDGDLGWLDP CPYALTVNTK RRPWDDAELR WALNASIDKE
QFSRLFNTPG ESTPARTTYP EYPQLSELID ANEDLLAEYP TLDHDLDRAA QIFESKGYRR
EGGVWTKDGQ KLSLKLNLFS PAALGPVWGD AAQLLNQQLR EAGIAVEVDP GDFNTIAANR
AEGRFDAQSW FECGSVTDPW ATLNRYTNAP GNDNAGRWSN AAYDRIVAQM GELPPGDAQI
RELYAQAMEI WLRELPVIPL NQRPTPIVMN QTYWRNWPTA DNGYTQPAPF GMNFHQVITR
LQSARGEQ