Gene Cwoe_2945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2945 
Symbol 
ID8733390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3148415 
End bp3150112 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content69% 
IMG OID646503559 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003394739 
Protein GI284044399 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.472455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0524757 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCTG ACGCATTGAG AAGGTCGCGG CGCGTGCTGG CGCCGCTCGC CGCGGTGCTC 
GCGGCGCTGC TCGTCCTGGC CGGCTGCGGC GGCGGCGGAT CGGACCTGCC GGACGGCGTC
ACGCAGAGCG GTGACGCGAG CACCGCGGCC GCCGCGGACG GCGGTGCGGG CGGCGGATCC
GCCGGCAGAC TCGCGTTCAC CCGCCTGGGT ATCAACACGC CCGGCTTCGG CCCGTGGAAC
CAGAGCACGG GCAACGACGC GATCGTCAAC TCGCTGCTGT TCTCGAACCT CGTGAAGGTC
AGATCGGACG AGAGAACGCT CGCGCCCGAC CTCGCCGAGA GCTGGGAGGC CTCCAGCGAT
CAGCGCACCT TCACGTTCAG ACTGCGCGAC GACGTCAGCT GGAGTGACGG CACGCCGTTC
ACCGCCAGAG ACGTCGTCTT CACCGCGACG CAGGCGGCGC AGTTCGGGCC GGAAGCGTAC
GTCGGCTACC AGCCGACGCA GTGGCGCGAC ATCGAGGGCG GCGCCGAGAT CGAGGGCACC
AGCAGACCGC TGCGCGGCAT CAGAGCGCTC GACGAGCACA CCGTCGAGAT CAGACTCGCG
AAGCCGAACG CCGAGTACGT CCGCAACCTC ACCGACGCGG TCTACTCGAT CATGCCCGAG
CACCTGCTCG CCGACGCGAC CGCGGCGGAC GTCAGAAGAA CCGCCTTCGC GACGAGCAGA
CCGGTCGGAA CGGGCCCGTA CACGCTGACG CGGATCGCAC CGAACCAGTA CTACGAGTTC
GCCGCCAACG ACGGCTACTT CGGCGGCGCG CCGAAGATCG GGACGCTCTT CTTCAAGCTC
GACGTCAAGC CCGAGTCTGC CGTCGCGCAG CTCGAGTCCG GCGAGCTGCA GCTCGTGATC
AACGCGTCGC CGAACGACGA GTCGCGGCTG ACGCGCGTCG ACGGGCTCAG AAACGAGTAC
GTCGTCTCGC CGGCGGTGCA GATGCTGCAG TTCCGTACCG ACCACCCGCA GGCGAGAGAC
GCGCGCGTGC GGCAGGCGAT CTACTCCGCG ATCGACCGCC GCGCGATGCT CAGAAGCCTC
TTCGGCGACC ACGGCGAGAT CCGCTGGGTG CTGCCCGGCT TCGACCAGGA GGACCCCGCG
CTCGATCGTT ACGAGCACGA CCCGCAGAAG GCGAGAGCGC TGCTCGCGGA GGCCGGCTTC
GACGGCGACG CGCCGTTCAA GATCGCCTAC GCGACCGACG TCGACCCGCT CTGGAGACAG
ATGACGCCGG TGATCCAGAA GAACCTGCAG GACGTCGGCA TCAACGCCGT GCTGGAACCG
CTCGACGCGG CCAAGTGGTC GGCCGCGAAC GTCGACAGAA ACCCGCAGAC CCCGGTCACG
CTCAACTCGG GTGGCGCGAT GGGGCTCTCG CCCGACCGCA GCTCGGTCTA CTACAACTGC
AGAGCGCCGC TCTCGTCGTT CTACGCCAAC TGCGACCTCG ACGCGCTCTA CGTGCAGGCG
CGCGGCGAGG CCGATCCGGA GAGACGCGCG CAGCTGTACG CGAGAGCGGC GCAGATCCTC
AACAGAGACG TGCCGCAGGC CGCGCTGTGG CAGACCGCGA ACTTCCACGC CTACAGCGAC
AAGCTCGGCG GGACGTTCGC GATCTTCCCG AACGACCGCG ACAGCGCGTT CGAGATCGCC
GGCTGGACGC TCGGCTAG
 
Protein sequence
MGSDALRRSR RVLAPLAAVL AALLVLAGCG GGGSDLPDGV TQSGDASTAA AADGGAGGGS 
AGRLAFTRLG INTPGFGPWN QSTGNDAIVN SLLFSNLVKV RSDERTLAPD LAESWEASSD
QRTFTFRLRD DVSWSDGTPF TARDVVFTAT QAAQFGPEAY VGYQPTQWRD IEGGAEIEGT
SRPLRGIRAL DEHTVEIRLA KPNAEYVRNL TDAVYSIMPE HLLADATAAD VRRTAFATSR
PVGTGPYTLT RIAPNQYYEF AANDGYFGGA PKIGTLFFKL DVKPESAVAQ LESGELQLVI
NASPNDESRL TRVDGLRNEY VVSPAVQMLQ FRTDHPQARD ARVRQAIYSA IDRRAMLRSL
FGDHGEIRWV LPGFDQEDPA LDRYEHDPQK ARALLAEAGF DGDAPFKIAY ATDVDPLWRQ
MTPVIQKNLQ DVGINAVLEP LDAAKWSAAN VDRNPQTPVT LNSGGAMGLS PDRSSVYYNC
RAPLSSFYAN CDLDALYVQA RGEADPERRA QLYARAAQIL NRDVPQAALW QTANFHAYSD
KLGGTFAIFP NDRDSAFEIA GWTLG