Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_4287 |
Symbol | |
ID | 8734749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4563041 |
End bp | 4564984 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646504913 |
Product | Ricin B lectin |
Protein accession | YP_003396076 |
Protein GI | 284045736 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5520] O-Glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.718232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.560117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGTC CGAGGAGAGA GATCGTGTCC AGCAGGCCCA GGAGCGCCTG GCGCTCAGTC GTGCTCGCAG CCGGTGCCGC GATGGCGCTG GCGGTCGTCC CGTCGTCCGC GCTGGCGGTC GGCGAGACGC TCAACGTCTG GCAGACCGGC GGCAGCGGAT CGAGCGGCTT GACGCCGCAG ACCGACGTCA GACTCGGCAG TCCCGCGCGC GGGACGTACA ACGTCGAGGT CGACGACAGC AAGGTGACGC AGACGGTCGA CGGCGGCTTC GGCGCCTCCT TCACCGACTC CTCCGCGTAC CTGCTGATGA GACTGAAGGC GTCGAACCCG AGCGGCTACT CGACGCTGAT GAACAAGATC TTCGGGACGG CCGCCGCGGA CGGCATCGGG CTGCGCTTCT GGCGCGTGCC GATGACCTCC TCGGACTTCA CGGCCGCGAG ATCGCACTGG ACCTCGCGCG ACAGCTCCGG CTCGCCGTTC GCGCTCAGCG CGCAGGACAC CGGCCGCATC ATCCCGGTGA TCAGAGACGC GCTCGCGATC AACCCGAACC TGCGGATCAT GGCGAGCCCG TGGAGCGCGC CCGGCTGGAT GAAGTCCAAC GGCTCGATGA TCTGCAACAC CGGCTCGGGC AACAGCACGC TGCTGCCGGC GCACTACCAG GACTGGGCGG ACTACTTCGT CAGCTGGATC AGAGCGTACG AGTCCAACGG GATCCCGATC TGGGGCGTCA GCGCGCAGAA CGAGCCCGGC TACTGCCCGA ACAACTACCC CGGCATGACG TGGACGCCGG CGGCCGAGGG CGCCTGGGTC GCCAACTACC TGCGCCCGTC GCTGACGAGA GCGGGCCTCA GCAAGCAGAT CCTCGGCTTC GACCACAACT GGGAGTTCTT CGCCAATCCG CTCGCGCTGA TGAACGGCAG CGCGGCGAGC TTCGACGGAC TCGCGTGGCA CTGCTACGAC AACCCCAGCG ACCCGGCCGC GATGACGAAG CTGCGGAACC TGTTCCCGAC CAAGAGCGTC TACGAGACCG AGTGCTCGTC GGACACGACC CCGACGGACA TCATCAGATA CGGCACGGCC GACATGACGC TGAAGTCGGT GCAGAACTGG GCGCAGGGCG TGATCACCTG GAACCTCGCG CTCGACCAGA CGGGCGGCCC GCAGCTCGGC GGCTGCGTCG GCTGCGTCGG GCTGATCACG ATCGACTCCG CGACGAGCGC GGTGACGCTG CGCAACAACT ACTTCCAGCT CGGTCAGATC TCGAAGTTCG TCGCGCCCGG CGCGAGACAC ATCGCATCGA CCGTCGACGC GCACGGGATC GTCACCGCCG CGTTCAAGAA CCCCGACGGC CAGGAGGTGC TCGTCGCCCA CAACACGAAC GCGACGTCGA CCTCGTTCAC CGTCAGCTGG AACGGCAGAG GCTCGTTCAA CTACACGCTC CCGTCGCGCG GCACGGTGAC CTTCCGCGGC ACGGTGCCGG CGGCGACCTC GCTGCCGGCG ACGCCCGCCG CGGGGCGGAC GTTCAAGTTC GTCAGCCGCA CGAGCGGCAA GCCGTTCGGC GTCTCCGGCG CCTCGACGGC GAACGGCGCG AGAGTCGTCC AGTGGCTCGA CAACAGCGAC TGGGACCAGC AGTGGAGACT CGTCGACGCC GGCGGCGGCT ACTGGAACCT GATCAACCGC AACAGCGGCC TCGGGCTCGA CAACGGCGGC ACCTCGACCG ACGGCGTGCA GATGCAGCAG TGGGCGGTCG TCGGGACCGG CAACTTCAAC CAGCAGTGGC AGATCACGGC GGCCAGCTCC GGCTATTACC GGATCGTCAA CCGCACGAGC GGCAAGTCGC TCGACGTCAG AGACGGCAGC CTCGCCGAGG ACGCCGCGAT CCAGCAGTGG ACGACGTACA GCGGTGAGCC CAACCAGGAG TTCCAGCTCG TCCCGACGAG CTGA
|
Protein sequence | MSSPRREIVS SRPRSAWRSV VLAAGAAMAL AVVPSSALAV GETLNVWQTG GSGSSGLTPQ TDVRLGSPAR GTYNVEVDDS KVTQTVDGGF GASFTDSSAY LLMRLKASNP SGYSTLMNKI FGTAAADGIG LRFWRVPMTS SDFTAARSHW TSRDSSGSPF ALSAQDTGRI IPVIRDALAI NPNLRIMASP WSAPGWMKSN GSMICNTGSG NSTLLPAHYQ DWADYFVSWI RAYESNGIPI WGVSAQNEPG YCPNNYPGMT WTPAAEGAWV ANYLRPSLTR AGLSKQILGF DHNWEFFANP LALMNGSAAS FDGLAWHCYD NPSDPAAMTK LRNLFPTKSV YETECSSDTT PTDIIRYGTA DMTLKSVQNW AQGVITWNLA LDQTGGPQLG GCVGCVGLIT IDSATSAVTL RNNYFQLGQI SKFVAPGARH IASTVDAHGI VTAAFKNPDG QEVLVAHNTN ATSTSFTVSW NGRGSFNYTL PSRGTVTFRG TVPAATSLPA TPAAGRTFKF VSRTSGKPFG VSGASTANGA RVVQWLDNSD WDQQWRLVDA GGGYWNLINR NSGLGLDNGG TSTDGVQMQQ WAVVGTGNFN QQWQITAASS GYYRIVNRTS GKSLDVRDGS LAEDAAIQQW TTYSGEPNQE FQLVPTS
|
| |