Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1623 |
Symbol | |
ID | 8732063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1709334 |
End bp | 1710815 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646502241 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003393426 |
Protein GI | 284043086 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGGA TCCTCGCCAT CGGACTGATC GTGGTCGCGG CCCTCGCCGT CGTGGTCTTC GGCACGGGTG CAGCGTCTGA CGACAGCTCC TACAAGGTGC GCGCGATCTT CGACAACGCC GGCTTCCTCG TCTCCGGCGA GGACGTGAAG GCCTCCGGCG TCGTGATCGG ATCGATCGAC TCGCTCGAGG TGACGCCGGA CAAGAAGGCC GCGGTGATCC TCAACATCAC CGATCCGGCG TTCAAGAACT TCAAGCAGGA CGCGAGATGC GCCGTGCGGC TGCAGTCGCT GCTCGGCGAG AAGTACGTCT CCTGCATACC GACCCAGCCG AAGAACCCGG GTGACAGACC GTCGCCGCCG CTCAGAAAGA TCGAGGACGG GGCGGGCGAG GGCCAGTACC TGCTGCCGGT GTCGCACACC TCCTCGCCGG TCGACCTCGA CATGCTCAAC AACGTGATGC GGCTGCCCGA GCGGCAGCGC TTCTCGCTGA TCCTGAACGA GTTCGGCACC GGTCTCGCGG GGAGCGGCGA CGAGCTGAGA GCCGTCATCC GCCGCGCCAA CCCCGCGCTC GACGAGTTCG ACAAGGTCCT CAGAATCCTC GCGGACCAGA ACAGAGTCCT CGCGAAGCTG GCCGAGGACG GCGACGTCGC CGTCGGGCCG CTCGCACGCG AGGCCGACGC GATCAGCAAC TTCATCGACA AGGCCGGCAA GACCGCCGAG GCGACCGCCG AGCGCGGCGA CGACCTCGAA CGCAACTTCG CGCTGTTCCC GGAGTTCCTG CGCCAGCTCA ACCCGACGAT GGCGCAGCTG GAGAACTTCT CCAAGTCCGC CACGCCGGTC TTCACCGACC TGCGGGCCGC CGCACCGTCG ATCAACAAGA TCTTCGAGCA GCTCGGCCCG TTTAGCAGAG CCGCGCTGCC GACGCTGCGC ACCTTCGGCG ACGCCGCCGA GATCAGCAGA AGAGCCCTGA TCGCCGCCAG ACCCGTCATC CAGGACATCG ACCAGCTCGC CAGAGCCACC GGCCCCCTCG CCAGAAACCT CGCGGTCGGC CTCAGCGACC TGGAGAGACA GCGCGGCATA GACCGGTTCA TGCGGACGGT GTACGGCTTC ACCGGCGCGC TGAACGGCTT CGACAGCATC GGGCACTATC TGCGGACGCA CGTCATCTTC GAGGGCCAGT GCCTCAGATA CTTCACTGTG ACGAGCGGTT GCGACTCCAA CTTCCGGGTC AGACAGATCG GCGAGGAAGA CGCAACAGCG AGCGCGGCCA CTTCGGACGC TCCCGCTCCG GAGAACAAGC GGTCCTCCGA CGACATGCGC CTGCCGCAGA TCACGCTGCC GGCCGCCAAG CCCGACGAGT CGAGCTCGTC CTCCACCACC GCTGACGAGG CGGTCGCCGG GCAGGACACG ACAGCCAACT CGCAAGAGGA CCCACGCGCC GGCGTCCTCG GCTACCTGCT CGGAAGCGAG TCCGTGCGAT GA
|
Protein sequence | MKRILAIGLI VVAALAVVVF GTGAASDDSS YKVRAIFDNA GFLVSGEDVK ASGVVIGSID SLEVTPDKKA AVILNITDPA FKNFKQDARC AVRLQSLLGE KYVSCIPTQP KNPGDRPSPP LRKIEDGAGE GQYLLPVSHT SSPVDLDMLN NVMRLPERQR FSLILNEFGT GLAGSGDELR AVIRRANPAL DEFDKVLRIL ADQNRVLAKL AEDGDVAVGP LAREADAISN FIDKAGKTAE ATAERGDDLE RNFALFPEFL RQLNPTMAQL ENFSKSATPV FTDLRAAAPS INKIFEQLGP FSRAALPTLR TFGDAAEISR RALIAARPVI QDIDQLARAT GPLARNLAVG LSDLERQRGI DRFMRTVYGF TGALNGFDSI GHYLRTHVIF EGQCLRYFTV TSGCDSNFRV RQIGEEDATA SAATSDAPAP ENKRSSDDMR LPQITLPAAK PDESSSSSTT ADEAVAGQDT TANSQEDPRA GVLGYLLGSE SVR
|
| |