Gene Cwoe_1623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1623 
Symbol 
ID8732063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1709334 
End bp1710815 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content67% 
IMG OID646502241 
ProductMammalian cell entry related domain protein 
Protein accessionYP_003393426 
Protein GI284043086 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGGA TCCTCGCCAT CGGACTGATC GTGGTCGCGG CCCTCGCCGT CGTGGTCTTC 
GGCACGGGTG CAGCGTCTGA CGACAGCTCC TACAAGGTGC GCGCGATCTT CGACAACGCC
GGCTTCCTCG TCTCCGGCGA GGACGTGAAG GCCTCCGGCG TCGTGATCGG ATCGATCGAC
TCGCTCGAGG TGACGCCGGA CAAGAAGGCC GCGGTGATCC TCAACATCAC CGATCCGGCG
TTCAAGAACT TCAAGCAGGA CGCGAGATGC GCCGTGCGGC TGCAGTCGCT GCTCGGCGAG
AAGTACGTCT CCTGCATACC GACCCAGCCG AAGAACCCGG GTGACAGACC GTCGCCGCCG
CTCAGAAAGA TCGAGGACGG GGCGGGCGAG GGCCAGTACC TGCTGCCGGT GTCGCACACC
TCCTCGCCGG TCGACCTCGA CATGCTCAAC AACGTGATGC GGCTGCCCGA GCGGCAGCGC
TTCTCGCTGA TCCTGAACGA GTTCGGCACC GGTCTCGCGG GGAGCGGCGA CGAGCTGAGA
GCCGTCATCC GCCGCGCCAA CCCCGCGCTC GACGAGTTCG ACAAGGTCCT CAGAATCCTC
GCGGACCAGA ACAGAGTCCT CGCGAAGCTG GCCGAGGACG GCGACGTCGC CGTCGGGCCG
CTCGCACGCG AGGCCGACGC GATCAGCAAC TTCATCGACA AGGCCGGCAA GACCGCCGAG
GCGACCGCCG AGCGCGGCGA CGACCTCGAA CGCAACTTCG CGCTGTTCCC GGAGTTCCTG
CGCCAGCTCA ACCCGACGAT GGCGCAGCTG GAGAACTTCT CCAAGTCCGC CACGCCGGTC
TTCACCGACC TGCGGGCCGC CGCACCGTCG ATCAACAAGA TCTTCGAGCA GCTCGGCCCG
TTTAGCAGAG CCGCGCTGCC GACGCTGCGC ACCTTCGGCG ACGCCGCCGA GATCAGCAGA
AGAGCCCTGA TCGCCGCCAG ACCCGTCATC CAGGACATCG ACCAGCTCGC CAGAGCCACC
GGCCCCCTCG CCAGAAACCT CGCGGTCGGC CTCAGCGACC TGGAGAGACA GCGCGGCATA
GACCGGTTCA TGCGGACGGT GTACGGCTTC ACCGGCGCGC TGAACGGCTT CGACAGCATC
GGGCACTATC TGCGGACGCA CGTCATCTTC GAGGGCCAGT GCCTCAGATA CTTCACTGTG
ACGAGCGGTT GCGACTCCAA CTTCCGGGTC AGACAGATCG GCGAGGAAGA CGCAACAGCG
AGCGCGGCCA CTTCGGACGC TCCCGCTCCG GAGAACAAGC GGTCCTCCGA CGACATGCGC
CTGCCGCAGA TCACGCTGCC GGCCGCCAAG CCCGACGAGT CGAGCTCGTC CTCCACCACC
GCTGACGAGG CGGTCGCCGG GCAGGACACG ACAGCCAACT CGCAAGAGGA CCCACGCGCC
GGCGTCCTCG GCTACCTGCT CGGAAGCGAG TCCGTGCGAT GA
 
Protein sequence
MKRILAIGLI VVAALAVVVF GTGAASDDSS YKVRAIFDNA GFLVSGEDVK ASGVVIGSID 
SLEVTPDKKA AVILNITDPA FKNFKQDARC AVRLQSLLGE KYVSCIPTQP KNPGDRPSPP
LRKIEDGAGE GQYLLPVSHT SSPVDLDMLN NVMRLPERQR FSLILNEFGT GLAGSGDELR
AVIRRANPAL DEFDKVLRIL ADQNRVLAKL AEDGDVAVGP LAREADAISN FIDKAGKTAE
ATAERGDDLE RNFALFPEFL RQLNPTMAQL ENFSKSATPV FTDLRAAAPS INKIFEQLGP
FSRAALPTLR TFGDAAEISR RALIAARPVI QDIDQLARAT GPLARNLAVG LSDLERQRGI
DRFMRTVYGF TGALNGFDSI GHYLRTHVIF EGQCLRYFTV TSGCDSNFRV RQIGEEDATA
SAATSDAPAP ENKRSSDDMR LPQITLPAAK PDESSSSSTT ADEAVAGQDT TANSQEDPRA
GVLGYLLGSE SVR