Gene Cwoe_1625 details


Gene Information

Locus tag: Cwoe_1625
Symbol:
ID: 8732065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 1712260
End bp: 1713939
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 70%
IMG OID: 646502243
Product: Mammalian cell entry related domain protein
Protein accession: YP_003393428
Protein GI: 284043088
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
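
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1712260
end_bp = 1713939

# Inclusive coordinates: both the first and the last base are counted.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.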


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.164573
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCCTC CGAAGAAGAG AGACAGCCGC GCTGTGCCGC CCACCAAGCG CAAGCATTGG 
AGCCGCTTCC GCGTCGGCCT GATCGCGATC GTCGTGCTGA TCATCCCGGT CTACCTGGCG
TTCACGAAGG ACATTCCGTT CACCAGCGGC TACCGCGTGA CGGCGGTGTT CGAGTCGGCC
AACAACCTGC GCGCCGGCTC GCCCGTGCGC ATCGCGGGCG TGAACGTCGG CAGAGTCAAG
TCGGTCGCCC GCTACAAGGA CACCAACCTG TCGCAGGTCG AGATGGAGAT CAGCGAGGAC
GGCCTGCCGA TCCACGAGGA CGCGACGCTC AAGATCCGCC CGCGCATCTT CCTCGAGGGC
AACTTCTTCG TCGACCTCAG ACCCGGCACG CCCGGCTCGC CCGACGTGCC CGACGGCGGC
ACGATCGGCG TCACGCAGAC GTCGACCCCC GTCCAGCTCG ACCAGCTGCT GACCGCACTC
CAGTCGGACT CGCGCGAGGA CCTTCAGCAC GTGCTCGAGG AGTACGGCGC GGCGCTCAAC
TCGAAGCCGA CCCCGGAGCA GGACGCCGAG CTGCCCGAGT CCGTGCGCGG CCTCACCGGC
GCACAGGGCC TCAACAACGC GGCGGCGCCC GGGGCGAGAG CGCTGCGGAA CGCGACGATC
GTCAACGACG CGATCCGCGG CGAGAAGCCG GGCGACCTCG CCAAGACGAT CGCCAGCGTC
GCACGGTTGT CGAGAACGCT CGAAAGCCGC GAGGGACAGT TGCAGGACCT GATCGTGAAC
TTCAACCTCA CCGCCAGCGC CTTCGCCAAC CAGAGCGGCG CGCTGAGCGA GACGATCCGC
CTGCTCGGGC CGACGCTCGC GACCGCGAGA AGCGCGCTGC GCAGCGTCGA CGCCGCGCTG
CCGTCGACGC GCGCATGGGC GCGCGAGATC CTGCCCGGCG TGCGCGAGAC GGCGGCGACC
GTCAACGCGT CCTTCCCGTG GATAGAGCAG ACGCGCGCGC TGCTCGGCCC GGACGAGCTG
CAAGGGCTGA TGGCTGAGCT GACCCCCGCC ACGAAGGACC TCGCGAGACT CACGAACGCG
TCGATCAGAC TGCTGCCGGA GATCGACGAC TTCTCGCAGT GCTTCGCGAA GGTCATTCTC
CCGACGGGCA ACGTCGGCCT CGAGGACGGC GCGCTCACGA ACCGCCGCTC GGACGGCAGC
ATCGTCGAGA GCTACAAGGA GTTCTGGTAC GGCCTCGTCG GCCTGACGAG CGCCGGGCAG
GGCTTCGACG GCAACGGCGC CTACCTCCGA GCCACCGCGG CTGGAGGCCA GTGGAACGTC
GCTCCCGGGA TCTCGCGGTA TGCCGCGGGT GGAACCGTCG AGAAGACGCT GACGGGGCTC
GCGACGCAGA GACCGCTCGG CACGCGGCCG CTCTATTCGG CGAGATCCCC TGCGATCAAG
ACCGACGTGC CGTGCCGGAG CAACCCGGTT CCGGACCTCA ACGGCCCGCA GGCCGGCCCA
GGTGCGGCAC CGAGAAGCAT CCAGGTGCCG ACGCCGCCGC CGGTCGAGAG AAGAGTGGAG
ACGCCCACGA CCCCGCCCGC CAGAACGGCC GCATCAGACG ACACCTCGGC CAGAACGGCG
TCGGTCGGCT CTGAGCTGCT CTCGCGCCTC AGCCCGCTCG CGAACGGGGG CGGCAGATGA
 
Protein sequence
MSPPKKRDSR AVPPTKRKHW SRFRVGLIAI VVLIIPVYLA FTKDIPFTSG YRVTAVFESA 
NNLRAGSPVR IAGVNVGRVK SVARYKDTNL SQVEMEISED GLPIHEDATL KIRPRIFLEG
NFFVDLRPGT PGSPDVPDGG TIGVTQTSTP VQLDQLLTAL QSDSREDLQH VLEEYGAALN
SKPTPEQDAE LPESVRGLTG AQGLNNAAAP GARALRNATI VNDAIRGEKP GDLAKTIASV
ARLSRTLESR EGQLQDLIVN FNLTASAFAN QSGALSETIR LLGPTLATAR SALRSVDAAL
PSTRAWAREI LPGVRETAAT VNASFPWIEQ TRALLGPDEL QGLMAELTPA TKDLARLTNA
SIRLLPEIDD FSQCFAKVIL PTGNVGLEDG ALTNRRSDGS IVESYKEFWY GLVGLTSAGQ
GFDGNGAYLR ATAAGGQWNV APGISRYAAG GTVEKTLTGL ATQRPLGTRP LYSARSPAIK
TDVPCRSNPV PDLNGPQAGP GAAPRSIQVP TPPPVERRVE TPTTPPARTA ASDDTSARTA
SVGSELLSRL SPLANGGGR
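
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The Python sketch below illustrates this on the first 18 bases only. The codon lookup is deliberately partial, covering just the codons in that prefix, and the function name `translate_prefix` is hypothetical.

```python
# Partial codon lookup: only the codons found in the first 18 bases of the gene.
CODON_TABLE = {"GTG": "V", "AGC": "S", "CCT": "P", "CCG": "P", "AAG": "K"}

# Start codons permitted by NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_prefix(dna: str) -> str:
    """Translate a short coding prefix, reading a valid start codon as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is translated as methionine
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# First 18 bases of the gene sequence, spacing as shown in the record.
prefix = translate_prefix("GTGAGCCCTC CGAAGAAG")
print(prefix)
```

The result agrees with the first six residues of the listed protein sequence (MSPPKK). Translating the full gene would need the complete codon table, for example through an established library rather than this partial lookup.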