Gene Cwoe_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2118 
Symbol 
ID8732561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2224042 
End bp2225193 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content69% 
IMG OID646502736 
Productprotein of unknown function DUF1100 hydrolase family protein 
Protein accessionYP_003393918 
Protein GI284043578 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCAAT ACTTTTCGAC GAACTACTGG TGGTCGCTGA TGGCGATGGG CGCGCTCCAG 
ACGGGCGGCG AGATCTCCGA GATCGACGAG ATCTGCCGCC CGCTCGAGAC GCTCAGCCGG
CGCGGCGTGG AGGACCACGG CGCGAACGAG GAGTGGTTCG CGGCGTGGAC GAAGATGGGC
CACCGGCTCG AGCGGCTGGC GAAGGCCGAC GCGGAGGCCG GCTTCCCGCT CGGGGCCGCG
CGCAAGCACT ACCGCGCGGC GCTGTACTTC AGCGTCGGCG AGTTCGCGCT CGATGCGCGC
GTCGTCGCGC GCGGGCGCGA GGCCTACGTG CGCGGGCTGG AGAACTTCCG CCGCGCGGTC
GAGCTGTCCG GCGACCCGGT CGAGCTGGTC GAGATCCCGT ACGAGGACAC GACGCTCGAC
GCGCTGTTCA TCCGCGCCGA GGGCGACGGG CCGGCGCCCT GCGTGATCCA CTTCGGCGGC
GCCGACTCGG TCAAGGAGCA CCTCTACCTG CGCTACAAGA ACGAGTTCAG CAAGCGGGGC
GTCTCGCTGC TGATCGTCGA CCACCCCGGC GTCGGCACGG CGATCCGCCT GAAGGGGCTG
CCGACCCGCG CCGACATCGA GGTCGCCGGC ACCGCCTCCG CCGACTACCT CGCGACGCGC
TCGGACGTCG ACATGGAGCG GCTCGGCATC TGCGCGATGA GCCAGGGCGG CTACTACGCG
CCGCGCATCG CCGCGCTCGA GAAGCGCATG AAGCTGTGCG TCGTGTGGGG CGCGATCTGG
GACATGGAGA CGATCGTCAA CGAGTACAAC TCGATCACGC GCGGCAAGCT CAAGCCGAAG
CAGCACCGCA CCTTCGGCGG GCTGGAGCCC GAGGAGGTGA TGGAGCGCAT CAAGCAGATG
ACGCTCGAGG GCCTCGCCGA CAAGATCGAG TGCCCGATCC TCGTCATCCA CGGCGAGAAC
GACCAGCAGG CGCCGCTGTG GCGTGCGCGC AAGACGTGCG ACGAGGCGAT CAACGCCCCG
CGGCGCGACC TGAAGGTCTT CACGGTCGAG GACGGCGGCG CCGAGCACTG CCAGGTCGAC
ATCATGACGA TGGCGACCGA CTACATCGTC GATTGGACGG CGCAGCGCTT CCGCGAGATG
GAGGCGGCCT GA
 
Protein sequence
MFQYFSTNYW WSLMAMGALQ TGGEISEIDE ICRPLETLSR RGVEDHGANE EWFAAWTKMG 
HRLERLAKAD AEAGFPLGAA RKHYRAALYF SVGEFALDAR VVARGREAYV RGLENFRRAV
ELSGDPVELV EIPYEDTTLD ALFIRAEGDG PAPCVIHFGG ADSVKEHLYL RYKNEFSKRG
VSLLIVDHPG VGTAIRLKGL PTRADIEVAG TASADYLATR SDVDMERLGI CAMSQGGYYA
PRIAALEKRM KLCVVWGAIW DMETIVNEYN SITRGKLKPK QHRTFGGLEP EEVMERIKQM
TLEGLADKIE CPILVIHGEN DQQAPLWRAR KTCDEAINAP RRDLKVFTVE DGGAEHCQVD
IMTMATDYIV DWTAQRFREM EAA