Gene Cwoe_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4044 
Symbol 
ID8734505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4295535 
End bp4296551 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content74% 
IMG OID646504672 
Productoxidoreductase domain protein 
Protein accessionYP_003395836 
Protein GI284045496 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.568923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.775986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAA GACGTGTCAT CGGTGTCGGG GTCATCGGCC TCGGCGAGAT CGGGCAGTTC 
CACCTGCGCG GCTACGAGCG CTCCCCCGGG GCGCGCGTGG CGGCGGTGAC CGACCTGAGC
GGGGAGCTGC TGGCGCGCAC CGCGGCGGCG ACCGGCGCCA CGGCGCACTC GTCGATCGGC
GCGCTGCTCG CCGATCCCGC CGTCGAGGTC GTCTCGGTCT GCCTCCCCCA CCACCTCCAC
CTGCCGGTCG CGCTGCAGGC GATCGCCGCC GGCAAGCACC TGCTCGTCGA GAAGCCGCTC
GCGCTGACCG TCGCCGAGTG CGACGAGATC GTCGCAGCGG CCGAGGCCGC GGGCGTGACC
GTCGGCGTGC AGCACAACCA GCTGTTCCAC GGCCCGCACG TGCGGGCGCA GGAGCTGATC
GACTCGGGCG CGATCGGTAG ACCGGTCCAC ATCCGGCTGC GGCTCGGGAT CGGCGGCAAG
CTCGACGGCT GGCGCGCCGA CCCGAAGGTC GTCGGCGGCG GGCTGCTGTT CGACGCCGGC
GTGCACCGCT TCTACATGGC GCGCAAGCTG TTCGGCGAGG TCGCCGAAGT GCGTGCGCTG
GTCGACCGCG GCCTGGACGT CGGCGAGGAC CAGGCGGTCG TGACGCTCCG CTTCGAGAAC
GGCGCGCTCG GCGTGATCGA CGCCAACTAC CACTGCCCGC CGGGCGCGTT CGACGACGCG
ATCGAGATCG TCGGCAGCGA CGGGATGCTC TACCTGTCCG GCTGCGAAGC CGAGTTCGAG
GGCTTCCGCA CCGGTCCGGC GCTGCGCGTC TACGACGGCT CCTGGCGCGA CGAGCGCGTC
CCGCAGGGCG ACTGGGCCGA CTCGGTCGCC GCCTCGATCG ACGCGTTCGT CGTCGCGCTG
GCGGCCGGCG AGCCGCCGCC GGTGACGGCC GCCGAGGGCC GGCGGATCGT CGAGCTGATC
CATCAGGCGT ACACGTCGGC CGCCGCCGGT GACCCCGGCG GCGCGGGAGC GACGTGA
 
Protein sequence
MAERRVIGVG VIGLGEIGQF HLRGYERSPG ARVAAVTDLS GELLARTAAA TGATAHSSIG 
ALLADPAVEV VSVCLPHHLH LPVALQAIAA GKHLLVEKPL ALTVAECDEI VAAAEAAGVT
VGVQHNQLFH GPHVRAQELI DSGAIGRPVH IRLRLGIGGK LDGWRADPKV VGGGLLFDAG
VHRFYMARKL FGEVAEVRAL VDRGLDVGED QAVVTLRFEN GALGVIDANY HCPPGAFDDA
IEIVGSDGML YLSGCEAEFE GFRTGPALRV YDGSWRDERV PQGDWADSVA ASIDAFVVAL
AAGEPPPVTA AEGRRIVELI HQAYTSAAAG DPGGAGAT