Gene Cwoe_5059 details


Gene Information

Locus tag: Cwoe_5059
Symbol:
ID: 8735525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 5406948
End bp: 5410166
Gene Length: 3219 bp
Protein Length: 1072 aa
Translation table: 11
GC content: 76%
IMG OID: 646505686
Product: hypothetical protein
Protein accession: YP_003396845
Protein GI: 284046505
COG category:
COG ID:
TIGRFAM ID:
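
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates, an unspliced CDS, and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5406948, 5410166)
print(length)                     # 3219
print(protein_length_aa(length))  # 1072
```

Both values match the Gene Length and Protein Length fields listed for Cwoe_5059.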


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCGCGG CTCACGAGCG CGCCCGCGCC GACGTCTTCT TCGCCAGCCC CGCACTTGCG 
GGCGCGTTCG CGGGCTCCGA GCTGGAGGCG GCGGCGCGCG CCGAGGCCGC CGGCCAGCCG
GCGCTGATCT GGGTCGCTGC GGGCCCGTCG CCGCGCGTGA CCGTGCTCGC GCATGACGGC
GAGGGCCACG CCGCGCGCGC CGTGGAGCTG CCGGGCGGGC TGACGCTCCC GGCTGGCACC
CGCTTCGAGG CGGCGCTCGA CGGGCCCGAC GCGGGCATCC TGCGGTTCGA CGCCGCGAAC
GCGCTCGACC CGTGGATCCG CTACTCGTAC GAGACCGGCG AGACCGGCGA GACCGGCGAG
ACCGGCGGCG GCGCTGGCGC GGCGCCGGCG TTCGGCAGCC TCGCCTTCCG GCTGCTCGAC
GCGCCCGCCT CCGCCGCCGT GCGGATCGAT CCCGCGCGGC CGCTCGATCG CGAGCGCACC
GTCGCGGCCG CGACGGCGGC GCCGGTCCGG TCACGCTTCC GCACGCGCTG GGGAGCGACC
GTGACGTTCG CCGCGCGACC GCCCGACTCG CGCTACGTGC TCGCGTTCGA CCCGGCGCGC
GACGAGCTGT ACTGGACGCT CGACGGCACC TGGGGCTGGG GCGTCGCGGA CTCGCACGGC
GGCGAGCTGG AGCTGATGCC CGGCGTCTCC GGCGCCGAGT ACGTGACGGC GCCGGACGGC
TACGTCGTCC GCTTCGTCGC GGGCGCGCCT GCCTTCGCGA GCGGGTTCCG GCCCGACGCG
CCGAGCGATC GCGCGGTGGG CGGCGCCGCG CCGGGCGCCG CGCGCGGCGC CGCGCCCCTC
CCGCTGGTCG CGAGCGCCCC CGGCGTGAGC GACCCGGTCA CGACCGCGTG GGCGTACGTG
GAGCCGGCCC CAGCCGCGTC CGCACCCGGC CCGAGCGCCG GCTACTTCTC CCAGCCCGAC
CGCGCCGGGC TCTTCACGGC GGGCGGCGGG CCGTTCCTCG ACGCGCTCCC GCTGCGCGCC
GCCGAGCTGC CGGCCGACGC GAGGACGCCG TACGTCGAGC GGGCGAGCTT TCCCGCCGCT
CCGTGGGCCG GGCTCGACGC GGGGCCGGGC CGGTCGATCG AGCAGCGGTT CGAGGCGGAG
GTGCTCGTCG CCGCCCGCGC GCTCGTGATC GCGAGCCTGG AGGGGGGAGC GGCGCTCGCC
GCGGCCCCGC CCGCGGGCCC GTCCCCGATC AGCGCGGTCA CCCCCCAGGG CCTGCTCGCG
AGCTTCTCGC GCGAGAGCGG CGCGTGGGAG AAGCTCGTGC TCGCCCAGGC CGGCGGCGGC
GCCCAGCAGC TCGCGCTGAC CGACGTGCGC GGCAAGCTGC GCGAGGCGCT GCTCGCCAAC
CGCCTCTTCC TCGTCGTCTC CGACGTGGAG CGGCTGTTCG AGCACTGCTC GACGACATTC
TGCGTCGTGC AGCGCACGCT GGCGCTCGCG CAGGCGGCGG CGGTGCCCGC TCCGGTGCTC
GCGCGCGTCG CGCAGGCGCT GCTCGGCTCC GTCTTCGCGA CCAGAGCACG CTTCGAGGAC
GCGCTCAGAC CGATCCTCGG GGCCGACTAC GCGGCGTTCG GCAGAACGTT CGTCAGATAC
GCCGAGCAGG CGCAGCTGGA GATCGCCGGC TGGAGCTTCG ACCTCAGCTC GTGGCGCTGG
CGCGACGAGA ACGCGCCGAC GATCCTGATC GTCAAGTTCA CCGACGGCGA CCTCGCCTCG
CTGATCGCCG ACCGCTCGCG CTGGACGATG GGCGCCGAGT TCAACGCCGG CGACGGCGCC
GACGCGCAGC GGAAGCTGCT GGAGATCGTC GCCGACGCGG AGCGGCGCGC GCCCACCGAG
CCGGAGCTGC GGTACTTCGC CGACACGGTG CTCGCGGGCA AGCCGACGTC CGGCGGCTGG
AACGGCGTCC TGTACCTCAA CCCCGTCGTC CCGACGAGCA CGCTGCCGCC CGAGCTGGCC
GCGCTCGCCG CCGGCCTGCC CACCGGCGAG GTGCGCGCCC ACCACCTCGG CGTCACGCTC
ACCTCGTTCG CGCTCGCCGG CCGCGCGATC GCGCTGGAGG ACTCGTCGCT GTTCGGGCTG
ATCCTCTACG GCGACGCGAC CGACCTCGTC TACAGCGGCG ACGCGTACGA CTTCAAGGTC
ACGTCGCTGC GCGTGCGCTT CGCCAACTCG GCGATCGCGT CGTTCTCCAG CGAGGTCGAG
CTGCTCGTCG GCCAGCTGTT CAACGAGCTG TCGAGCCTGC CGGGGAGCCT GCGCGGCGAC
AACCTGCTGT TCGAGGGGAC GTTGCAGAGA GGCGCCTACC GCTTCACGAC CACGACGCAG
AGCCGCTTCA CGATCGCGAG CCAGGTGCTC GACTTCGCCG AGGTCGCCGA CGCCGTCTTC
GTTACCGTCA CCGGCGAGAG CGACGCGACG CGGACCGTCG CCCGCTTCAT CCTCGACGGC
ACGCTCGGCT TCCGCGGCTT CGGCGACGTC GACCTGTTCG GCTACGGCGC GTCCGGCCGC
GGCGACGGCT CCGGCCTCGC CTACTCGGGG CTGATGGTCT CGATGCGGTT CGACCCGCGC
GACCCGCGCG CGACGCGCAG ACTGGAGTTC GTCGCCGGGC AGACGACGTT CGACCTCGCG
CGCAGCGCGG CCCGTCCCGG CAGCCTCCCG CGCCGCTTCC CGCTCGTCCC CGCGGCGCTG
CGCCAGAGCG AGCGCTCCAG GTCCGGCTCC GGCGACGGGC GCGAGCGCGC GGCGACCCCG
GCCGAGCTGG GGTTCCTGCC GGTCGACGCG CCGCTGCCGA CCGGCGCGCT CGGCGACGTC
TGGTTCGGGC TGGAGCTGAC GCTCAGCTTC GGCAGCCCCG GCGCGCTCGC GCCGCAGCTC
GGCTTCAGCG GCGCGCTGCT GCTCGCGTGG GCGCCGAGCG CGGAGGCGCC GAACGTCGCC
GTCGGGCTGC GCCTGCCGGG ATCCGGCGGC GGCCGCGCGC TCACGATCAT GGGGCCGCTG
AGACTGTCGG TCGGCCGGCT GAGCCTGCTG CGCGACACGA GCACCGACGG CTACCTGCTG
CGGCTCGCGT CGGTCGCGCT GAGCTTCCTC GGGCTCAGCT TCCCAGCGAG CGGGAGAGCG
AACGCGCTGC TGTTCGGCGA CCCGGACCCG AACGCCGCCA ACAGCGCGCT CGGCTGGTAC
GCCGCGTACG ACAAGGAGCC GGCGTGCCCG ACAGACTGA
 
Protein sequence
MIAAHERARA DVFFASPALA GAFAGSELEA AARAEAAGQP ALIWVAAGPS PRVTVLAHDG 
EGHAARAVEL PGGLTLPAGT RFEAALDGPD AGILRFDAAN ALDPWIRYSY ETGETGETGE
TGGGAGAAPA FGSLAFRLLD APASAAVRID PARPLDRERT VAAATAAPVR SRFRTRWGAT
VTFAARPPDS RYVLAFDPAR DELYWTLDGT WGWGVADSHG GELELMPGVS GAEYVTAPDG
YVVRFVAGAP AFASGFRPDA PSDRAVGGAA PGAARGAAPL PLVASAPGVS DPVTTAWAYV
EPAPAASAPG PSAGYFSQPD RAGLFTAGGG PFLDALPLRA AELPADARTP YVERASFPAA
PWAGLDAGPG RSIEQRFEAE VLVAARALVI ASLEGGAALA AAPPAGPSPI SAVTPQGLLA
SFSRESGAWE KLVLAQAGGG AQQLALTDVR GKLREALLAN RLFLVVSDVE RLFEHCSTTF
CVVQRTLALA QAAAVPAPVL ARVAQALLGS VFATRARFED ALRPILGADY AAFGRTFVRY
AEQAQLEIAG WSFDLSSWRW RDENAPTILI VKFTDGDLAS LIADRSRWTM GAEFNAGDGA
DAQRKLLEIV ADAERRAPTE PELRYFADTV LAGKPTSGGW NGVLYLNPVV PTSTLPPELA
ALAAGLPTGE VRAHHLGVTL TSFALAGRAI ALEDSSLFGL ILYGDATDLV YSGDAYDFKV
TSLRVRFANS AIASFSSEVE LLVGQLFNEL SSLPGSLRGD NLLFEGTLQR GAYRFTTTTQ
SRFTIASQVL DFAEVADAVF VTVTGESDAT RTVARFILDG TLGFRGFGDV DLFGYGASGR
GDGSGLAYSG LMVSMRFDPR DPRATRRLEF VAGQTTFDLA RSAARPGSLP RRFPLVPAAL
RQSERSRSGS GDGRERAATP AELGFLPVDA PLPTGALGDV WFGLELTLSF GSPGALAPQL
GFSGALLLAW APSAEAPNVA VGLRLPGSGG GRALTIMGPL RLSVGRLSLL RDTSTDGYLL
RLASVALSFL GLSFPASGRA NALLFGDPDP NAANSALGWY AAYDKEPACP TD
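
The gene sequence starts with GTG while the protein starts with M, which is expected under translation table 11, where GTG can act as an initiation codon. The following is a minimal sketch of how the space-grouped sequence blocks above could be cleaned, translated, and checked for GC content. It does not use any particular library; the helper names are hypothetical, and the start-codon set is a deliberately small subset of the initiators allowed by table 11.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only a subset of the table 11 initiation codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiating start codon as M and stopping at the first stop."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        aa = "M" if pos == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence as listed above.
print(translate_cds("GTGATCGCGG CTCACGAGCG C"))  # MIAAHER
```

Passing the full gene sequence to `translate_cds` and comparing the result with the listed protein sequence (after the same whitespace cleanup) is one way to confirm that the two blocks agree.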