Gene Cwoe_4871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4871 
Symbol 
ID8735337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5195548 
End bp5198685 
Gene Length3138 bp 
Protein Length1045 aa 
Translation table11 
GC content74% 
IMG OID646505499 
Producthypothetical protein 
Protein accessionYP_003396658 
Protein GI284046318 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTGCGG CGGTACGGCT CCTCGCGCTC GGGCTCGCGC TTCTGCTTGC CCATACCGGC 
AGCGCCGTGG CGGCGACGCC GTCGGCCGGC TGGGAGATCG GCTCCACGGC GCTGCCGTCG
ACGTTTGCGC CCGGGGACAC CGGCGCGCAG TACCGGATCC TCGCCAAGAA CGCCGGGGCG
GCGGCGACCG ACGGCAGCGC GGTGCAGGTC AGAGCGGTCC TGCCCGCCGG CGTGACGGTG
ACGGCGATCG TCGGGGACGC CGACTACGTC GGCACGACCT GGACGTGCGA CGTCGCGACG
CTCACGTGCG ACCTCGTCCC GGGCTTCAAC GGGCCGGCCG TGAAGGCCGG CCAGGTGCTG
CCGCCGATCC TCCTCACGGT CACGGTCGAC GCGGGCCTGT CGGGCGACGT CGTCAGCGGC
GCCACGATCG AGGGCGGCGG CACGCCGGCC GTCTCAACGG CGACGACCAC CCCGGTCGGG
TTCGCGCCGG TTCCGTTCGG CGTCCGCGAC GGCAGCTTCC GCGCCGAGGT CGTCGACGAG
GCGGGCAGAG CCGTGAGCGA GCTGCAGGCA GGCGAGCATC CGTTCAGCGT CGTGGTGGGC
CTTGCGGTCC CGGCCGCCCG CTTCGACGAC GGCAACGGCG GCAGCTACGC CGCGCCGGCC
GACACCGTGC GCAACGTCCA GGTGCGGATG CCGGCAGGCT TCTACGGCAG CACGCGGACG
GCCGCGAAGT GCACCAACGA CCAGCTCGCG CTGACGCTCG GGCAGGGAGC CGGCTGCCCC
GCCGGCAGCC AGGTCGGCAC GGTCGACCTG ACGCTCTTCA ACGGGACCTC GCTGTACTCG
TGGGCGGACT CGCAGCAGAT CGCGGTCTAC AACATGGTCC CGCCGAAAGG CGTCGTCGCC
GACTTCGCGT TCGCCCTGAT CGGCAACCCG GTGCACGTCC GGATCGAGCT CGATCCGGAC
GATCATTCGC TCGTCGCGAC GATCCCGAAC GTGACCGAGC GCTTCCCCGT GCTCGACCAG
CGGCTGACGC TGTGGGGCAC GCCCGGCGAC CCGCTGCACG ACGCCGAGCG CTTCAACCCG
GCCGACCCGT TCGGAGGGCT CCCGCTGCCG TTCCCGGGCG ACGCGAGCCC GTTCCTGACG
CTCCCGTCGC GCTGTGGTCA GCTCGACGGC GCGAGCGTGA GCGTCAGCTC GTGGGGCGCG
CCGGGCCGGG TCAGCTCGGC CCGGACGGCC GCGAGGACGG TGAAGGGCTG CGAGCGGCAG
CGGTTCGGCG CCTCGATCGG CCTCGGCATG GACACGACGC GCGCCGACGC TCCGAGCGGC
ATCTCCGTTC GCGTCGACGT CGATCAGCAG ACGGGCTGGA GAGGGCTCGC CACACCCCCG
CTGAAGGACG TCGCGGTCGC CCTCCCCGAG GGGGTGTCGG TCTCGCCGTC GTCGGCCGAC
GGCCTCGCCG GCTGCTTCCC CGACCAGATC CGGCTCGGGA CCGACGCCGA GCCGGCGTGT
CCGGACGCGT CGCGGATCGG CAGCGTCGAG ATCGCGACGC CGCTGCTCGA CGAGCCGCTT
CGCGGCGGTG TCTTCCTCGC GCAGCCGCGT GCCAACCCGT TCGGCTCGCT GATCGCGTTC
TACGTGGTGG CGCAGGGATC CGGCGTCACG CTGAAGCTGC CGAGCCGGGT GACGACCGAT
CTCGCGACCG GGCGCGTCAC GACGACGTTC GAGCAGCTGC CGCAGCTGCC GTTCTCGACG
TTCAGGGTGC GCTTCAAGGG CGGGCAGCGC GGGCTGCTCG CGACGGCGCC GACGTGCGGG
ACGGCCGCCG CGTCCGCGCG TCTGACGCCG TGGAACGGAT CGCTGCCGGC GATCGTGATC
GAGCAGCCGA TGACGACCGA CGCCGACGGG GCCGGGGGCG CGTGCGGCGC GTCGCGGTTC
GAGCCGGCCT TCCGTGCCGG GACGGCCGAT GCGACCGCCG GCAGGACGTC GCCGTTCGCG
CTCGCCGTCG CCCGTCCCGA TCAGCACGAG CAGCTCGAGG CGATCTCGAC GGAGCTGCCG
GCCGGGCTGA CCGGCCGGAT CGCCGCCGCG ACGCTGTGCG CCGACGCGGC GGCCGCCCGC
GGCACCTGTC CGGTCGCCGC GCAGGTCGGC TCGGTGCAGG TCGGCTCGGG TCCGGGCGCG
AGCCCGCTGT TCCTCGACGG GAAGGTCTAC GTCACCGGTC CGTATCGCGG GGGTGCGTTC
GGGCTGAGCG TCGCCGTGCC GGCGGTCGCC GGTCCGTTCG ACCTCGGCAC GGTCGTCGTG
CGGGCGGCGA TCTTCGTCGA CCCGCTGACG ACACGGCTGA GGATCGTGTC GGACCCGTTC
CCAGCCAGCC TGGAGGGGAT CCCGCTGCGG ATCCGCGACG TGCGGCTCGC AGTCGATCGG
CCGGGGTTCA TGCTGAACCC GACGAACTGC TCGCCCGCGA GCGTCGCCGG GCAGCTGCGC
TCGACGCGCG GCCGGATCGC GACCGTCGCG AGCCGCTTCC AGGTCGGCGA CTGCGGGGCG
TTGCGGTTCA GACCGCGGAT GACGCTCCGC GCCGGTTCCA GACGGCACCG GCGTGGCGGC
GACTCGACGC CGCTCGAGGT CGTGCTCGCG ATGTCGCCGG GGCAGGCGAA CGTCAGATCG
GTGTCGGTGA CGCTGCCGCG GACGTTGAGC GCCCGGCTCC AGGTCTTGAA CACGCGGAAC
GCGTGCACGC TGCAGCAGTT CAGGTCGGAC AGCTGCCCGA TCGACGTCGG CTCCGCGGTA
GCGGTGACGC CGCTGCTGCG CGATCCGCTC GTTGGGCGGG TCGCGCTCGT GCGCAACCGG
GCGAGCAGAC TGCCGGACGT GATGGTCGCG CTGCGGGGGC AGGGCGACGC GCGGGCGGTC
CGGGTCGAGC TGGCCGGCAA GATCGCGATC ACGAGAGCGC TGCAGATCCG CACCACGTTC
GCCGCGGCGC CCGATGCGCC GATCTCGAAG TTCCGCCTCA GCTTCGCCGC CGGCAGACAC
GCGGCGATCG CCGCGAGCGA GAACCTCTGC AGCGCGAGAG CGAGACGACG GTCGATCGCG
CAGCTGACGT TCGTCGCGCA GAACGGCAGG CGCGTCGCGC GCGACCAGCG CATCGCGATC
GCCGGCTGCC GGCGCTGA
 
Protein sequence
MRAAVRLLAL GLALLLAHTG SAVAATPSAG WEIGSTALPS TFAPGDTGAQ YRILAKNAGA 
AATDGSAVQV RAVLPAGVTV TAIVGDADYV GTTWTCDVAT LTCDLVPGFN GPAVKAGQVL
PPILLTVTVD AGLSGDVVSG ATIEGGGTPA VSTATTTPVG FAPVPFGVRD GSFRAEVVDE
AGRAVSELQA GEHPFSVVVG LAVPAARFDD GNGGSYAAPA DTVRNVQVRM PAGFYGSTRT
AAKCTNDQLA LTLGQGAGCP AGSQVGTVDL TLFNGTSLYS WADSQQIAVY NMVPPKGVVA
DFAFALIGNP VHVRIELDPD DHSLVATIPN VTERFPVLDQ RLTLWGTPGD PLHDAERFNP
ADPFGGLPLP FPGDASPFLT LPSRCGQLDG ASVSVSSWGA PGRVSSARTA ARTVKGCERQ
RFGASIGLGM DTTRADAPSG ISVRVDVDQQ TGWRGLATPP LKDVAVALPE GVSVSPSSAD
GLAGCFPDQI RLGTDAEPAC PDASRIGSVE IATPLLDEPL RGGVFLAQPR ANPFGSLIAF
YVVAQGSGVT LKLPSRVTTD LATGRVTTTF EQLPQLPFST FRVRFKGGQR GLLATAPTCG
TAAASARLTP WNGSLPAIVI EQPMTTDADG AGGACGASRF EPAFRAGTAD ATAGRTSPFA
LAVARPDQHE QLEAISTELP AGLTGRIAAA TLCADAAAAR GTCPVAAQVG SVQVGSGPGA
SPLFLDGKVY VTGPYRGGAF GLSVAVPAVA GPFDLGTVVV RAAIFVDPLT TRLRIVSDPF
PASLEGIPLR IRDVRLAVDR PGFMLNPTNC SPASVAGQLR STRGRIATVA SRFQVGDCGA
LRFRPRMTLR AGSRRHRRGG DSTPLEVVLA MSPGQANVRS VSVTLPRTLS ARLQVLNTRN
ACTLQQFRSD SCPIDVGSAV AVTPLLRDPL VGRVALVRNR ASRLPDVMVA LRGQGDARAV
RVELAGKIAI TRALQIRTTF AAAPDAPISK FRLSFAAGRH AAIAASENLC SARARRRSIA
QLTFVAQNGR RVARDQRIAI AGCRR