Gene Information |
Locus tag | Cwoe_4871 |
Symbol | |
ID | 8735337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5195548 |
End bp | 5198685 |
Gene Length | 3138 bp |
Protein Length | 1045 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646505499 |
Product | hypothetical protein |
Protein accession | YP_003396658 |
Protein GI | 284046318 |
COG category | |
COG ID | |
TIGRFAM ID | |
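The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 5195548
END_BP = 5198685


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3138, matching "Gene Length"
print(protein_length(length_bp))  # 1045, matching "Protein Length"
```

Because the record lists the gene as unspliced (Is gene spliced: No), no intron handling is needed in this sketch.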
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCGTGCGG CGGTACGGCT CCTCGCGCTC GGGCTCGCGC TTCTGCTTGC CCATACCGGC AGCGCCGTGG CGGCGACGCC GTCGGCCGGC TGGGAGATCG GCTCCACGGC GCTGCCGTCG ACGTTTGCGC CCGGGGACAC CGGCGCGCAG TACCGGATCC TCGCCAAGAA CGCCGGGGCG GCGGCGACCG ACGGCAGCGC GGTGCAGGTC AGAGCGGTCC TGCCCGCCGG CGTGACGGTG ACGGCGATCG TCGGGGACGC CGACTACGTC GGCACGACCT GGACGTGCGA CGTCGCGACG CTCACGTGCG ACCTCGTCCC GGGCTTCAAC GGGCCGGCCG TGAAGGCCGG CCAGGTGCTG CCGCCGATCC TCCTCACGGT CACGGTCGAC GCGGGCCTGT CGGGCGACGT CGTCAGCGGC GCCACGATCG AGGGCGGCGG CACGCCGGCC GTCTCAACGG CGACGACCAC CCCGGTCGGG TTCGCGCCGG TTCCGTTCGG CGTCCGCGAC GGCAGCTTCC GCGCCGAGGT CGTCGACGAG GCGGGCAGAG CCGTGAGCGA GCTGCAGGCA GGCGAGCATC CGTTCAGCGT CGTGGTGGGC CTTGCGGTCC CGGCCGCCCG CTTCGACGAC GGCAACGGCG GCAGCTACGC CGCGCCGGCC GACACCGTGC GCAACGTCCA GGTGCGGATG CCGGCAGGCT TCTACGGCAG CACGCGGACG GCCGCGAAGT GCACCAACGA CCAGCTCGCG CTGACGCTCG GGCAGGGAGC CGGCTGCCCC GCCGGCAGCC AGGTCGGCAC GGTCGACCTG ACGCTCTTCA ACGGGACCTC GCTGTACTCG TGGGCGGACT CGCAGCAGAT CGCGGTCTAC AACATGGTCC CGCCGAAAGG CGTCGTCGCC GACTTCGCGT TCGCCCTGAT CGGCAACCCG GTGCACGTCC GGATCGAGCT CGATCCGGAC GATCATTCGC TCGTCGCGAC GATCCCGAAC GTGACCGAGC GCTTCCCCGT GCTCGACCAG CGGCTGACGC TGTGGGGCAC GCCCGGCGAC CCGCTGCACG ACGCCGAGCG CTTCAACCCG GCCGACCCGT TCGGAGGGCT CCCGCTGCCG TTCCCGGGCG ACGCGAGCCC GTTCCTGACG CTCCCGTCGC GCTGTGGTCA GCTCGACGGC GCGAGCGTGA GCGTCAGCTC GTGGGGCGCG CCGGGCCGGG TCAGCTCGGC CCGGACGGCC GCGAGGACGG TGAAGGGCTG CGAGCGGCAG CGGTTCGGCG CCTCGATCGG CCTCGGCATG GACACGACGC GCGCCGACGC TCCGAGCGGC ATCTCCGTTC GCGTCGACGT CGATCAGCAG ACGGGCTGGA GAGGGCTCGC CACACCCCCG CTGAAGGACG TCGCGGTCGC CCTCCCCGAG GGGGTGTCGG TCTCGCCGTC GTCGGCCGAC GGCCTCGCCG GCTGCTTCCC CGACCAGATC CGGCTCGGGA CCGACGCCGA GCCGGCGTGT CCGGACGCGT CGCGGATCGG CAGCGTCGAG ATCGCGACGC CGCTGCTCGA CGAGCCGCTT CGCGGCGGTG TCTTCCTCGC GCAGCCGCGT GCCAACCCGT TCGGCTCGCT GATCGCGTTC TACGTGGTGG CGCAGGGATC CGGCGTCACG CTGAAGCTGC CGAGCCGGGT GACGACCGAT CTCGCGACCG GGCGCGTCAC GACGACGTTC GAGCAGCTGC CGCAGCTGCC GTTCTCGACG TTCAGGGTGC GCTTCAAGGG CGGGCAGCGC GGGCTGCTCG CGACGGCGCC GACGTGCGGG 
ACGGCCGCCG CGTCCGCGCG TCTGACGCCG TGGAACGGAT CGCTGCCGGC GATCGTGATC GAGCAGCCGA TGACGACCGA CGCCGACGGG GCCGGGGGCG CGTGCGGCGC GTCGCGGTTC GAGCCGGCCT TCCGTGCCGG GACGGCCGAT GCGACCGCCG GCAGGACGTC GCCGTTCGCG CTCGCCGTCG CCCGTCCCGA TCAGCACGAG CAGCTCGAGG CGATCTCGAC GGAGCTGCCG GCCGGGCTGA CCGGCCGGAT CGCCGCCGCG ACGCTGTGCG CCGACGCGGC GGCCGCCCGC GGCACCTGTC CGGTCGCCGC GCAGGTCGGC TCGGTGCAGG TCGGCTCGGG TCCGGGCGCG AGCCCGCTGT TCCTCGACGG GAAGGTCTAC GTCACCGGTC CGTATCGCGG GGGTGCGTTC GGGCTGAGCG TCGCCGTGCC GGCGGTCGCC GGTCCGTTCG ACCTCGGCAC GGTCGTCGTG CGGGCGGCGA TCTTCGTCGA CCCGCTGACG ACACGGCTGA GGATCGTGTC GGACCCGTTC CCAGCCAGCC TGGAGGGGAT CCCGCTGCGG ATCCGCGACG TGCGGCTCGC AGTCGATCGG CCGGGGTTCA TGCTGAACCC GACGAACTGC TCGCCCGCGA GCGTCGCCGG GCAGCTGCGC TCGACGCGCG GCCGGATCGC GACCGTCGCG AGCCGCTTCC AGGTCGGCGA CTGCGGGGCG TTGCGGTTCA GACCGCGGAT GACGCTCCGC GCCGGTTCCA GACGGCACCG GCGTGGCGGC GACTCGACGC CGCTCGAGGT CGTGCTCGCG ATGTCGCCGG GGCAGGCGAA CGTCAGATCG GTGTCGGTGA CGCTGCCGCG GACGTTGAGC GCCCGGCTCC AGGTCTTGAA CACGCGGAAC GCGTGCACGC TGCAGCAGTT CAGGTCGGAC AGCTGCCCGA TCGACGTCGG CTCCGCGGTA GCGGTGACGC CGCTGCTGCG CGATCCGCTC GTTGGGCGGG TCGCGCTCGT GCGCAACCGG GCGAGCAGAC TGCCGGACGT GATGGTCGCG CTGCGGGGGC AGGGCGACGC GCGGGCGGTC CGGGTCGAGC TGGCCGGCAA GATCGCGATC ACGAGAGCGC TGCAGATCCG CACCACGTTC GCCGCGGCGC CCGATGCGCC GATCTCGAAG TTCCGCCTCA GCTTCGCCGC CGGCAGACAC GCGGCGATCG CCGCGAGCGA GAACCTCTGC AGCGCGAGAG CGAGACGACG GTCGATCGCG CAGCTGACGT TCGTCGCGCA GAACGGCAGG CGCGTCGCGC GCGACCAGCG CATCGCGATC GCCGGCTGCC GGCGCTGA
Protein sequence | MRAAVRLLAL GLALLLAHTG SAVAATPSAG WEIGSTALPS TFAPGDTGAQ YRILAKNAGA AATDGSAVQV RAVLPAGVTV TAIVGDADYV GTTWTCDVAT LTCDLVPGFN GPAVKAGQVL PPILLTVTVD AGLSGDVVSG ATIEGGGTPA VSTATTTPVG FAPVPFGVRD GSFRAEVVDE AGRAVSELQA GEHPFSVVVG LAVPAARFDD GNGGSYAAPA DTVRNVQVRM PAGFYGSTRT AAKCTNDQLA LTLGQGAGCP AGSQVGTVDL TLFNGTSLYS WADSQQIAVY NMVPPKGVVA DFAFALIGNP VHVRIELDPD DHSLVATIPN VTERFPVLDQ RLTLWGTPGD PLHDAERFNP ADPFGGLPLP FPGDASPFLT LPSRCGQLDG ASVSVSSWGA PGRVSSARTA ARTVKGCERQ RFGASIGLGM DTTRADAPSG ISVRVDVDQQ TGWRGLATPP LKDVAVALPE GVSVSPSSAD GLAGCFPDQI RLGTDAEPAC PDASRIGSVE IATPLLDEPL RGGVFLAQPR ANPFGSLIAF YVVAQGSGVT LKLPSRVTTD LATGRVTTTF EQLPQLPFST FRVRFKGGQR GLLATAPTCG TAAASARLTP WNGSLPAIVI EQPMTTDADG AGGACGASRF EPAFRAGTAD ATAGRTSPFA LAVARPDQHE QLEAISTELP AGLTGRIAAA TLCADAAAAR GTCPVAAQVG SVQVGSGPGA SPLFLDGKVY VTGPYRGGAF GLSVAVPAVA GPFDLGTVVV RAAIFVDPLT TRLRIVSDPF PASLEGIPLR IRDVRLAVDR PGFMLNPTNC SPASVAGQLR STRGRIATVA SRFQVGDCGA LRFRPRMTLR AGSRRHRRGG DSTPLEVVLA MSPGQANVRS VSVTLPRTLS ARLQVLNTRN ACTLQQFRSD SCPIDVGSAV AVTPLLRDPL VGRVALVRNR ASRLPDVMVA LRGQGDARAV RVELAGKIAI TRALQIRTTF AAAPDAPISK FRLSFAAGRH AAIAASENLC SARARRRSIA QLTFVAQNGR RVARDQRIAI AGCRR
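The sequences in this record are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the block spacing and compute a GC fraction of the kind reported in the GC content field. It is an illustration under stated assumptions, not the method the database uses: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence above, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace between display blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, copied from the record.
prefix = "GTGCGTGCGG CGGTACGGCT"
print(clean_sequence(prefix))  # GTGCGTGCGGCGGTACGGCT
print(gc_fraction(prefix))     # 0.75
```

To check the reported 74% for the whole gene, pass the full Gene sequence field to `gc_fraction` in place of `prefix`.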