Gene Information |
Locus tag | Cwoe_1446 |
Symbol | |
ID | 8731886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1523262 |
End bp | 1526030 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646502064 |
Product | hypothetical protein |
Protein accession | YP_003393249 |
Protein GI | 284042909 |
COG category | [S] Function unknown |
COG ID | [COG3247] Uncharacterized conserved protein |
TIGRFAM ID | |
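The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the table above.
length = gene_length_bp(1523262, 1526030)
print(length)                     # 2769
print(protein_length_aa(length))  # 922
```

Under these assumptions the computed values agree with the Gene Length (2769 bp) and Protein Length (922 aa) rows.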
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.09163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0544825 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCGGG CGCTCGTCGT GCTCGCGATC GCGATGGCGC TGGCGATCGC CGGAGCCGTC GCAGCTTCGC TCGCGCACGC GCAGGACACG CCCGGCCCGC CGTCGCCGAC CGAGCAGCCG GGCCCGTCTC CGACCCCGGA CACCGAGCCC GACGACGACG AGCCGCGCGA GCCGACGCCG GGCCTCACGC CTCCGGCGGA CAGCCCAGGG CCGCCGTCGC CGACGGAGGA GCCGCAGCCC GAGCCGTCGC CGTCGCCCGG ACCCGACCCT GACCCCGATC GGCCGGAGAC GTCGCCGGCG CCTGAAAGAC CGCGCGACAC GGCCGACCTC GAAGGCTTCG CGGGCGACTC GCCGATGTGC GACGAGGAGC GCAATGCGAT GAAGCTCGAG GCCGGCGGCA TCGCGCAGCG CAACTGCAGT CGCTCCGGTT CGATCGCGCA GCCGCACCCG ACCGCGCACT ACGGCCTCGA CCAGAACGTC GACATCAAGG TCACCGAGCC CGAGACGATG ATCGCCGGCG CGCTCCACTC CGGCAGCCAG ACGGTCTGGC TCGGGCTCGT CTACATGATG CGCGGCGCGC TCGCGATCGT CGAGCTGGGC TTCTCGCACA GCCTCGTGCT CAGCTCGATG GCCGAGATCA GAAGCGGGAT CGCCCGCCTC AAGTCGATCT TCCTCGCCGA CAGCTGGCAG GTCGCGGCGC TCACCGTGCT CGGCCTCTGG GGGATCTGGA TGGGCCTCGT GCGGCGCAAG ACGATCAACA CGATCGGCGG CATCGCGACG GCCGTCGGGC TGATGATCGG CGCGCAGGTG ATCATGATCA ACCCGGAGGG GACCGTCGGC AAGGTCGCGA CGCTCTCCAA CGAGGGCGCG CTCGTCGCGC TCGGTGCCGC GTCGACCGGC GACGTCGACA AGCCGGCGGA CACCTTCGCG GTCGCCCACC AGCGGCTCTT CACCTCGCTC GTCGTGCGCC CGTGGTGCGC GCTGCAGTTC ACCGACGTCG AGGGCTGCAT GAAGGTGCAG AACATCAAGG ACGGCCCCAA CGTCTCGATC GCTGATGTCT GGCTGTCGTT CCCGTCCAAC AGCCCCGAGC GGGAGAAGCT CTACGACAAC CGCAAGGGCG AGCCGTTCAA GCTCGAGTTC ACGCCCGAGA AGTGCAAGCG CAAGGGCTGG AAGAAGTTGG TCTTCGACGA CATCGCGTCG GACTACTCCG GGCTCGGCAT GCTCGGCAAC TTCTCGACGT GGGCGTGCGA GAAATGGAAC GGGCCGGACG GCAGCGGCCA GAGAATCTCG CTGAGATACA ACGACGAGAA GGCCGGGATC CGCACGGACG CCACGATCCA GAGCGGGAAG GGCGCGTTCA CGCGGCTCGG GATGCTGGCG CTGATCGCGG CCGGGATGGT GGGCGCGATC GCGGTGCTGC TGTGGATCGG CGTCCGGCTG CTGCTGACCG GGGTCTTCTC GCTCGTCCTG ATCCTGCTGA CGCCGATCGT CTTCCTGCTC GCCGCGTTCG GAGAGGGCGG GCGGCGCAGC GTCGTCGCGT GGGGCCAACG CCTGCTCGGG CTGCTGATCG CCAAGTTCGT CTTCGCGCTG ATGCTCGCGG TCGTCGTCCT GATCGCGAAC ATCATCCAAG GGCTCGACGT CGGCTGGACG TCGATCTGGA TGTTCAACAT CGCCTACTGG TGGGGCCTGC TGCTCAAGCG CAGAGAGCTG CTCGGGTTCC TGACGCTCGA GAGACCCGCC TCCGAGGGCG GCCTCGGCCT TGCCGGCAGC GGTCGTGGCG GGTCGGGCGG CCTCAGCAGC 
CTGTACTACG GCTGGCGGAT GGCGAACGAT GCGTTCAGAC AGGTGCGCAA GCCGTTCGAC ATGGCGGCCG ACGCCGGGCG CAACCGCGCG GCGCGAACGG CCGACAAGCG CGCCGACCGC GACGCGGACG CCGCCGATCG CGCCGACGCG ACGCACGAGC GCGGCCAGGA CGACCGCGGC GCGCAGACGG CCGCGATCCG CGAGCACGAC CGCGAGGAGC TGGGCGCCGA AGCGAGCCGC GGCCGCGACG TCGCCACGCA TCGCGACGAC CTCCGCAAGC ACCGCGACAA GCAGCGCGAG CGGCGAAAGG CCGAGCAGGA GCTGAAGGCG AACCGCGCGC GGCAGGCCGC GCTCGCCGCC AAGCACCCGC CGAAGGTCCC GACGACCGCG GCCGAGCGCA AGGCCGCTGA CGAGCAGCGG GCGCTCAAGC AGCGCGAAGG CCAGCTGCGC GGCAAGCTCG ACGCCTGGCG CAACGACCCG TCGCGCACGA AGCGGCCGGT GCTCACCAAC AGCGACCCGG TCAAGGGCCG CGAGCTGGAC GACTACATCT CGGCCCGTCG CGAGGAGATC TCGACGCTGC CGCCCGAGCA CGAGCGCAAC CTGCTCGCGG CCGGGATCGA CCCCGACCGC TACCGTCGCG CCGGCAGCGC CGAGAAGGAC GCGCTGCGCG AGCGTTCGGC CGACGCGATG GACCGTTCGC GGACGCTGCT GGAGACCGCG ACCGCGGAGC GTCCGATGAC AGAGCGGAAG CTGAGAAAGG CCTGGGAGCG GATCGACGCC GGACCGCGCG GGACCCGAAC ACGCCAGTGG GCGCAGGAGA ACCGCGAGCA GGCCAGCCGC GCGACGCGCG ACCGCAACGC CGAGCGCACG CGCGAGGAGG CGCGGCGCAC ACGCCAGCAC GAGCGCGAGC GGCGCGCGCG CCGCCGCGAG GACGAGGCCC GTTCGCGGGC GCGGCGGGGG GTGCGATGA
Protein sequence | MRRALVVLAI AMALAIAGAV AASLAHAQDT PGPPSPTEQP GPSPTPDTEP DDDEPREPTP GLTPPADSPG PPSPTEEPQP EPSPSPGPDP DPDRPETSPA PERPRDTADL EGFAGDSPMC DEERNAMKLE AGGIAQRNCS RSGSIAQPHP TAHYGLDQNV DIKVTEPETM IAGALHSGSQ TVWLGLVYMM RGALAIVELG FSHSLVLSSM AEIRSGIARL KSIFLADSWQ VAALTVLGLW GIWMGLVRRK TINTIGGIAT AVGLMIGAQV IMINPEGTVG KVATLSNEGA LVALGAASTG DVDKPADTFA VAHQRLFTSL VVRPWCALQF TDVEGCMKVQ NIKDGPNVSI ADVWLSFPSN SPEREKLYDN RKGEPFKLEF TPEKCKRKGW KKLVFDDIAS DYSGLGMLGN FSTWACEKWN GPDGSGQRIS LRYNDEKAGI RTDATIQSGK GAFTRLGMLA LIAAGMVGAI AVLLWIGVRL LLTGVFSLVL ILLTPIVFLL AAFGEGGRRS VVAWGQRLLG LLIAKFVFAL MLAVVVLIAN IIQGLDVGWT SIWMFNIAYW WGLLLKRREL LGFLTLERPA SEGGLGLAGS GRGGSGGLSS LYYGWRMAND AFRQVRKPFD MAADAGRNRA ARTADKRADR DADAADRADA THERGQDDRG AQTAAIREHD REELGAEASR GRDVATHRDD LRKHRDKQRE RRKAEQELKA NRARQAALAA KHPPKVPTTA AERKAADEQR ALKQREGQLR GKLDAWRNDP SRTKRPVLTN SDPVKGRELD DYISARREEI STLPPEHERN LLAAGIDPDR YRRAGSAEKD ALRERSADAM DRSRTLLETA TAERPMTERK LRKAWERIDA GPRGTRTRQW AQENREQASR ATRDRNAERT REEARRTRQH ERERRARRRE DEARSRARRG VR
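The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch, with hypothetical helper names, that strips the whitespace and computes the GC fraction. The sample string is only the first 30 bases of the gene sequence, so its result is not expected to equal the 72% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C, in the range 0.0 to 1.0."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence, copied from the record.
sample = clean_sequence("ATGCGCCGGG CGCTCGTCGT GCTCGCGATC")
print(len(sample))                 # 30
print(round(gc_content(sample), 3))
```

To reproduce the GC content row, the full gene sequence would be passed through `clean_sequence` first, since it is printed here across more than one line.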