Gene Information |
Locus tag | Cwoe_5526 |
Symbol | |
ID | 8736001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 5917635 |
End bp | 5920625 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646506156 |
Product | hypothetical protein |
Protein accession | YP_003397306 |
Protein GI | 284046966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
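The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon. Both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5917635, 5920625)
print(length)                     # 2991
print(protein_length_aa(length))  # 996
```

Both results match the Gene Length (2991 bp) and Protein Length (996 aa) fields.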
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTTT CCCCCCCACG GGCGCTGGTC GCAGCGCTCG TCGCGCTGTT GGTCCCACTC GTGTTGTCGG CGAGCGCACA CGCGTTCTCG CTGACGAGCG TCGGCGCAAC GCCCGGCAGA CCGGCGAACA GCCCGCTCGG CGCCAGCGAT CCGGCCGTGA GAGATCCGCT CTCAGCCGCT GCACATCCCG ACATGGCGAT CAGACTCGAC TTCGACGATG TCGGCAACCC GACCGCCGAC AGCGTCGACA GTCTCGAGAT CGGGCTCGCG CCCGGCATCG TCGCGTTCGT CAACCACATC GAGACGTGCA CCACTTGGGA CATGTCGCAG ACGAGAACGA ACTGCCCGAG ATCGGTGATC GGTTCCGCCA TCACCAGAGC GTCTGCACCT GTGCTCGGAG CGCTCACGCT CAACGGCGCG ATCTACCGGA TCCCATCGCC GGATCCGTCG AGAGTCCCGA CGGCGTTCGG CATCGACATC GAGCCACCGC TTCCCGGCCT GAGACGGATC AAGCTCGTGT CGCCGATCAC GGTCAACCCT CTGAACTTGG GCCTGACGGC TTCCCTGAGC GGCCTTCCGA ACTTCGCCGA AGTGCCCCTC GTCGGCAGAC TGCCGGTCCA CATCGACTCG ATCACGCAGA TCCTCAATGG CTACAACGCC GAGGGCAGAT CGTTCTTCAC CAACCCGACC TCCTGCATTC CGGCGGTTGT CAGCGTGACG GCCAGATCCC ATGGCGGCGC CCCGACGTCG GGCAACGGCT CGTACACCCC GACCGACTGC GAGAACGTGC CGTTCAACAC GACGCTCGCG TCGTCAGCCG ACCCGAACAC CGCCGACTCG ACGTCGGCGA TCAGCACGGA CGTGATCCCC GGGACGGAGG ACATCCCGCG CGTCAGCTCG ATGGTCCGCG GGACGACGTT CCTCGGCGCC CCGGGCATGC TGCTCAACCC GGCGCTGGCA GCTCGGCTCG ACGCCTGCAC CGACGCGGGC TTCGCGCTCG CCGACTCCTC TGTCGCAGCG AACTGCCCGG CATCGTCGGA GGTCGGGACG ATCGACTTCA CGTCGCCGAT CCTCGGCAAC TTCCCCGGCA AGGCGTACTT CGGCACGCAG ACGCCGACCG ACCGGCTGCG GCTGTTCCTC GACGTGCCGC TGTTCGGCGC GCACATCAAG CTCTCCGGCA CGGTCAACCC GGACTTCAGA ACCGGCCAGA TCACGCTGAA GTTCAGCGAC CTCCCGCAGG TCGCGTTCAC GAACTTCAGA CTCACGTTCA AGGGCGGACC GCAGTCGGCG CTCGTGACGC CGACGACGTG CGGCCCGGCG ACGACGATCG CGACCGTCAC ACCGTGGTCG GGCGGCGCGG CGAGAACGCC GTCCGCCTCC TACGACGTCG TCGACTGCGC ACGCAGATTC ACGCCGTCGA TGGCGACCTC GGTCAGCAAC CCGCAGGCCG GCGCGGACAC GAGCTTCACG CTCTCGTTCG ACCGCCCCGA CAGAACCGTC CCGGTCGGCA GAGTCGCGTT CGACCTGCCG CCCGGCCTCA TAGGCTCGCT CGCGCTGCCG GGCCTGACGA AGTGCGCGCT CGCGACCGCC GCGGCGAACG CCTGCCCGGC GTCGAGCCGG ATCGGCAGCG TCAACGCGAT CGTCGGCTCC GGAACCGAAC CGCCGACGCT GCCCGGCAGC ATCTACCTGA CCGAGCCGCG CGTCGCGGGC GACCCGGCCG GGCTGTCGAT CTCCGTGCCG GCGAGACTCG GGCCGGTCGA CGCCGGCATC GTGACCGTCG CGCAGCGGCT GACGCTCGGC 
AACGACGGCG GCCTGGACAT CGTCAGCGAC CCGATCCCGG CACTCCAGCT CGGAATCCCG CTCGCGATCC GCAGACTGAC CGTCAACGTC GACCGCGCCG GCTTCATGAA GAACCCGACC AGCTGCGGCA CCCCGAAGGC CTCCGGCGCG TTCTCGCCGC TCGACGGCGG AGCGGGTGCG ACCGCGAGCT CCACGATCAC GGTCAACGGC TGCGACAGAC TGCCGTTCGG CCCGAGAATC ACCGGCACGA TCGGCGGCAG AGGCCAGAAC CGCGAGGGCC AGCACCCGTC GTTCACGACG CGGATCACGC AGCGGGCCGG CGAGTCCGCG ATGAAGACGG CGGTCGTGAC GCTGCCGAGA GCGGTCTCGA CGAACCTGCC GACGCTGAGA GCGGCCTGCC CGCTCGCCAC GTACAACGCG AGAAGATGCA CCGCGCGGAC GATCGTCGCC ACCGCGACGG CCGTCAGCCC GCTGATCAAC AGACCGATCA CCGGCCCGAC GTATCTCGTC AGAACCGCCT CCGGCGGCCT GCCGAGCCTC GCGGTCGAGC TGCGCGGCGA GGTCTCGCTC GACCTGCTCG GCACCAACGC CGTCAGAGGC GGCCTCGTGA TCACGACGTT CGGCAGCATC CCCGACCTGC CGCTGTCCAG CTTCGAGCTG AGATTCCGCG GTGGCAGAGA GGGTGTCCTG ACGACCGTCG GGGATCTCTG CGCCCGGCCC GTGCTCGGCG CCAGATTCAC CAGCCAGAGC GGCAGAGCGA CGTCGCAGAG CCCGCGGCTC ACGGCGCTCG GCTGCGTGCC CAGACCGAAG TCGGGCGCGA CGCTGAGATT CCGCAGAGGC GCCGGCAGAC TCGGCGTCCG GACCGGCGTC GCGAGAAGCG GCAAGCCACT CTCCAGCGTG CGGATCGGCC TGCCGAGAGG CCTGCTGATC AGAGGCGCGT CGGTGACGTC GAAGGCCGGC AGAACCCGTC TCGCGAGAAG AGCGATCCGG GTCAGCGGCC GGACGATCAC CGTGAAGCTC TCCAGACGCG GAGCGCGCAA GGTCACGGTC AACGTCCGCG GCGTCAGAGC CAGCTCGGCG AGACTCGCGA GACGGCTCGC GGCGCGCAGA GGCAGACTGA CGGTGACGGT CCGCACCGCC CAGAAGGGCG GGCCGCGCGT CACGCAGAAG GTGAAGCTGA AGCTGCGGTA G
|
Protein sequence | MRFSPPRALV AALVALLVPL VLSASAHAFS LTSVGATPGR PANSPLGASD PAVRDPLSAA AHPDMAIRLD FDDVGNPTAD SVDSLEIGLA PGIVAFVNHI ETCTTWDMSQ TRTNCPRSVI GSAITRASAP VLGALTLNGA IYRIPSPDPS RVPTAFGIDI EPPLPGLRRI KLVSPITVNP LNLGLTASLS GLPNFAEVPL VGRLPVHIDS ITQILNGYNA EGRSFFTNPT SCIPAVVSVT ARSHGGAPTS GNGSYTPTDC ENVPFNTTLA SSADPNTADS TSAISTDVIP GTEDIPRVSS MVRGTTFLGA PGMLLNPALA ARLDACTDAG FALADSSVAA NCPASSEVGT IDFTSPILGN FPGKAYFGTQ TPTDRLRLFL DVPLFGAHIK LSGTVNPDFR TGQITLKFSD LPQVAFTNFR LTFKGGPQSA LVTPTTCGPA TTIATVTPWS GGAARTPSAS YDVVDCARRF TPSMATSVSN PQAGADTSFT LSFDRPDRTV PVGRVAFDLP PGLIGSLALP GLTKCALATA AANACPASSR IGSVNAIVGS GTEPPTLPGS IYLTEPRVAG DPAGLSISVP ARLGPVDAGI VTVAQRLTLG NDGGLDIVSD PIPALQLGIP LAIRRLTVNV DRAGFMKNPT SCGTPKASGA FSPLDGGAGA TASSTITVNG CDRLPFGPRI TGTIGGRGQN REGQHPSFTT RITQRAGESA MKTAVVTLPR AVSTNLPTLR AACPLATYNA RRCTARTIVA TATAVSPLIN RPITGPTYLV RTASGGLPSL AVELRGEVSL DLLGTNAVRG GLVITTFGSI PDLPLSSFEL RFRGGREGVL TTVGDLCARP VLGARFTSQS GRATSQSPRL TALGCVPRPK SGATLRFRRG AGRLGVRTGV ARSGKPLSSV RIGLPRGLLI RGASVTSKAG RTRLARRAIR VSGRTITVKL SRRGARKVTV NVRGVRASSA RLARRLAARR GRLTVTVRTA QKGGPRVTQK VKLKLR
|
| |
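The sequences above are printed in space-separated blocks of 10 characters, so they need to be normalized before any computation. As a minimal sketch, the helper names below are hypothetical, and the short input is only the first two blocks of the gene sequence. Running the same functions on the full gene sequence would allow a comparison with the reported Gene Length and GC content.

```python
def normalize_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
example = "ATGAGGTTTT CCCCCCCACG"
seq = normalize_sequence(example)
print(len(seq), gc_fraction(seq))
```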