Gene Information |
Locus tag | Cwoe_2320 |
Symbol | |
ID | 8732763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 2451343 |
End bp | 2454093 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646502937 |
Product | hypothetical protein |
Protein accession | YP_003394119 |
Protein GI | 284043779 |
COG category | [A] RNA processing and modification |
COG ID | [COG5178] U5 snRNP spliceosome subunit |
TIGRFAM ID | |
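The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates (which the reported Gene Length implies) and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(2451343, 2454093)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values can be compared directly with the Gene Length and Protein Length rows.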
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0234116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGGGGA AGCTCGCACG TGTTCCGCGT GCGATCGGGA TCGGCGTCGT CCTCGCTCTG TCGATCGCTT CGGGAGCATC AGCCGCGGAG ATCTCTGGGT CGGTCTTCCA CGACTACAAC ACCAACGGGA TCCGCGACAC GGACCCGAGA TCCGGTGCGG TGGACGTGGG CGTCGGCGGG TTCACCGTGA GAGCGGTGCG GGGCGCGTCG AACGTCGTCG CGACGGCGAC GACGGCCGCG GACGGCACCT ACAGACTCAC CGCGCCGAAC GCGAACGTCC GTCTCGACCT GACCGTCCCG CAGCCGTGGT GGCCGACGCG GCAGCTCAAC GGCCTGCGCT CCGACGAGCA GTTCGTTGAC GCCTCCGCGC CGGTGAGAGG CGCGAACTTC GGCGTGCACC GGCCGCGCGA GTGGTCGTTC GACGATCCGA CGGTGTTCTG GCCGACGCAG TCGCCCGGAC CGGTGTCCGG CGTGCGTGCA GGCTCCGACG CGATCCGCGG GATCGACTAC TTCACCAATC CGCTGACGAA CGGCACGTTC GCCGCCGAGC CGAGCAGACG GGTGCTGGCG AGCTTCGGGC AGGTCGGCTC GGTCTTCGGC ATCGGCGTCG ACCAGAGCAC GGGCAACCTC TACGCCGGCG CGTTCTACAA GCGCTTCTCC GGCCTGACCG CGGACGGTCC CGGCGCGATC TTCCGCGTGA CGCAGGGCGG CGGCGTCCTG CCGTGGGCGA GACTCCCGGC CGGAACCGAC GGCCACCCGA CGAGCACGGT CACGAACCAG TGGTTCACGA CCGACCTCGA CAGATCGTGG GATCTCGTCG GGCGGGCCGG GCTCGGATCG ATCAAGGTCG ACCCGGCCGA CAGAAACGTC TACGCGGTCA ACCTGGCCGA CAAGAGCCTC TACCGGCTCC CGCTTGGCGG CGTGCCTCCG GTCACACCGG CGGCCGTCAC GGAGATCCCC GACCCGGGCT GCCCCGGCGG CGACTGGCGG CCGTTCTCGA TCGGCTTCGA GCGCGGCAGC GGCGCGATGT ACGTCGGCGG AGTCTGCTCG GCGGAGACGT CGCAGGACCC GACTCAGCTG GCCGCCGTCG TGCTGCGCGT CAGCGATCCC GAGGGCAGCC CGACGTTCGA CATCGCCCAG ATCGTGTCGC TGGGCTACGA CCGCGTGAAC CGCGCGCCGG TCAACCACGA CAACACGAGC AACTGGCGGC CGTGGCCGAC CGCGGCGACG ATCACCGACA GAGGCTTCCA GTCGCCGCCG GACGACAGCC GCGGAACGCG CGCGTCGACC AGCGGGCCGG TGCTCGTCTA CGACGAGTCC TACCCACAGC TCGCCGCGAT CGGCTTCGAC TCCGACGGCT CGATGCTGCT CTCCTTCCGC GACCTGATGG GCGACATGTC CGGCCAGGCC GTCCCCGGCG AGGGCTCCGG CCCGCACCAG TCGGTCGTCG CCCAGGGCGA CCTGCTGCGC GCCGCGCCCA ACGGCTCCGG CGGCTTCACG CTCGAGTCCA GAGGCGTCGT CGGCGGGCGC ACCGGCTTCG GGCTCATCCC CGTCTCCAGC GGTGGGTACG GGCCGATAGG GCCCGGCGGC GGCTACTTCT ACGACCCGCA GCCGGGCCAC GGCTTCGATG CCGGACCGAC CAACGAACAG GGCGGTCTGC TGCAGATCCC GGGCTTCAGC GACGTGCTGA CGACGCAGAT CGACGCGATC GACATCAGAG ACAACGGCCT GCTCTGGTAC GACAACGCGA ACGGCGGCAA GACGCGCCGG TTCCAGAACG TCGTCACCAC CGTCGGCTCG 
GCCGGCGGGT TCTCGAAGGC GAACGGCCTC GGCGACCTCG ACGCCTACAG CGGCCGCGCA CCGGTCCAGA TCGGCAACCG CCTCTGGTAC GACGTCGACG CCGACGGGAT CCAGGACACC GACGAGGACC CGGTCGTCGA CACCGTCGTC GAGCTGCTCG ACGAGAACGG CGTCGTGATC GACAACACCA CGACCGACTC CAGAGGCGAG TACGTCTTCG CGATCGCGCC CGACACGAGA TACAGCGTCC GCGTCCCGCT CGCGCAGCCG TCACTCGACG GCTGGGTCGT GACGCAGGCG TTCGCCGGCG ACGACCGCCG CACCGACTCC AACGGACGCG AGCAGGACGG CCGCTCCGTC GCGTCGGTGA GCGCGCACGA GGTCGGCCGC AACGACCACT CCTACGACTT CGGCTTCACG AGAGCCAGAA GACCGCCGCC TCCGCCTCCG CCACCACCAC CGCCTCCGCC TCCGCCCGAG GAGCCGCAGC CGCCGGTCCC GCCGCCGCTC GCTCCGCCCG CGCCGCCGCA GCCGACCTAC GCCGGCGCAG AGCCGTCGCC GAACACGCCG ATGGTGCTCG TCAAGTACCT CGTCAGACGC TCGGGCGGCA GAGTCTCCCT GAGAAGACTC CAGTTCGGCA TGGTCCTGCG CAACGCCGGC CCCGAGACCG TCAACCGCCT GCGGCTGTGC GACAACCTCC CCCGGCTGCT CACCGTCCTC GGCGCGACCA GACTCGCAAG AAGATTCCCC AGAGCACGCC AGGTCTGCTG GAGACTCGCG AGCCTCCCGA GCGGCCACGC GCGCAAGTTC CACGTCCTCA CCCAGCTGCG CAGAAGAGTC CTGATCGGGC TCCTGATCAA CCGCGCACGC GCCGTCGCAC AGAGAGTCAG ACCGGCGCAT GCGAGAGCGG TCGTGCGGCC GAGACGACCC GCTCCGAGAG TCACCGGCTG A
Protein sequence | MKGKLARVPR AIGIGVVLAL SIASGASAAE ISGSVFHDYN TNGIRDTDPR SGAVDVGVGG FTVRAVRGAS NVVATATTAA DGTYRLTAPN ANVRLDLTVP QPWWPTRQLN GLRSDEQFVD ASAPVRGANF GVHRPREWSF DDPTVFWPTQ SPGPVSGVRA GSDAIRGIDY FTNPLTNGTF AAEPSRRVLA SFGQVGSVFG IGVDQSTGNL YAGAFYKRFS GLTADGPGAI FRVTQGGGVL PWARLPAGTD GHPTSTVTNQ WFTTDLDRSW DLVGRAGLGS IKVDPADRNV YAVNLADKSL YRLPLGGVPP VTPAAVTEIP DPGCPGGDWR PFSIGFERGS GAMYVGGVCS AETSQDPTQL AAVVLRVSDP EGSPTFDIAQ IVSLGYDRVN RAPVNHDNTS NWRPWPTAAT ITDRGFQSPP DDSRGTRAST SGPVLVYDES YPQLAAIGFD SDGSMLLSFR DLMGDMSGQA VPGEGSGPHQ SVVAQGDLLR AAPNGSGGFT LESRGVVGGR TGFGLIPVSS GGYGPIGPGG GYFYDPQPGH GFDAGPTNEQ GGLLQIPGFS DVLTTQIDAI DIRDNGLLWY DNANGGKTRR FQNVVTTVGS AGGFSKANGL GDLDAYSGRA PVQIGNRLWY DVDADGIQDT DEDPVVDTVV ELLDENGVVI DNTTTDSRGE YVFAIAPDTR YSVRVPLAQP SLDGWVVTQA FAGDDRRTDS NGREQDGRSV ASVSAHEVGR NDHSYDFGFT RARRPPPPPP PPPPPPPPPE EPQPPVPPPL APPAPPQPTY AGAEPSPNTP MVLVKYLVRR SGGRVSLRRL QFGMVLRNAG PETVNRLRLC DNLPRLLTVL GATRLARRFP RARQVCWRLA SLPSGHARKF HVLTQLRRRV LIGLLINRAR AVAQRVRPAH ARAVVRPRRP APRVTG
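The sequences in this record are displayed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names are hypothetical, and the database may compute its reported GC content differently (for example, with different rounding).

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, used here only as a short example.
example = clean_sequence("ATGAAGGGGA AGCTCGCACG")
print(len(example), round(gc_fraction(example), 2))
```

Passing the full Gene sequence field through `clean_sequence` and comparing its length with the Gene Length row is a quick way to confirm the sequence was copied completely.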