Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1922 |
Symbol | |
ID | 8732363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2017079 |
End bp | 2019088 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646502539 |
Product | Endopygalactorunase-like protein |
Protein accession | YP_003393723 |
Protein GI | 284043383 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.449199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCGGC TTCGACCATC TGCCCGTGCC GCGCTGCTGT GTTGCGCGGC GCTCGCGGCG CTGCCCGCGC TCGCCGCCGC CAGAACGTAC GACGTCACCG CACCGCCGTA CTCGGCGGCC GGCAACGGCA CGACGAACGA CCGCCTCGCG ATCCAGCAGG CGATCGAGGA CGCGTCGGCC GCGGGCGGCG GGACGGTGCT GGTGCCCGCC GGGAGAACGT TCCTCTCAGG CGGCATCCGC CTGCGCTCGA ACGTCACCTT CCAGCTCGAC GGGACGTTGC AGCAGAGCTT GAACACCGCC CACTACGCGG TCGCGCCGAT GGTCGGCTGG GACGTGCCCG GCTCGACGCT CAACTGGGAC TCGACGGCGT TTCACAACCA GCCGTTCGTG TTCGCCGCCG ACGCCGAGAA CGTCACGCTG ACGACGTCGG GCAGAGGGAC GATCCAGATG GGCGTGACGC CGACCTCCGC AACCGCGATC CGCGTCGACG CGATCGGCTT CCGCGACGTC GAGAGATGCT GGATCAACAA CATCACGACG CGTGACGTGA TCGGCTTCAA CATCGTGCTC GACCGCGCGA ACCACTGCGA CATCACCGGC ACGTACCTCA ACTCGAAGGC CGGCAGCCTC GGCAGCGACG GCATCAACAT CACGGGCTCC CAGCACGTGA AGGTGCTCTA CAACCACGTC AGCGCGGGCG ACGACGGCCT CTACATCGCC GTCAGCTACG GCGACCCGCG CTTCACCGGC CCGTGGCGGG CGCCCGACAC CGGCGGCGCC GCGCGCTACA TCGAGATCGC GAACAACGAG GTCGTCGACT TCGGCCACCA GAACGCGTTC ACGCTGATCC CGTGGGGCAG CCTCGACCCC GACCAGCGCA ACGTCGAGAT ATCCGACGTC TCGATCCACG ACAACACCTT CATCGCGGAC GTCGCGCAGG CGGTCGACTG CCGCTGCGAC AACCCCTGGA GAGGCACCAG AAGATACTTC CAGGACACCG ATCGCGGCGA CCAGTCGCCG ATGACGCGCT TCTCGATGTG GAACAACGTC TTCATCTCGA GAACGAGAGT GCCGAACTTC CCGACCTGGG TCGGGGCGAC CTTCACCGAC TCGCAGTTCG GCGGCCTCTC AGGTGCGCCC GGGGCGATCA GAAGCAGTCC GTCGATCCAG AACGGCGGCT TCGAGCGGAC CGGCAGCGCG TGGTGGAGCA TCGGCGGCGT CGGCGGCGCC ACGAACGACC CGGCGATGCT GCCGGCCGGC GCGGGCGCGG CGCTCAGAGC ATTGGGCGGC TGGGCCGGGT TCGTGCTGCC GAGCGGCACC GCCACGACAA GCCTCGTGCA GGGGCTCGGG TTGGAGAACG CCGCCGACCT CGGGCTCCCG CTCGTCGGCG TCGCCGGCGC GGCGACATAC CGGCTCGACG CGACCGTCGT GACGAACGGG CAGCCGTTCC GCGTCTACGC GCACGACACG TGCGAGAACA GAGTGCTCGC GCAGCAGACG GTCTCCGCGA CGACGGCGAC GCGCGTGTCG CTCCCGTTCA GCGTCACCAG AAGCTGCGGC AACGTCCATC TCGGGATCGA CCGGGGCGGA GCCACGAGCG GCTGGGCGCT GATCGACGAC GTCGAGCTGC GCGCTCCCGT CGTCGGCAAC GAGGATCCGT CGCTGCGCAC GGTCGGCACG TGGGGCCGCG ACTGGGCCGG CGGCGACATG GGCGGCACCC ACCACCATGA CAACGGCACC GGCAGCACGG TGACGATCCC GTTCACCGGC ACGCGCGGCA GAGTCCTCGC GCCGAAGGGG CCGGGCTGGG GCATCGCGTC GGTCTCGGTC GACGGCGGGC CGGCGGTCGA CGTCGACCTC TACGGCGCCG CGGCCGCGTG GCACGCGACC GTCTTCGACA CCGGCGTCCT CCCGTTCGGG AGACACACCG TCACGATGAC CGTCTCCGGC CGCAAGCACC CCTCGTCGAC CGGCACCTGG ATCGCGTTCG ACGCGCTGCT CGTGAGCTGA
|
Protein sequence | MHRLRPSARA ALLCCAALAA LPALAAARTY DVTAPPYSAA GNGTTNDRLA IQQAIEDASA AGGGTVLVPA GRTFLSGGIR LRSNVTFQLD GTLQQSLNTA HYAVAPMVGW DVPGSTLNWD STAFHNQPFV FAADAENVTL TTSGRGTIQM GVTPTSATAI RVDAIGFRDV ERCWINNITT RDVIGFNIVL DRANHCDITG TYLNSKAGSL GSDGINITGS QHVKVLYNHV SAGDDGLYIA VSYGDPRFTG PWRAPDTGGA ARYIEIANNE VVDFGHQNAF TLIPWGSLDP DQRNVEISDV SIHDNTFIAD VAQAVDCRCD NPWRGTRRYF QDTDRGDQSP MTRFSMWNNV FISRTRVPNF PTWVGATFTD SQFGGLSGAP GAIRSSPSIQ NGGFERTGSA WWSIGGVGGA TNDPAMLPAG AGAALRALGG WAGFVLPSGT ATTSLVQGLG LENAADLGLP LVGVAGAATY RLDATVVTNG QPFRVYAHDT CENRVLAQQT VSATTATRVS LPFSVTRSCG NVHLGIDRGG ATSGWALIDD VELRAPVVGN EDPSLRTVGT WGRDWAGGDM GGTHHHDNGT GSTVTIPFTG TRGRVLAPKG PGWGIASVSV DGGPAVDVDL YGAAAAWHAT VFDTGVLPFG RHTVTMTVSG RKHPSSTGTW IAFDALLVS
|
| |