Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_0125 |
Symbol | |
ID | 8730553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 123264 |
End bp | 125165 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646500739 |
Product | Endopygalactorunase-like protein |
Protein accession | YP_003391936 |
Protein GI | 284041596 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.300956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0208856 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGAC TGCTTGTCAG CGGCCTGGTG GCCGCGGTGA CGGCATGCCT TGCCCTGCCC GCGGCCGCCT CCGCGCTCGA CGTGGACGTG ACCGCAGCGC CGTATGGAGC TGCCGGGGAC GGCGTGACGA ACGACCGTGC GGCGATCCAG AGCGCGATCG ACGACGTCAG CGCGGCGGGT GGCGGGAAGG TGACGCTGCC CGCGCCGAGG ACGTTCCTGT CCGGCGACGT CTCGGTCAAG AGCAACGTGA CGCTGGAGAT CGCCGGCGGC GCGACGCTGA AGATGAGCCA GAACCAGTCG CACTGGGCGC ACAGACCGGT GCTCGGCCAC ATGATCGACG GCACGATCGA GTGGAACATC GCGATGTACC GGAACTATCC GCTGATCCAT TCCGGCAACG CGAGAAACGT CACCGTGACG GGCGGCGGCA CGATCGAGAT GACGCGCGCG CTGACCGACG AGACGACGAT CCACGCGATG GGGATCGGCT TCCACAAGGT CGACGGGTTC AGAATCTCCA ACCTGACGAT GATCGGCGCG AGCGCGAGCA ACGTCTCGCT CTACACCGTC GACAACGGGA TCGTCTCCGG GATGACGATG AGAGACGACC TCGACACCAA CGTCGAGGGC GTCGCGATCG GCAACTCGCA GCACGTGCGC GTCACCGGCA ACACGATGGA CGCGACCAGC GACGACACGA TCGTGCTGTG GACGCAGTGG AACGACCCGC GCGGCACGAC GTGGTGGTCG ACGGCCGTGC CCGAGCCGAT CAACGACATC GAGATCGACA ACAACTACGC GACCTCGACC TGCTGCAAGG CGATCGCGCT GATCCCCTGG GGCACCGCGT ACGGCGACCA GCGCCAGGTC GAGATGAGCG ACGTGCGCGT CCACGACAAC ACGCTCGCGG CGACCGACGC GGTCGGCTGC TGGTGCGACG ACCCGTACAC CGGCGAGCCG CCGTCGAGAT TCACCAACCA GGAGCAGGAC CAGGCGCCGA TGAAGGACCT GACGTTCGCC AGAAACAGAT ACACCGGATC GACCGCGCTG ACGAAGGCGC ACATCCAGAA CCTCGTCGCC GACTTCGCCA AGACGAGCCC GACGTTCATC CGCAACGGCG GCTTCGAGCG CACCGGCCAC GCGTACTGGT CGCGCGTGAG CAGAGCCGGC CAGTCCGACG CCGCCGACTA CTCCGTCGGC CAGAACGGCT CGTGGTACGG CTACATCCAG TACTTCGACG TCGGCTACAC GACGCTCTAC CAGGGCGTGT CGCTGACCGG CGGGCAGACG TACACGCTGC GCGCGCGGAT CCAGACACCG GACGGGAGCC CGGCGCGGAT GTATGTGTAC GACACGTGCG GCGGCACGGG CGTGACGCAG AACGTCAGCT CGCTCGCGTG GACCGACGTC AGCCTCAGAT TCACCGCGGC GTCGAGCTGC GGCCGGTACA TAGTCGGGTT CGACTCGGGC ACGTTCACCT CAGGCGGCCT GCGCGTCGAC GACGTGCTGC TCGAAGGGCC GCGGATCGAC AACGACGACC CGGTGATCTC GTACGAGGGC CTCTGGTACC TCTACGAGCG CGCCGGCGAC CACGGCGGGA CGGAGCGGAT GGCGACGCAG GCCGGCACCA GATCGAGCGC GACCGTCACG TTCCGTGGCA CGCGCGCGAG AGTGCTCGCC GTCCGCGGCG GCGACCGCGG CAGAGCCGAC GTCTACCTCG ACGGCGTCTA CAAGACGACG ATCGACCAGT ACAGCCCGAC GACCGACCTG CAGTACGTCA CGTACGACAC CGGCACGGTC GCGGCCGGGA CGCACGAGCT GAGAGTGGTG CCGACCTGGA CCAAGCACCC CGCGTCGGCG AACACGGTGA TCTCGGTCGA CGCGGTCGAG GTCGTACGGT AG
|
Protein sequence | MRRLLVSGLV AAVTACLALP AAASALDVDV TAAPYGAAGD GVTNDRAAIQ SAIDDVSAAG GGKVTLPAPR TFLSGDVSVK SNVTLEIAGG ATLKMSQNQS HWAHRPVLGH MIDGTIEWNI AMYRNYPLIH SGNARNVTVT GGGTIEMTRA LTDETTIHAM GIGFHKVDGF RISNLTMIGA SASNVSLYTV DNGIVSGMTM RDDLDTNVEG VAIGNSQHVR VTGNTMDATS DDTIVLWTQW NDPRGTTWWS TAVPEPINDI EIDNNYATST CCKAIALIPW GTAYGDQRQV EMSDVRVHDN TLAATDAVGC WCDDPYTGEP PSRFTNQEQD QAPMKDLTFA RNRYTGSTAL TKAHIQNLVA DFAKTSPTFI RNGGFERTGH AYWSRVSRAG QSDAADYSVG QNGSWYGYIQ YFDVGYTTLY QGVSLTGGQT YTLRARIQTP DGSPARMYVY DTCGGTGVTQ NVSSLAWTDV SLRFTAASSC GRYIVGFDSG TFTSGGLRVD DVLLEGPRID NDDPVISYEG LWYLYERAGD HGGTERMATQ AGTRSSATVT FRGTRARVLA VRGGDRGRAD VYLDGVYKTT IDQYSPTTDL QYVTYDTGTV AAGTHELRVV PTWTKHPASA NTVISVDAVE VVR
|
| |