Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1555 |
Symbol | |
ID | 8731995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1644915 |
End bp | 1646852 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646502173 |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_003393358 |
Protein GI | 284043018 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.541945 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA GGGCCGCCCC GAGAAGAGGC CGGCGCGCGA CGCGTCCGTC CGCCCGCTCC AAGCCGCGCG CGAACGCGAG CAGCCGCCGC AGAGCGCCCG CCAGCTCGCC GTTCCCGCCG ATCGGCGACT ACGCGTTCCT GTCCGACTGC CACACCGGCG CGCTGATGGC GCCCGACGGC ACGATCGAGT GGCTGTGCGC GCCGCGCTTC GACTCGCCGA GCATCTTCGG AGCGCTGCTG GACCGCGGCG CCGGCGGCTT CCGGCTCGGC CCGTACGGCA CGACCGTCCC CGGTGCGCGT CGCTACGAGC CCGGCACGAA CATCGTCGAG ACGACGTGGA TGACCCAGTC GGGCTGGGTG ATCGTGCGCG ACGCGCTCGC GATCGGCCCG TGGAGCGACG ACAACAGCGC GGTCACGACG CGGACGCGGC CGCCGACCGA CGACGACGCC GAGCACATGC TGGTGCGGAC GATCGAGTGC ATCCAGGGCA GCGTGCAGAT CGAGATGGTC TGCGAGCCGA TGTTCGACTA CGCGCGTGAG CCGGCCGAGT GGACGGTCGT CGGCGACGGC TACGACGCCG TCGACGCCCA CCACCACACC GCCTCGCTGC GGCTGACGAG CGACCTGCGG ATCGGGATCG AGGGCAATCG CGCTCGTGCG CGCCACACGC TGCGGGAGGG CGAGACGCGC TTCGTCGCGC TGTCGTGGGG ACAGGGCCTG CGCGCGCCGC AGGACGCGGC GGAGGCGACG GCGATGCTGG AGCGCACGTC GCACTTCTGG CGCGACTGGC TCGACGACGG CCACTTCCCC GACCATCCGT GGCGCGTCTA CCTGCAGCGC TCGGCGCTGG TGCTGAAGGG GCTGACGTAC GCGCCGACCG GGGCGATGGT CGCCGCGCTG ACGACGTCGC TGCCGGAGAC GCCCGGCGGC GAGCGCAACT GGGACTACCG CTACACGTGG ATGCGCGACG CGACGTTCAC GCTGTGGGGG CTGCACGCGC TCGGGCTCGA CTGGGAGGCG GACGACTTCA TGCAGTTCGT CGCCGACGTG CCGCGCAACC CGGACGGCTC GCTGCAGATC ATGTACGGCA TCGACGGCGA GAAGGAGCTG ACCGAGCGGA CGCTCGACCA CCTCACCGGC TACGACGGCG CGCGGCCGGT GCGGATCGGC AACGGCGCCT TCGACCAGCG CCAGAACGAC GTCTACGGCG CGGTGCTCGA CTCGGTCTAC CTGCACACGA AGGTCGGCGG CCGGCTGCCG CAGCGGCTGT GGCCGGTGCT CGAGGCGCAG GTTCGCTGCG CGATCGAGGT CTGGGACGAG CCCGACCAGG GGATCTGGGA GGCGCGCGGC GAGCCGAAGC ACTATGTCTC CTCGAAGCTG ATGTGCTGGG TCGCGCTCGA CCGCGGCGCG CGGCTGGCGG AGATCCACGG CGAGCCGGAG CTGGCGGGCG AGTGGCAGGC GGTCGCCGAC CAGATCAAGG CGGACATCCT CGAGCACGGC GTCCGCGACG GCGTCTTCCG CCAGCACTAC GACACCGACG CGCTCGACGC CTCGACGCTG CTGGTCCCGC TCGTGCGCTT CCTCGGTCCC GAGGACGAGC GCGTCGAGAG AACCGTGCGC GCGATCGCGC GCGACCTGAC CGACCACGGC TTCGTGCTGC GCTACCGCAC CGAGGAGACC GACGACGGCC TCTCAGGCGA GGAGGGGACG TTCCTGATTT GCTCGTTCTG GCTCGTCGCG GCGCTGGAGG AGATAGGCGA CCATCGCCGC GCGGTGACGC TGCTGGAGCG GCTGCTGGCG GGCGCCTCGA AGCTCTACCT GTTCGCGGAG GAGCTGGACC CGCACAGTGG GCGGCAGCTC GGCAACTACC CGCAGGCGTT CACGCACCTC GCGCTGATCA ACGCCGTGAT GCACGTGATC GAGGGGGAGA CGACCTAG
|
Protein sequence | MSERAAPRRG RRATRPSARS KPRANASSRR RAPASSPFPP IGDYAFLSDC HTGALMAPDG TIEWLCAPRF DSPSIFGALL DRGAGGFRLG PYGTTVPGAR RYEPGTNIVE TTWMTQSGWV IVRDALAIGP WSDDNSAVTT RTRPPTDDDA EHMLVRTIEC IQGSVQIEMV CEPMFDYARE PAEWTVVGDG YDAVDAHHHT ASLRLTSDLR IGIEGNRARA RHTLREGETR FVALSWGQGL RAPQDAAEAT AMLERTSHFW RDWLDDGHFP DHPWRVYLQR SALVLKGLTY APTGAMVAAL TTSLPETPGG ERNWDYRYTW MRDATFTLWG LHALGLDWEA DDFMQFVADV PRNPDGSLQI MYGIDGEKEL TERTLDHLTG YDGARPVRIG NGAFDQRQND VYGAVLDSVY LHTKVGGRLP QRLWPVLEAQ VRCAIEVWDE PDQGIWEARG EPKHYVSSKL MCWVALDRGA RLAEIHGEPE LAGEWQAVAD QIKADILEHG VRDGVFRQHY DTDALDASTL LVPLVRFLGP EDERVERTVR AIARDLTDHG FVLRYRTEET DDGLSGEEGT FLICSFWLVA ALEEIGDHRR AVTLLERLLA GASKLYLFAE ELDPHSGRQL GNYPQAFTHL ALINAVMHVI EGETT
|
| |