Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_1791 |
Symbol | |
ID | 8325976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 1967594 |
End bp | 1970335 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644942340 |
Product | cellulose-binding family II |
Protein accession | YP_003099585 |
Protein GI | 256375925 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.381117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCGC GCCAGCCCGA GCCCACCCCG CCCGTCACCG CACGGGCGCG GCGCGGGAGC CGATTACCCG CCCTCCTGGC CGCCCTCGTC ACGGCCGCCG CCACGCTCGT CACCGTCGCC GCCACCAGCC CCGCCCCCGT CGCGCAGGCC GCCCCGGCGA GCCAGCCGCA CACCTGGAGG AACGTCGAGA TCCAGGGCGG CGGGTTCGTG CCCGGCATCG TCTTCAGCCC CGCCGAGCGG AACCTGATCT ACGCCCGCAC CGACATCGGC GGCGCCTACC GCTGGAACCA GGCCACCGGC CGCTGGGTCC CGCTGCTCGA CGCGATCGGC TGGGACCGCT GGGGCCACAA CGGCGTCATC TCGATCGCCC CCGACCCCGT GAACGCCGCC AAGGTCTACG CCGCCGTCGG CATGTACACC AACGACTGGG ACCCCGACAA CGGCGCCGTC CTGCGCTCCT CCGACCAAGG CGCGACCTGG CAGTCCACCG CCCTCCCGTT CAAGCTCGGC GGCAACATGC CGGGCCGGGG CATGGGCGAG CGCCTCGCCG TCGACCCCAA CGACCCGCGC GTGCTGTACC TCGGCGCCCC CAGCGGCAAC GGCCTGTGGC GCAGCACCGA CAGCGGCGTC ACCTGGGCGA AGGTCACGGC GTTCCCGAAC CCCGGCAACC ACGTCGCCGA CCCGAACGAC ACCACCGGCT ACAACAGCGA CAACCAGGGC GTCACCTGGG TCACCTACGA CCCGCGCTCG TCCGCGTCCG GCGCGCCCAG CCGCACGATC TACGTGGGCG TCGCCGACAA GCAGAACACC GTCTACCGCA CCACCGACGC GGGCGCGACG TGGTCCCGCC TCGCGGGCCA GCCCACCGGC TACCTCGCCC ACAAGGGCGT CCTGGACCCC GTCGGCGGCT TCCTCTACCT CGCCACCAGC GACACCGGCG GCCCGTACGA CGGCGCCAAG GGCGACGTGT GGAAGCACAA CACCGCCACC GGCGAGTGGA CCAGGATCAG CCCCATCCCG TCCGACAGCG CCGACGACCA CTTCGGCTAC AGCGGCCTGA CCGTCGACCG CACCAACCCG AACACGCTCA TGGTCACCAC CCAGATCTCC TGGTGGCCCG ACATCATCGT GTTCCGCTCC ACCGACGCGG GCGCCACCTG GACCCGCATC TGGGACTTCA CCAGCTACCC CAACCGCAGC TTCCGCTACA CCATGGACAT CACCGCCTCC CCGTGGCTGA GCTGGGGAGC CACCCCGCAG CCGCCCGAGG TCACGCCCAA GCTCGGCTGG ATGACCGAGT CCCTGGAGAT CGACCCGTTC GACCCGGACC GGATGCTCTA CGGCACCGGC GCGACGATCT ACGGCGCCAC CAACCTCACC GCGTGGGACC GGGGCGGCCA GATCACCCTC AAGCCGACGG CCGCAGGTCT TGAGGAGACC AGCGTGCTCG ACCTGGTCAG CCCGCCCACC GGCGCGGCCC CGCTGGTCAG CGCGCTCGGC GACATCGGCG GGTTCCGCCA CGCGGACCTG GCCGAGGTGC CGCCGATGAT GTTCACGCAG CCCAACCTCA CCTCCACCAC CAGCCTCGAC TACGCCGAGT CCAACGCCTC GGTCATGGTC CGCTCCGGCA ACTCCGACGC CGCCCCGCAC GTCGCGTTCT CCACCGACGG CGGCGCGAAC TGGTTCGGCG GCGCCGATCC CGGCGGGGTC ACCGGCGGCG GCACGGTCGC GGCGGCGGCG GACGGCAGCC GGTTCGTGTG GAGCCCGGCG GGCGCGGGGG TGCACCTCTC GGTGGGCTTC GGCAACTCCT GGACCGCGTC GAGCGGGGTG CCCGCGGGCG CGGTGGTCGA GTCGGACCGG GTGAACCCGA GGAAGTTCTA CGCCCTGGCC GCCAACCGGG TGTACACCAG CGTCGACGGC GGCGCGACGT TCACGGCGGG CGCGTCGGTG GCGGCGACCA AGCTGCACGC CGTGCCCGGC CGCGAGGGCG AGCTGTGGCT GGCGGGCGAG GGCGGGCTGT TCCGCTCCAC CGACGGCGGC GCGAGCGCGG TCGAGCAGGC GAACGTCACC AGCGCGGTGA ACGTCGGCTT CGGCAAGGCC GCGCCCGGCA GCGCCTACCC GGCGCTGTAC GCGGTGCTCA CCACCGGCGG CGTGACCGGC GTGTTCCGCT CCGACGACAC CGGGGCGAAC TGGGTGCGGA TCAACGACGA CCAGCACCAG TACGGCAACG CGGGCGAGGC GATCACCGGC GACCCGAGGC TGTACGGGCG GGTCTACCTG GGCACCAACG GGCGCGGAAT CCTGTACGCC GACCCGAGCG GGCCGCCGCC GAGCACGACG ACCACGACCA CCACGCCCGG CGGTGGGACC ACGACCACGA CGACGACGAC GACCACTACC ACCACCACGG TTCCGCCTTC GGCGTCCTGC GCGGTCGAGT ACCGGGTCAC GAACCAGTGG TCCGGCGGCT TCCAGGGGTC GGTGCGGATC ACCAACCGGG GCGCGACCGC CGTCACCGGG TGGGCGCTGA AGTGGACCTA CGCGAGCGGT CAGCGGGTGA CCGGCGCGTG GAACGGCAAG GCCGCGCAGA GCGGCGCCGA GGTCACGGTG ACCAACGAGG GCTGGAACGG GACCATCGGC GCCGGGGGCG CGGTGGAGTT CGGATTCCAG GCGAGCTGGC AGGGCAGCAA CGCCAACCCC GCCGCGTTCA CGCTCAACGG CGGATCCTGT TCACCGGTGT GA
|
Protein sequence | MAPRQPEPTP PVTARARRGS RLPALLAALV TAAATLVTVA ATSPAPVAQA APASQPHTWR NVEIQGGGFV PGIVFSPAER NLIYARTDIG GAYRWNQATG RWVPLLDAIG WDRWGHNGVI SIAPDPVNAA KVYAAVGMYT NDWDPDNGAV LRSSDQGATW QSTALPFKLG GNMPGRGMGE RLAVDPNDPR VLYLGAPSGN GLWRSTDSGV TWAKVTAFPN PGNHVADPND TTGYNSDNQG VTWVTYDPRS SASGAPSRTI YVGVADKQNT VYRTTDAGAT WSRLAGQPTG YLAHKGVLDP VGGFLYLATS DTGGPYDGAK GDVWKHNTAT GEWTRISPIP SDSADDHFGY SGLTVDRTNP NTLMVTTQIS WWPDIIVFRS TDAGATWTRI WDFTSYPNRS FRYTMDITAS PWLSWGATPQ PPEVTPKLGW MTESLEIDPF DPDRMLYGTG ATIYGATNLT AWDRGGQITL KPTAAGLEET SVLDLVSPPT GAAPLVSALG DIGGFRHADL AEVPPMMFTQ PNLTSTTSLD YAESNASVMV RSGNSDAAPH VAFSTDGGAN WFGGADPGGV TGGGTVAAAA DGSRFVWSPA GAGVHLSVGF GNSWTASSGV PAGAVVESDR VNPRKFYALA ANRVYTSVDG GATFTAGASV AATKLHAVPG REGELWLAGE GGLFRSTDGG ASAVEQANVT SAVNVGFGKA APGSAYPALY AVLTTGGVTG VFRSDDTGAN WVRINDDQHQ YGNAGEAITG DPRLYGRVYL GTNGRGILYA DPSGPPPSTT TTTTTPGGGT TTTTTTTTTT TTTVPPSASC AVEYRVTNQW SGGFQGSVRI TNRGATAVTG WALKWTYASG QRVTGAWNGK AAQSGAEVTV TNEGWNGTIG AGGAVEFGFQ ASWQGSNANP AAFTLNGGSC SPV
|
| |