Gene Information |
Locus tag | Namu_3302 |
Symbol | |
ID | 8448917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3629267 |
End bp | 3630967 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645042381 |
Product | alpha amylase catalytic region |
Protein accession | YP_003202621 |
Protein GI | 258653465 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
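The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Namu_3302
length_bp = gene_length(3629267, 3630967)
print(length_bp, protein_length(length_bp))
```

With the values listed for Namu_3302, this gives 1701 bp and 566 aa, which agrees with the Gene Length and Protein Length fields.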
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.15105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0149256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGATCG AGGAAACCGT GGCCTTCAGC GACATCACGG CCGACCTGGA GTCGACCCCC GGCCAGGACT GGCGCCGCAC CGCGGTCGTC TACCAGATCT ACCCGCGCTC GTTCGCCGAC TCCAACGGTG ACGGGATCGG CGACCTGCCC GGGATCAACC ATCGCCTGCC GGCCCTGGCC GAGCTGGGCG TGGACGCCAT CTGGCTGTCC CCCTTCTACA AGTCCCCGCA GGCCGACGCC GGGTACGACG TGGCCGACTA CCGCGACGTC GACCCCGTTT TCGGCACCCT GGCGGACTTC GACGCGATGC TCGAGCGGGC CCACGGGCTG GGTCTGAAGG TGATCGTCGA CCTGGTGCCC AACCATTCCT CCGACGAGCA CGTCTGGTTC CAGCAGGCCC TGGCCGCGGC CCCCGGCTCC CCCGAGCGGG ACCGCTACGT CTTCCGCGAG GGACGCGGCG AGCACGGCGA GCTGCCGCCG AACAACTGGG AATCCTGCTT CCGCGGGCCG GCCTGGACCC GGACCACCAA CCCCGACGGC ACCCCCGGCC AGTGGTACCT GCACCTGTTC GACACCAAGC AGCCCGACTG GAACTGGGAG AACCCGCAGG TTCGGGCCGA GTTCCTGGAC GTGTTGCGGT TCTGGCTGGA CCGCGGCGTG GACGGCTTCC GGGTGGACGT GGCCCACGCG CTGATCAAGG CCCCGGGCCT GCCCGACATG AAGGCCTCCG ACGAGGTCGT CGAGGACAGT GAGGGCTACT TCCACACCGG CCCCATGTGG GACCAGGACC GGGTGCACGA GGTCTACCGG GAGTGGCGGG CCCTGCTGGA CACCTACTCC CCCGACCGGA TCCTGTGCGC CGAGGCGTGG GTGCCCTCGC TGTCCCGGCT GGCCCGCTAC GTCCGCGAGG ACGAGATGCA CCAGGCGTTC AACTTCGACT ACCTGGAGAG CGAGTGGGAC GCCACCCACC TGCGCTCGGT GATCGATTCC TCGGTCGCCG CCAACGACGA GGTCGGCGCG CCCACCACCT GGGTGCTGTC CAACCACGAT GTGGTCCGCC ATGTCTCGCG GCTGGGCCTG CCGGCCGGTC CGCGGCCCAA CGGCATCCGG GCCCAGGATC AGCAACCCGA CTACCAGCTC GGCCTGCGCC GGGCCCGGGC GTCGACCCTG TTGATGCTGG CCCTGCCCGG TTCGGCCTAT ATCTACCAGG GCGAGGAGCT CGGGCTGCCC GACCACACCG AGCTCGACGA CGACCTGCGT CAGGACCCGA CCTGGTGGCG CTCCGGTTAC ACCGAGGCCG GCCGCGACGG CTGCCGCGTG CCGCTGCCCT GGGAGGCCGG CGAGCCCGGC CTGGGCTTCG GACCGGGCGG CGCGACCTGG CTGCCGCAGC CCGCGTCCTA CGCGGAGCTG GCTCGTGACA AGCAGGAGGA CGTCGAGGGT TCGACCCTGG AGATGTACCG CACCGCGCTG GCGTTCCGGC GCGCGTTCGC GCTCGCCGTC GGCGACCTGG AGTGGGTCGA CGCCGAGCCG GGCGTCGTGC GGTTCGTCAA CGGTGAACTG ACCATCGCCG CCAACACCGG GACCGAAGCG GTTCCGATGC CGGTCGGCGA GCTGCTGATG GCCTCCGGCG AGCTGACCGA TCACTCGGTG CTCCCGCCCG ACACCACGGT CTGGTTGCTC ACCGAGGTCT TCAGCGAGTA G
Protein sequence | MTIEETVAFS DITADLESTP GQDWRRTAVV YQIYPRSFAD SNGDGIGDLP GINHRLPALA ELGVDAIWLS PFYKSPQADA GYDVADYRDV DPVFGTLADF DAMLERAHGL GLKVIVDLVP NHSSDEHVWF QQALAAAPGS PERDRYVFRE GRGEHGELPP NNWESCFRGP AWTRTTNPDG TPGQWYLHLF DTKQPDWNWE NPQVRAEFLD VLRFWLDRGV DGFRVDVAHA LIKAPGLPDM KASDEVVEDS EGYFHTGPMW DQDRVHEVYR EWRALLDTYS PDRILCAEAW VPSLSRLARY VREDEMHQAF NFDYLESEWD ATHLRSVIDS SVAANDEVGA PTTWVLSNHD VVRHVSRLGL PAGPRPNGIR AQDQQPDYQL GLRRARASTL LMLALPGSAY IYQGEELGLP DHTELDDDLR QDPTWWRSGY TEAGRDGCRV PLPWEAGEPG LGFGPGGATW LPQPASYAEL ARDKQEDVEG STLEMYRTAL AFRRAFALAV GDLEWVDAEP GVVRFVNGEL TIAANTGTEA VPMPVGELLM ASGELTDHSV LPPDTTVWLL TEVFSE
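The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch under stated assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table holds the standard amino-acid assignments (which table 11 shares with table 1 for elongation codons), and translation stops at the first in-frame stop codon. Whitespace is stripped first because the record prints sequences in blocks of 10.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0 until the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record
print(translate("ATGACGATCG AGGAAACCGT GGCCTTCAGC"))
```

The first 30 bases of the gene sequence translate to MTIEETVAFS, matching the start of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the 71% reported in the record.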