Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1403 |
Symbol | |
ID | 3786433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1605003 |
End bp | 1606880 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811491 |
Product | alpha amylase, catalytic region |
Protein accession | YP_412098 |
Protein GI | 82702532 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGACAC GACTTCACGA CATGCCTTAT GGCACTCAAA TCACGGACCA CGGAGTGCGC TTCCGTCTTT GGGCGCCTGG TTGCAAACAG GTGGGATTGT GCCTTGCAGA CGGACGGACG AATGAGAAGC CCGAACGGGC ACTCCCCATG ACTCCCCGCG CAGGAGGATG GTTCGAGTCG ACTGTAGCGG AAGCGGGAGC GGGAATACGC TATGGGTTCG ATGTGAACGG TGCCCGCGTG CCCGATCCCG CTTCGCGCTT CAATCCGGAT GATGTGCATG GTTTAAGTGA GGTGATCGAT CCTTCGGCCT TCGCCTGGCA GGATGCGAAA TGGCGTGGAC GGCCATGGGA AGAAGCCGTC ATCTATGAGC TGCATGTAGG CACGTTCTCG CCCGAAGGAA CCTTCAAGGG CGTCTCCCAG AGACTGGACT ATCTGGTCGA ACTCGGTGTG ACCGCCATTG AGCTCATGCC GGTTGCCGAT TTTCCGGGCT CACGAAACTG GGGCTATGAC GGCGTTCTGC TTTACGCGCC GGATAGCCGT TACGGACGTC CGGAGGATCT CAAGGAACTG GTTCAGGCCG CGCATGAACG GGGACTGATG ATTCTGCTCG ATGTCGTCTA CAATCACTTC GGGCCGGAAG GAAACTATCT TCACCTTTAT GCAAAGGAAT TCTTTACTGG GCGCCATCAC ACTCCGTGGG GTGCGGCAAT AAATTTCGAT GGCCCCGCAA GTGGACGGGT ACGCGAGTTT TTCATCCACA ACGCGTTGTA TTGGCTGCAA GAGTATCATT TCGACGGCCT GCGGCTCGAT GCAGTCCACG CGATCATCGA CCATTCCAGT CCGCACATCC TTGTCGAACT GGCGGAGCGT GTTCGTGCCG AGGTCGGTAC AGAGCGCCAC GTGCATCTGA TCCTGGAAAA CGATGCCAAC AATGCCCGCT ATCTGACAAA TAGTTGGTAC GACGCGCAGT GGAACGACGA TATTCACCAC GCTCTGCATG TGCTTTCTAC ACAGGAGGGC GATGGATACT ATGTGGATTA TGCCGACAAT CCGGTACGCC ATCTCGGGCG CTGCCTGGCG GAAGGGTTTG CCTATCAGGG AGAGATATCC GTTTATCGCG ACGATATGGC GCGTGGAGAG GCGAGCATTC ATCTGCCGCC CCAGGCTTTC ATTTCCTTTC TGCAATCCCA CGACCAGGCG GGCAACCGTG CATTTGGCGA GCGTATTAGT CATATTGGCG AGGAAGCGTT GACACGTATG GCGGCAGCCA TCTATTTGCT GGCTCCCGGC ATTCCCATGC TGTTCATGGG CGAGGAGTTT GCCGCAAAAT CGCCTTTCCT GTTTTTCTGC GACTTCGGAC CGGAGCTGCG CGAAGCTGTG ACACAGGGGC GACGCAGGGA ATTTGCCCGC TTTGCGCATT TTACGCAAGA CATGGACGAG ACAGCGATAC CGGACCCGAA TGCTGTTCAG ACTTTTCTTA TCTCGAAAAT CGATTGGGAT TTACTCAGGA ATGAAGCTCA CTTTGCCTGG CTGGAGTATT ACCGCAACCT TATGAAACTT CGCAGCGAAA TTATCGTACC GCGCCTTCGC GGAATGAAGG GTAATTCGGC ACATTTTGAG GTATTCGCGC CTAAATGCCT GTGGGTCTGC TGGCAGTTGG GCGACGATTC CACCTTGCGG CTATTGGCGA ATTTTTCCGA CGAGAGCGTG GCGGCTCCCC GGTTCGGCGG GCAGATCGTG TTTGCCTCAC CCGGTGCAAT ACCCCCATCT TCCCGAACAA CCAAAAGTAT CCCGGTGAAA AGTATGCTGG CGCCTCGTTC GGTAGTCTGG ATGCTTGAAC CGGCTGCCGA TAAAAGTAAA GGCAGTTTTA CCGGATAA
|
Protein sequence | MLTRLHDMPY GTQITDHGVR FRLWAPGCKQ VGLCLADGRT NEKPERALPM TPRAGGWFES TVAEAGAGIR YGFDVNGARV PDPASRFNPD DVHGLSEVID PSAFAWQDAK WRGRPWEEAV IYELHVGTFS PEGTFKGVSQ RLDYLVELGV TAIELMPVAD FPGSRNWGYD GVLLYAPDSR YGRPEDLKEL VQAAHERGLM ILLDVVYNHF GPEGNYLHLY AKEFFTGRHH TPWGAAINFD GPASGRVREF FIHNALYWLQ EYHFDGLRLD AVHAIIDHSS PHILVELAER VRAEVGTERH VHLILENDAN NARYLTNSWY DAQWNDDIHH ALHVLSTQEG DGYYVDYADN PVRHLGRCLA EGFAYQGEIS VYRDDMARGE ASIHLPPQAF ISFLQSHDQA GNRAFGERIS HIGEEALTRM AAAIYLLAPG IPMLFMGEEF AAKSPFLFFC DFGPELREAV TQGRRREFAR FAHFTQDMDE TAIPDPNAVQ TFLISKIDWD LLRNEAHFAW LEYYRNLMKL RSEIIVPRLR GMKGNSAHFE VFAPKCLWVC WQLGDDSTLR LLANFSDESV AAPRFGGQIV FASPGAIPPS SRTTKSIPVK SMLAPRSVVW MLEPAADKSK GSFTG
|
| |