Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1404 |
Symbol | |
ID | 3786434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1606903 |
End bp | 1610205 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811492 |
Product | Alpha amylase, catalytic region |
Protein accession | YP_412099 |
Protein GI | 82702533 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACGAG AAGAAGACCC GCTCTGGTAC AAGGATGCCA TTATCTATGA ATTGCACGTC AAGACATTCT TCGACAGCAA TGGCGACGGC ATAGGAGACT TTTCGGGCCT GATTTCCAAG CTCGACTATC TGGCGGAACT GGGCGTTACC GCGCTCTGGC TTCTTCCATT TTATCCGTCT CCGGGACGGG ATGACGGTTA CGATATTTCC GATTATCACA ATGTGCACCC GGATGTCGGC ACCCTGGAGG ACTTTCACCG GTTTATCGCG GAAGCTCACA GTCGAGGTTT ACGCGTAATC ACGGAACTGG TGCTCAATCA TACTTCCGAT CAACATCCAT GGTTTCAGGC AGCGCGCCGC GCCCCACCTG GGTCCGTCAA GCGCGACTAT TACGTATGGA GCGACGACAA CACCAAGTAC AGCGGTGCAC GCAGTATTTT CACGGACACA GAGGGATCCA ACTGGGAATG GGATGATGTA GCCCAGGCCT ATTACTGGCA CCGCTTTTTT TTCCATCAAC CGGACTTGAA TTTTGATAAC CCCCATGTCT TTAACGCCAT GATGCATGTG ATGCGGCTTT GGCTGGACGC CGGTGTCGAT GGCATGCGGC TGGATGCCAT GCCTTATCTG TGCGAGCGTG AGGGGACCAA CTGCGAAAAC CTGCCGGAAA CTCATGCGGT GATCAAACGC ATGCGCTCCG AACTGGATAA ACACTACCGC AACCGGATGT TCCTGGCTGA GGCGAACCAG TGGCCGGAGG ATGTACGCGA GTATTTTGGC GAGGGCGACG AATGCCATAT GGCTTTCCAT TTCCCACTCA TGCCGCGAAT GTACATGGCT ATTGCGCGCG AAGATCGTCA TCCCATTGTA GAGATCATGG AGCAGACACC AGACATCCCC GAGAATTGTC AGTGGGCAGT GTTTCTGCGC AATCACGACG AACTGACATT GGAGATGGTC ACCGACCGCG AGCGGGATTA TCTTCACCAG ACCTACGCAA TCGACCCTCA GGCCCGCCTG CATCTCGGTA TTCGCCGGCG TCTGGCGCCG CTGATGGACA ATGACCGGCA CCGAATCGAA TTGATGAATC TGCTGCTCAT GACCATGCCA GGGTCGCCCG TGCTCTACTA CGGAGATGAA ATCGGTATGG GTGATGACCT TCTCCTCGGT GATCGAAATG GGGTGCGCAC GCCCATGCAG TGGTCGGGTG CAGTTAACGG CGGATTTTCA ACGGCTGATT CTCAACGGCT TTTCCTGCCG GCCATCATCG ACCCCGTATA TGGGTTTGGG GCAGTGAACG TGGACTCACA GAGACGGAAT TCGTCATCCC TGCTCAACTG GATGAAGCGC CTGATTGCGA TGCGCAAGGC ACACCGGACC TTTGGCCGGG GCACGCTGCG TTTTTTACGG CCTGGCAATC GCAAGATTCT TGCTTATCTG CGTGAGCATG AGGATGAAAC GATTCTGTGC GTGGCTAATC TGTCCCGCGT CGCGCAGCCG GTAGAGCTGG ATTTAAGCCA ATTCAGGGGA AGGGTACCGG TTGAATTGAT GGGGCGCACG CCTTTCCCAC CCGTGGGTGA GCTGCCGTAT CTTCTTACCT TGAGCGCTCA TGGCTTTTAT GCTTTTCGCC TGACAGCCGA TGTGGCCGCC CCTGCCTGGC ATGAGGAGCG GCAGGTATCG CCGGATCTGC CTGTCCTTGT GCTTGTGGAT TCGGGTTGGG GCACCCTGTT GAACCGGGGC GAGGGTAACG GGGGCATGAA AGACCTGATG GCTCGTCGTG CCCGCCAACA GCTCGAGGAG CAGATCATGC CTCGCTTCTT TTATTCCCAA CCCTGGTTTT TGATGCGAAA TCTCCCCGTC AGAAAATTTG AGCTGGGGGA GATGCACGAG TGGTCCGCGG AGCAGGGAAG CTGGCTGCTT GCCACCGTTG TTTTAACGCT GGCCAATGAT GAGACTTATC GCTTTGCCGT GCCTTTGGCA CTGGTGTGGG AGGACGAGGA TGAGGCGATT GTGAGCACGC TGCTGCATGC CACGCTTGCC AAGGTTCGCC GCCGGGAGCG GACGGGCGTG CTGTTCGACG CCTTCTGGGA TGACGGCTTC TGCCGCGCCG TGATCTCCAG CATGCATGAG GGTTCCGCAC TCCTGTTTAA GCGGGGGCAA GTATCTTTCC ATGCGACCAC CGCTTTTCCC GGCCCGGTGG TTTCAGGCGC GTCAACAACG GTGACCCGGA CAGTTTCGGA GAGAGGGCGA TTGTTCGTGA ACATGGGCGA CCGGCTGGTA CTGAAAGGAT ACCGCTGGCT TCTTCCCGGC GTGCATCCCG AGCTGGAGAT GTCGCGTTTT CTGACGGAGA CGGCAAAATT TACCCACATG GCGCAACTCG CCGGCACCGT GGAGTACACG GACAGTGAGG AAGGCAATTC CACTCTGGCA ATCCTCGAGC ACTACGCCGA GAATCAGGGT AGCGCCTGGG CTTATACACA GGACTATCTG CAACGCTACC TGGATGAATG CCGCACACAA CAAAAGCGTC CCATTGATTC GCGGCATATT GCCTACATGA CCCTCATCAA TACGCTGGGA TTGCGTACGG CGGAATTTCA CCGGGCACTT GCGCAGGATG ATGCAGAGGG AGCTTTCGGC GTCGAGCCCA TTACCTCCGA AGATCTTGCG CAGTGGGCGA GCACCGTGCG TGCGCAGATG GATGAAATGT ACAAATTGCT GGAAGCGAAA TGGCCGGATG TACCCAAATC CGCGCAGGAG GCCGGCAACG ATCTTTTATC AGCCCGGTCG AAATTCTACC GTCGTATCAC CCGCCTTGCA GCCATACATC CCCAGGCCTT GAAGGCGCGT TGTCATGGGG ATTACAGCCT GCGTCAGGTG TGGCTCTCGA ATAATGATTT TCTGATTACG AATTACGGCG GCGGTGCCGA ACGCGCATGG CGTGAGCGCC GCTGGAAACA GAGTCCCCTC CGCGATGTGG CAGGTATGCT GTTTTCGTTT TCCGAGGTGG CGGCAGCGGC ACTGGAGCAT GTCACGGATG AATATCCGGA ATCAACCATT ATGCTTGCAC AACAAGCTGA TAAATGGCGA GTGCTTGCCA GCGGCGATTT CCTCAAAAGC TATCGCAGGG CGATGAAGGG AAATTCCCTG TTCCCCGCTG ATGCCGGAGT AACCGATGCT TTGGTTACGC TCTTCATGGT GGAGAAAGCT GTTGCCAGCG TGAGTAACGC GCTCGCGCAA CAATCGAAGG CAGTCGATGG AACCATGCAG CGACTGATAC GGCTGATGCA ACACAGAAGG TAG
|
Protein sequence | MGREEDPLWY KDAIIYELHV KTFFDSNGDG IGDFSGLISK LDYLAELGVT ALWLLPFYPS PGRDDGYDIS DYHNVHPDVG TLEDFHRFIA EAHSRGLRVI TELVLNHTSD QHPWFQAARR APPGSVKRDY YVWSDDNTKY SGARSIFTDT EGSNWEWDDV AQAYYWHRFF FHQPDLNFDN PHVFNAMMHV MRLWLDAGVD GMRLDAMPYL CEREGTNCEN LPETHAVIKR MRSELDKHYR NRMFLAEANQ WPEDVREYFG EGDECHMAFH FPLMPRMYMA IAREDRHPIV EIMEQTPDIP ENCQWAVFLR NHDELTLEMV TDRERDYLHQ TYAIDPQARL HLGIRRRLAP LMDNDRHRIE LMNLLLMTMP GSPVLYYGDE IGMGDDLLLG DRNGVRTPMQ WSGAVNGGFS TADSQRLFLP AIIDPVYGFG AVNVDSQRRN SSSLLNWMKR LIAMRKAHRT FGRGTLRFLR PGNRKILAYL REHEDETILC VANLSRVAQP VELDLSQFRG RVPVELMGRT PFPPVGELPY LLTLSAHGFY AFRLTADVAA PAWHEERQVS PDLPVLVLVD SGWGTLLNRG EGNGGMKDLM ARRARQQLEE QIMPRFFYSQ PWFLMRNLPV RKFELGEMHE WSAEQGSWLL ATVVLTLAND ETYRFAVPLA LVWEDEDEAI VSTLLHATLA KVRRRERTGV LFDAFWDDGF CRAVISSMHE GSALLFKRGQ VSFHATTAFP GPVVSGASTT VTRTVSERGR LFVNMGDRLV LKGYRWLLPG VHPELEMSRF LTETAKFTHM AQLAGTVEYT DSEEGNSTLA ILEHYAENQG SAWAYTQDYL QRYLDECRTQ QKRPIDSRHI AYMTLINTLG LRTAEFHRAL AQDDAEGAFG VEPITSEDLA QWASTVRAQM DEMYKLLEAK WPDVPKSAQE AGNDLLSARS KFYRRITRLA AIHPQALKAR CHGDYSLRQV WLSNNDFLIT NYGGGAERAW RERRWKQSPL RDVAGMLFSF SEVAAAALEH VTDEYPESTI MLAQQADKWR VLASGDFLKS YRRAMKGNSL FPADAGVTDA LVTLFMVEKA VASVSNALAQ QSKAVDGTMQ RLIRLMQHRR
|
| |