Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0830 |
Symbol | |
ID | 3707135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 907053 |
End bp | 910367 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637737332 |
Product | Alpha amylase, catalytic region |
Protein accession | YP_342873 |
Protein GI | 77164348 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCAC GGGGGGAAGA ATTAGCCTGG CAAGATGATC CGCTTTGGTA TAAAGATGCC ATTATCTACC AACTCCATGT TAGGGCTTTT TTTGATAGTA ATAATGATGG CACGGGAGAT TTCCGGGGCC TTACCCAGAA GTTGGACTAT ATCCAAGATC TGGGCGTAAA TGCGATCTGG CTGCTGCCCT TTTATCCTTC TCCTTTGCGA GATGATGGCT ATGATATTGC GGATTACCGC AATATCTATC CGGATTATGG GACTCGCAGG GATGTGAGGC ACTTTGTGCG GGAAGCCCAC CGCCGTGGCC TCAAGGTTAT TACCGAGCTA GTCATCAACC ATACCTCGGA TCAGCACCCT TGGTTTCAGG CTGCCCGCAG GGCGCCACCA GGATCGGCCA AACGGGATTT CTACGTCTGG AGCGATACCG ATCAAAAATT TCCCGAGACC CGGATTATCT TCACCGATAC GGAAACTTCC AACTGGGCCT GGGACCCGGT GGCGAAGGCT TACTATTGGC ATCGGTTTTT CTCTCACCAA CCCGATCTCA ACCATAACAA CCCCCAGGTG GTTAGGGCGG TGATCCGGGT GATGCGTTTT TGGCTGGATA TGGGTGTGGA CGGACTGCGT TTAGATGCTA TTCCCTACCT ATGCGTGCGT GAGGGAACCC ATAACGAAAA TCTTCCCGAA AGTCATGAGG TGCTTAAAGA GATGCGGGCT GTGGTGGACG AGCATTATCA AGGCCGCATG TTTTTGGCGG AGGCCAATCA ATGGCCAGAA GATGTAAGGG AATATTTTGG TGACGGCGAT GAATGCCATA TGGCCTACCA TTTTCCCCTT ATGCCGCGCA TGTATATGGC GATTGCCCAG GAGGATCGCC ATCCTATTAC TGAGATTATG AACCAAACTC CGGATATTCC CCAGACCTGC CAATGGGCCA TCTTTTTGCG TAACCATGAT GAGCTGACTC TGGAGATGGT CACCGACAAA GAACGTGACT ATATGTATCA GAGCTATGCC GCTAACCCTC GTATGCGGGT GAATGTGGGC ATCCGGCGGC GTTTGGCACC GTTAATGGAC AATGATTTAG ACAAGATTCG GTTGATGAAT AGCTTGCTTT TCTCTATGCC TGGCTCGCCC ATTATTTACT ATGGAGATGA AATTGGCATG GGAGATAATA TTTATTTGGG TGATCGCAAT AGTGTGCGTA CCCCCATGCA GTGGAGTCCT GATCGGAATG CGGGTTTTTC TAAGACCGAT CCTCAGCGCT TGTTTTTGCC GCCTATCATG GACCCCATTT ACGGTTATGA GGGCGTTAAC GTGGAAGCCC AGAGCCGGGC GCCCTCTTCC CTCCTCAATT GGATGAAACG TCTGCTTGCA GTGCGTAAGA GCATTAAGAC CTTTGGCCGC GGCACACTAG CATTTCTTCG TCCTGGAAAC CGTAAAATCT TAGCCTATGT GCGGGAATGG GAAGGGGAAG CTATTTTATG TGTTGCTAAT TTGTCCCGCT GCGCCCAGCC AGTGGAATTG GACCTCTCCC GCTTTGAAGG ACGAGTACCG CTAGAATTGA TGGGACATAC CCCTTTCCCT CCTATTGGCG AATTGCCCTA TTTGCTGACT CTGCCGGGCC ATGGTTTTTA TTGGTTCCGG TTGGCAATGG ATGTAGCGGA ACCCGCTTGG CATGAGCAGC GCTCGCCACC AGCTGAGTTA CCAGTGCTAG TGTTATTTGA TGGCTGGGAT AGCTTATTTC CTGAGCGCGT CGATCCCTCC CGCCAGCGGA TGGCCGAAAA ATTGGGCAGG CAATTAGAGC AAGCATTAGG GGATTTTCTG TTAGCGCAGC GCTGGTTTGC TGCCAAAGGG GAGAAGATAG AGCGGATAGC GTTGGAGCAG ATGGAGGAGT TAGGGAATTG GCTGCTGGCC AGAGTGAGGG TTGAGTTCAC CGAGTCCGAA GCTCAGTCTT ATTTTATCCC GTTAGCTATT GCTTGGGGAG AAGGGGATGA AGAGCCCATA CGGCGCTTGC TGCCTGCTAC CATCGCTAAG ATTCGGCGAC AAGCTCGTTT AGGTATCTTA TATGATGCCT TAGAGGATAT AAATTTTAGC CAAGCATTAG TCATGGCCAT GGGACAGCAT AGGGAGATCC TCTTTGGAGC CGGAAGATTA AAATTCTCCC CCACGACGGC TTTTGAAAAG TTGGTCTACA GTGCTTCCCT AGAGCAACTG CGGCGCCCGG CGGTGGAAGG TACAAACAGC ACCTTGATTT TAGATAACCG GCTATTTTTG AAAATCTACC GTTATTTAGA AGAAGGTATC AATTCAGAGC GGGAGATAGG CTATTTCCTT ACCGAAATAT CGCCCTTTCC TCATATTGCC CCTCTAGCAG GTACCTTAGA GTATATAAGT CCAAAAGAAA AGGTGATAAC TCTGGCTCTT CTGCAAGGTT TTGTTTCCAA CCAGGGAGAT GCTTGGGCTT ATACCGTGGC TTATTTAGAA CGTTTTCTAG AACATTGCCT CGCAAAACCG CTCGAAGAAG TTGCCACTGA ATTGGACAAG AATCATGAGG ATTATCTAAG AAAAATAACC CTCTTGGGCC GGCGCACGGG GGAATTGCAT CAGGCGCTGG CCAAGGAAAC GGGGAATCCG GCCTTTGATC CGGAACCCAT TGTTTCTGCT GATTTGATAT CTTGGAGAAA ACGCCTGGAA GCAGATATTG AGTGCACTTT CAAGAAGCTA GCACAGCGGA AAAACACTTT TCCCGAATCA CTGCGCAACG ACGTAGAATG GTTTTTGAGA TCCGGCGAAA TCTTGCGGCA ACGGCTTTGT TTAAATTCTT CCATATTGCA AACTGTCAAG ACTCGCTATC ATGGTGATTA TCATTTAGGG CAAGTACTAG TAGCAGAGGA TGATTTGGTC ATTATTGATT TTGAAGGCGA GCCTGCTCGC CCCTTAAGGG AGCGTCGGAA AAAGCATTCG CCGCTGCGGG ACGTAGCGGG TATGTTGCGA TCATTTAACT ATGCTGCTGT CGTAGCTCTG CGCCATTGTA CGGCAGAACG TCCTGAAGAT CAGGTTTTAC TCATGCCACT ACTACATGCT TGGGAGCAGC AGGCGAGGGA GTATTTTCTA ACTGGCTACC GGGAGGGGAT GAGGGATTGT CCTTCTTATC CTGTAGACTC GGAACAGGCT CACGACTTGA TCACGCTATT TACTTTGGAA AAGGCACTGT ACGAACTCCG TTACGAGCTC GACAATCGGC CAGCATGGGT GGAGGTACCC CTTAGGGGAT TGCTTGCGTT TTTGGAAGAG GATGAGCAGA AATAA
|
Protein sequence | MNPRGEELAW QDDPLWYKDA IIYQLHVRAF FDSNNDGTGD FRGLTQKLDY IQDLGVNAIW LLPFYPSPLR DDGYDIADYR NIYPDYGTRR DVRHFVREAH RRGLKVITEL VINHTSDQHP WFQAARRAPP GSAKRDFYVW SDTDQKFPET RIIFTDTETS NWAWDPVAKA YYWHRFFSHQ PDLNHNNPQV VRAVIRVMRF WLDMGVDGLR LDAIPYLCVR EGTHNENLPE SHEVLKEMRA VVDEHYQGRM FLAEANQWPE DVREYFGDGD ECHMAYHFPL MPRMYMAIAQ EDRHPITEIM NQTPDIPQTC QWAIFLRNHD ELTLEMVTDK ERDYMYQSYA ANPRMRVNVG IRRRLAPLMD NDLDKIRLMN SLLFSMPGSP IIYYGDEIGM GDNIYLGDRN SVRTPMQWSP DRNAGFSKTD PQRLFLPPIM DPIYGYEGVN VEAQSRAPSS LLNWMKRLLA VRKSIKTFGR GTLAFLRPGN RKILAYVREW EGEAILCVAN LSRCAQPVEL DLSRFEGRVP LELMGHTPFP PIGELPYLLT LPGHGFYWFR LAMDVAEPAW HEQRSPPAEL PVLVLFDGWD SLFPERVDPS RQRMAEKLGR QLEQALGDFL LAQRWFAAKG EKIERIALEQ MEELGNWLLA RVRVEFTESE AQSYFIPLAI AWGEGDEEPI RRLLPATIAK IRRQARLGIL YDALEDINFS QALVMAMGQH REILFGAGRL KFSPTTAFEK LVYSASLEQL RRPAVEGTNS TLILDNRLFL KIYRYLEEGI NSEREIGYFL TEISPFPHIA PLAGTLEYIS PKEKVITLAL LQGFVSNQGD AWAYTVAYLE RFLEHCLAKP LEEVATELDK NHEDYLRKIT LLGRRTGELH QALAKETGNP AFDPEPIVSA DLISWRKRLE ADIECTFKKL AQRKNTFPES LRNDVEWFLR SGEILRQRLC LNSSILQTVK TRYHGDYHLG QVLVAEDDLV IIDFEGEPAR PLRERRKKHS PLRDVAGMLR SFNYAAVVAL RHCTAERPED QVLLMPLLHA WEQQAREYFL TGYREGMRDC PSYPVDSEQA HDLITLFTLE KALYELRYEL DNRPAWVEVP LRGLLAFLEE DEQK
|
| |