Gene Noc_0830 details


Gene Information

Locus tag: Noc_0830
Symbol:
ID: 3707135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 907053
End bp: 910367
Gene Length: 3315 bp
Protein Length: 1104 aa
Translation table: 11
GC content: 50%
IMG OID: 637737332
Product: Alpha amylase, catalytic region
Protein accession: YP_342873
Protein GI: 77164348
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
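
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 907053
end_bp = 910367
gene_length_bp = 3315
protein_length_aa = 1104

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks print True: the span is 3315 bp, which is 1105 codons, or 1104 residues plus one stop codon.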


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCAC GGGGGGAAGA ATTAGCCTGG CAAGATGATC CGCTTTGGTA TAAAGATGCC 
ATTATCTACC AACTCCATGT TAGGGCTTTT TTTGATAGTA ATAATGATGG CACGGGAGAT
TTCCGGGGCC TTACCCAGAA GTTGGACTAT ATCCAAGATC TGGGCGTAAA TGCGATCTGG
CTGCTGCCCT TTTATCCTTC TCCTTTGCGA GATGATGGCT ATGATATTGC GGATTACCGC
AATATCTATC CGGATTATGG GACTCGCAGG GATGTGAGGC ACTTTGTGCG GGAAGCCCAC
CGCCGTGGCC TCAAGGTTAT TACCGAGCTA GTCATCAACC ATACCTCGGA TCAGCACCCT
TGGTTTCAGG CTGCCCGCAG GGCGCCACCA GGATCGGCCA AACGGGATTT CTACGTCTGG
AGCGATACCG ATCAAAAATT TCCCGAGACC CGGATTATCT TCACCGATAC GGAAACTTCC
AACTGGGCCT GGGACCCGGT GGCGAAGGCT TACTATTGGC ATCGGTTTTT CTCTCACCAA
CCCGATCTCA ACCATAACAA CCCCCAGGTG GTTAGGGCGG TGATCCGGGT GATGCGTTTT
TGGCTGGATA TGGGTGTGGA CGGACTGCGT TTAGATGCTA TTCCCTACCT ATGCGTGCGT
GAGGGAACCC ATAACGAAAA TCTTCCCGAA AGTCATGAGG TGCTTAAAGA GATGCGGGCT
GTGGTGGACG AGCATTATCA AGGCCGCATG TTTTTGGCGG AGGCCAATCA ATGGCCAGAA
GATGTAAGGG AATATTTTGG TGACGGCGAT GAATGCCATA TGGCCTACCA TTTTCCCCTT
ATGCCGCGCA TGTATATGGC GATTGCCCAG GAGGATCGCC ATCCTATTAC TGAGATTATG
AACCAAACTC CGGATATTCC CCAGACCTGC CAATGGGCCA TCTTTTTGCG TAACCATGAT
GAGCTGACTC TGGAGATGGT CACCGACAAA GAACGTGACT ATATGTATCA GAGCTATGCC
GCTAACCCTC GTATGCGGGT GAATGTGGGC ATCCGGCGGC GTTTGGCACC GTTAATGGAC
AATGATTTAG ACAAGATTCG GTTGATGAAT AGCTTGCTTT TCTCTATGCC TGGCTCGCCC
ATTATTTACT ATGGAGATGA AATTGGCATG GGAGATAATA TTTATTTGGG TGATCGCAAT
AGTGTGCGTA CCCCCATGCA GTGGAGTCCT GATCGGAATG CGGGTTTTTC TAAGACCGAT
CCTCAGCGCT TGTTTTTGCC GCCTATCATG GACCCCATTT ACGGTTATGA GGGCGTTAAC
GTGGAAGCCC AGAGCCGGGC GCCCTCTTCC CTCCTCAATT GGATGAAACG TCTGCTTGCA
GTGCGTAAGA GCATTAAGAC CTTTGGCCGC GGCACACTAG CATTTCTTCG TCCTGGAAAC
CGTAAAATCT TAGCCTATGT GCGGGAATGG GAAGGGGAAG CTATTTTATG TGTTGCTAAT
TTGTCCCGCT GCGCCCAGCC AGTGGAATTG GACCTCTCCC GCTTTGAAGG ACGAGTACCG
CTAGAATTGA TGGGACATAC CCCTTTCCCT CCTATTGGCG AATTGCCCTA TTTGCTGACT
CTGCCGGGCC ATGGTTTTTA TTGGTTCCGG TTGGCAATGG ATGTAGCGGA ACCCGCTTGG
CATGAGCAGC GCTCGCCACC AGCTGAGTTA CCAGTGCTAG TGTTATTTGA TGGCTGGGAT
AGCTTATTTC CTGAGCGCGT CGATCCCTCC CGCCAGCGGA TGGCCGAAAA ATTGGGCAGG
CAATTAGAGC AAGCATTAGG GGATTTTCTG TTAGCGCAGC GCTGGTTTGC TGCCAAAGGG
GAGAAGATAG AGCGGATAGC GTTGGAGCAG ATGGAGGAGT TAGGGAATTG GCTGCTGGCC
AGAGTGAGGG TTGAGTTCAC CGAGTCCGAA GCTCAGTCTT ATTTTATCCC GTTAGCTATT
GCTTGGGGAG AAGGGGATGA AGAGCCCATA CGGCGCTTGC TGCCTGCTAC CATCGCTAAG
ATTCGGCGAC AAGCTCGTTT AGGTATCTTA TATGATGCCT TAGAGGATAT AAATTTTAGC
CAAGCATTAG TCATGGCCAT GGGACAGCAT AGGGAGATCC TCTTTGGAGC CGGAAGATTA
AAATTCTCCC CCACGACGGC TTTTGAAAAG TTGGTCTACA GTGCTTCCCT AGAGCAACTG
CGGCGCCCGG CGGTGGAAGG TACAAACAGC ACCTTGATTT TAGATAACCG GCTATTTTTG
AAAATCTACC GTTATTTAGA AGAAGGTATC AATTCAGAGC GGGAGATAGG CTATTTCCTT
ACCGAAATAT CGCCCTTTCC TCATATTGCC CCTCTAGCAG GTACCTTAGA GTATATAAGT
CCAAAAGAAA AGGTGATAAC TCTGGCTCTT CTGCAAGGTT TTGTTTCCAA CCAGGGAGAT
GCTTGGGCTT ATACCGTGGC TTATTTAGAA CGTTTTCTAG AACATTGCCT CGCAAAACCG
CTCGAAGAAG TTGCCACTGA ATTGGACAAG AATCATGAGG ATTATCTAAG AAAAATAACC
CTCTTGGGCC GGCGCACGGG GGAATTGCAT CAGGCGCTGG CCAAGGAAAC GGGGAATCCG
GCCTTTGATC CGGAACCCAT TGTTTCTGCT GATTTGATAT CTTGGAGAAA ACGCCTGGAA
GCAGATATTG AGTGCACTTT CAAGAAGCTA GCACAGCGGA AAAACACTTT TCCCGAATCA
CTGCGCAACG ACGTAGAATG GTTTTTGAGA TCCGGCGAAA TCTTGCGGCA ACGGCTTTGT
TTAAATTCTT CCATATTGCA AACTGTCAAG ACTCGCTATC ATGGTGATTA TCATTTAGGG
CAAGTACTAG TAGCAGAGGA TGATTTGGTC ATTATTGATT TTGAAGGCGA GCCTGCTCGC
CCCTTAAGGG AGCGTCGGAA AAAGCATTCG CCGCTGCGGG ACGTAGCGGG TATGTTGCGA
TCATTTAACT ATGCTGCTGT CGTAGCTCTG CGCCATTGTA CGGCAGAACG TCCTGAAGAT
CAGGTTTTAC TCATGCCACT ACTACATGCT TGGGAGCAGC AGGCGAGGGA GTATTTTCTA
ACTGGCTACC GGGAGGGGAT GAGGGATTGT CCTTCTTATC CTGTAGACTC GGAACAGGCT
CACGACTTGA TCACGCTATT TACTTTGGAA AAGGCACTGT ACGAACTCCG TTACGAGCTC
GACAATCGGC CAGCATGGGT GGAGGTACCC CTTAGGGGAT TGCTTGCGTT TTTGGAAGAG
GATGAGCAGA AATAA
 
Protein sequence
MNPRGEELAW QDDPLWYKDA IIYQLHVRAF FDSNNDGTGD FRGLTQKLDY IQDLGVNAIW 
LLPFYPSPLR DDGYDIADYR NIYPDYGTRR DVRHFVREAH RRGLKVITEL VINHTSDQHP
WFQAARRAPP GSAKRDFYVW SDTDQKFPET RIIFTDTETS NWAWDPVAKA YYWHRFFSHQ
PDLNHNNPQV VRAVIRVMRF WLDMGVDGLR LDAIPYLCVR EGTHNENLPE SHEVLKEMRA
VVDEHYQGRM FLAEANQWPE DVREYFGDGD ECHMAYHFPL MPRMYMAIAQ EDRHPITEIM
NQTPDIPQTC QWAIFLRNHD ELTLEMVTDK ERDYMYQSYA ANPRMRVNVG IRRRLAPLMD
NDLDKIRLMN SLLFSMPGSP IIYYGDEIGM GDNIYLGDRN SVRTPMQWSP DRNAGFSKTD
PQRLFLPPIM DPIYGYEGVN VEAQSRAPSS LLNWMKRLLA VRKSIKTFGR GTLAFLRPGN
RKILAYVREW EGEAILCVAN LSRCAQPVEL DLSRFEGRVP LELMGHTPFP PIGELPYLLT
LPGHGFYWFR LAMDVAEPAW HEQRSPPAEL PVLVLFDGWD SLFPERVDPS RQRMAEKLGR
QLEQALGDFL LAQRWFAAKG EKIERIALEQ MEELGNWLLA RVRVEFTESE AQSYFIPLAI
AWGEGDEEPI RRLLPATIAK IRRQARLGIL YDALEDINFS QALVMAMGQH REILFGAGRL
KFSPTTAFEK LVYSASLEQL RRPAVEGTNS TLILDNRLFL KIYRYLEEGI NSEREIGYFL
TEISPFPHIA PLAGTLEYIS PKEKVITLAL LQGFVSNQGD AWAYTVAYLE RFLEHCLAKP
LEEVATELDK NHEDYLRKIT LLGRRTGELH QALAKETGNP AFDPEPIVSA DLISWRKRLE
ADIECTFKKL AQRKNTFPES LRNDVEWFLR SGEILRQRLC LNSSILQTVK TRYHGDYHLG
QVLVAEDDLV IIDFEGEPAR PLRERRKKHS PLRDVAGMLR SFNYAAVVAL RHCTAERPED
QVLLMPLLHA WEQQAREYFL TGYREGMRDC PSYPVDSEQA HDLITLFTLE KALYELRYEL
DNRPAWVEVP LRGLLAFLEE DEQK
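
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that. The helper names `join_blocks` and `gc_percent` are hypothetical and not part of any database tool, and the sample is only the first line of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from this record, used as a small sample.
sample = "ATGAATCCAC GGGGGGAAGA ATTAGCCTGG CAAGATGATC CGCTTTGGTA TAAAGATGCC"
dna = join_blocks(sample)

print(len(dna))         # number of bases in the sample line
print(gc_percent(dna))  # GC percentage of the sample line only
```

Passing the full gene sequence to `join_blocks` should give a string whose length can be compared with the Gene Length field, and `gc_percent` on that string can be compared with the reported GC content.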