Gene Noc_0218 details


Gene Information

Locus tag: Noc_0218
Symbol:
ID: 3706273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 241013
End bp: 242965
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 51%
IMG OID: 637736734
Product: Alpha amylase, catalytic region
Protein accession: YP_342278
Protein GI: 77163753
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
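
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(241013, 242965)
print(length)                  # 1953
print(protein_length(length))  # 650
```

Both results match the listed Gene Length (1953 bp) and Protein Length (650 aa).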


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATG CCCGTTGGAT GAAGCAGCAT AGTACTGTTT CGCTCAGGCG TCTGATGCCT 
CGTCTGGAGG CGCGTTTCAA GGCGCAGGTC GATCCTGACG AGTGGCAGGG TTACGTGCAG
CGCCTAGAGA CCCATTTTCC AAGACTGTTT GATTGCCTAT ATCGCCTCTA TGGTCAGCAA
TATGACTTTT TCTTCCATCT AGAAAGCATT CTGGCGTCGG CTACGGAAAT GTGGTGGGTG
CGCCCCGCCG AACTCAAGGC GTTGGATGCG CTCCGGACTG CCGATCCCTA TTGGTACCAA
TCCCACCGCT TGGTGGGTGC TATGTGCTAT GTGGATCTTT TTGCTGGCGA TTTGGTCGGC
CTGCGCGAGC GTATCTCCTA TTTAACTGAG CTGGGGATTA ATTATTTACA TCTCATGCCA
GTATTTAAGG TGCCCGAAGG GGATAATGAT GGGGGCTATG CCGTCAGTAG CTATCGTGAA
ATCAACCCCA ATCTGGGAAC AATGGAAGAT CTGGTCCGGT TGGCCGATGA GCTGCGCTAT
CGAGGCATTT CCCTCTGCCT GGACTTCGTA TTCAATCATA CTGCCGATGA GCATGACTGG
GCCCGCCGTG CCCAGGCCGG TGATCGAGAG TATCAAGAGT ATTATCGCAT GTTCCCTGAT
CGGGAATTGC CAGATGCTTA TGAGCGTAGC CTCAATCCCA TTTTTCCCGA TGAACATCCA
GGATCGTTTA CTTACCGTAA CCGCATCCGC AAGTGGATCT GGACTAGTTT CCATAATTAT
CAATGGGACT TAAATTATGA AAATCCGGTG GTTTTTAACC GGATGCTGGA AGAAATGCTT
TTTCTTGCTA ATCAAGGGGT GGAGATTCTG CGCCTTGATG CGGTGGCTTT TGTCTGGAAG
GAGCTAGGTA CCAGTTGTCA GAATTTACCC GAAGCGCACA TCATTATCCA GGCTTTCAAT
GCCTTGGTGC GTATCGCCGC GCCGGCCATG GTATTTAAGT CCGAGGCCAT TGTGCATCCG
GATGATGTGA GAAAGTACAT CAGCGAAGAG GAATGCCAAC TTTCCTATAA TCCTCAGCTT
ATGGCTTTGT TATGGGATGC CTTGGCAAGC CGGGACATCC GTCTGCTGCG CTATGGGTTG
CAGCGACGTT TTGCCCTCCC GTCCGGTTGT GCCTGGGTGA ATTATGTGCG TTGCCACGAT
GATATTGGGT GGACCTTTTC TGATGATGAT GCGCGTGCTC TGGATCTCGA TCCCCAGGCT
CACCGCCAGT TTCTTAGCCA GTTTTATACC GCGCGGTTTG AAGGTAGTTT CGCCCGGGGG
ATGCCTTTCC AGGAAAACTT AGCTACTGGA GATGCCCGGG TGTCGGGCAC TTGCGCTTCT
TTGGCAGGAT TGGAAAAGGG CCTTTATCAT AATGATGAAA CAGAAATCGA GTATGCGATC
CGTAAGATCC TGTTGATCCA TGGCGTTATT CTGACTATTG GTGGCATACC GTTGATTTAT
CTGGGTGATG AAATTGGCGT TCTAAATGAT TATGATTATG AAAAAGATTT GGCTAAAATT
GGCGATACCC GCTGGCTGCA CCGATTGCCC TTCGATGAGG CGCGGGCCGA GCAGCGTTGG
GATTTTACCA CTGTTCCTGG CCGAATCTAC CAAGGTTTGC TGCGGCTGAT CCAAATCCGC
CAGCAAAATC TAGCCTTTAC CCGAGCCGAA ACGGAAGTTG TCGACACCGG TAACGATCAT
GTATTTGGCT ATTTCCGCAA CCACGATGAA TACACGGTAC TGATACTGGC TAATTTCAGC
GACTTTACCC AGCATCTGGA AGCTCGGCGG CTGCGGATGT TGGGATTGCG CAAAACCGAG
GTGGACCTTT TTGCAGGTAA AACCGTTACT GCCACCCGCG AATTGAGCCT GGAACCCTAC
GCTTTCATGG TGCTGGCCAG GCCTGGCAAA TAA
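
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch of how a GC content figure such as the one in the Gene Information section could be recomputed. The helper names are hypothetical, and only the first sequence line is used here as sample input; to check the whole gene, pass the complete sequence text instead.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed above (sample input only).
first_line = "ATGAGTGATG CCCGTTGGAT GAAGCAGCAT AGTACTGTTT CGCTCAGGCG TCTGATGCCT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC content of this fragment: {100 * gc_content(seq):.0f}%")
```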
 
Protein sequence
MSDARWMKQH STVSLRRLMP RLEARFKAQV DPDEWQGYVQ RLETHFPRLF DCLYRLYGQQ 
YDFFFHLESI LASATEMWWV RPAELKALDA LRTADPYWYQ SHRLVGAMCY VDLFAGDLVG
LRERISYLTE LGINYLHLMP VFKVPEGDND GGYAVSSYRE INPNLGTMED LVRLADELRY
RGISLCLDFV FNHTADEHDW ARRAQAGDRE YQEYYRMFPD RELPDAYERS LNPIFPDEHP
GSFTYRNRIR KWIWTSFHNY QWDLNYENPV VFNRMLEEML FLANQGVEIL RLDAVAFVWK
ELGTSCQNLP EAHIIIQAFN ALVRIAAPAM VFKSEAIVHP DDVRKYISEE ECQLSYNPQL
MALLWDALAS RDIRLLRYGL QRRFALPSGC AWVNYVRCHD DIGWTFSDDD ARALDLDPQA
HRQFLSQFYT ARFEGSFARG MPFQENLATG DARVSGTCAS LAGLEKGLYH NDETEIEYAI
RKILLIHGVI LTIGGIPLIY LGDEIGVLND YDYEKDLAKI GDTRWLHRLP FDEARAEQRW
DFTTVPGRIY QGLLRLIQIR QQNLAFTRAE TEVVDTGNDH VFGYFRNHDE YTVLILANFS
DFTQHLEARR LRMLGLRKTE VDLFAGKTVT ATRELSLEPY AFMVLARPGK
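
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as start codons), and it builds the codon table by hand rather than relying on any external library. The function name is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments assumed shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as printed above.
print(translate("ATGAGTGATG CCCGTTGGAT GAAGCAGCAT"))  # MSDARWMKQH
```

The output matches the first 10 residues of the protein sequence listed above.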