Gene Noc_0156 details

Gene Information

Locus tag: Noc_0156
Symbol:
ID: 3706189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 168016
End bp: 169194
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 51%
IMG OID: 637736673
Product: hypothetical protein
Protein accession: YP_342219
Protein GI: 77163694
COG category: [R] General function prediction only
COG ID: [COG2081] Predicted flavoproteins
TIGRFAM ID: [TIGR00275] flavoprotein, HI0933 family
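
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both are assumptions here, since the record does not state its coordinate convention. A minimal Python sketch of that arithmetic (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(168016, 169194)
print(length, protein_length_aa(length))  # prints "1179 392"
```

The result matches the listed Gene Length of 1179 bp and Protein Length of 392 aa.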


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTCTAT GGGATGTGGT GATTATCGGC GGTGGCGCCG CTGGTTTGAT GTGTGCTATT 
GAGGCTGGTA AACGCCAGCG ACGGGTATTG CTCATTGAGC ACAGTAACCG TGTGGGCAAG
AAGATTCTAA TGTCTGGGGG CGGGCGCTGT AACTTCACTA ACCTACACGT CAGGCCGGAT
AATTTTTTAT CGGCTAATCC TCACTTCTGT AAATCCGCGC TGGCCCGCTA TAGCCCGTGG
GATTTTATCG CCATGGTAGA ACGCCACGGT ATTGCTTACC ATGAGAAAGA ATCCGGGCAG
CTATTCTGCA ACCAATCTTC AAAGTTAATC GTGAACATGC TGGTAGCGGA GTGTCAGCAA
GTCGGGGTAC GTATTGAATT GGGATCCAAG GTAACCACGG TGAAACATAG GTTCCCTGGC
TTTGCTTTAG AAACCAGTCT AGGATCGGTT CAGGCATCCG CATTGGTCAT TGCCAGTGGT
GGCTTGTCCA TCCCTAAAAT GGGTGCTAGC GGTTTTGGTT ATGAACTTGC TAAACGGTTT
GGACATCGTA TTCTTGCTAC CCGTCCCGCC TTGGTGCCCC TGATCTTCAC TGAAGAAGAC
TTGGAGCAAT ATCGGGACTT AAGTGGCATT GGACTATTAG CCGAAGTCGG CTGCAATAAT
CAGTATTTTA CCGGCGGCAT GTTGTTTACT CATCGAGGAA TCAGCGGCCC GGCTATTTTA
CAGATCTCTT CCTATTGGCA GCTTTCTGAT GAACTGGGGA TAAATCTATT GCCAGGAACA
GATGTTCTAG CATGGTTAAC AGAGCGGCAG CGCAGTCGCC CTAGTGCCGA ACTCAGGACA
GTGCTCGCCA AATGCTTGCC AAAACGGTTG GCCCAGCGCC TCTGTAAATT AGTATTTGGC
AGCTTTCCCC TGCGCCAATA TTCTCCACTG GAATTGAGGG CAGTGGCCGA ACGGCTCCAG
TATTGGCGCT TTTATCCCAA GGGAACCGAA GGTTATCGCA CGGCTGAGGT GACATTGGGT
GGTGTCAATA CCGATGAGCT TTCTTCGGCT ACCATGGCCT CCAAAAAAGT GCCGGGACTA
TACTTTATCG GGGAAGTGGT GGATGTGACT GGTCATCTGG GTGGATTCAA TTTTCAATGG
GCTTGGGCAT CTGGTCATGC GGCTGGGCAG GCGGTTTGA
 
Protein sequence
MPLWDVVIIG GGAAGLMCAI EAGKRQRRVL LIEHSNRVGK KILMSGGGRC NFTNLHVRPD 
NFLSANPHFC KSALARYSPW DFIAMVERHG IAYHEKESGQ LFCNQSSKLI VNMLVAECQQ
VGVRIELGSK VTTVKHRFPG FALETSLGSV QASALVIASG GLSIPKMGAS GFGYELAKRF
GHRILATRPA LVPLIFTEED LEQYRDLSGI GLLAEVGCNN QYFTGGMLFT HRGISGPAIL
QISSYWQLSD ELGINLLPGT DVLAWLTERQ RSRPSAELRT VLAKCLPKRL AQRLCKLVFG
SFPLRQYSPL ELRAVAERLQ YWRFYPKGTE GYRTAEVTLG GVNTDELSSA TMASKKVPGL
YFIGEVVDVT GHLGGFNFQW AWASGHAAGQ AV
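
The sequences above are printed in blocks of 10 characters, 6 blocks per line, so they need to be joined before any length or composition check. A minimal Python sketch, assuming the text is pasted in as a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input:

```python
def clean_sequence(raw: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record above.
first_line = "ATGCCTCTAT GGGATGTGGT GATTATCGGC GGTGGCGCCG CTGGTTTGAT GTGTGCTATT"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Running the same helpers on the full gene sequence gives values that can be compared with the Gene Length and GC content fields of the record.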