Gene Noc_0491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0491 
Symbol 
ID3706662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp528860 
End bp529939 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content43% 
IMG OID637737000 
Producthypothetical protein 
Protein accessionYP_342544 
Protein GI77164019 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000229793 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCAAC TAGATGAAGC ACAGAATCGC TATCGAAAAA ACTTTATTAT GATACTGTTG 
GTATCCATTC TGGCAGTATT TTTAGTGATG ATTCATGAGT ACCTTATTGC CATTCTTTTA
GCGATTATTT TCACAGCCCT GTTGTATCCT GTTTATGCTT GGATTTTAAA GAAATTTAAT
GGCAGACAGG TGCTGTCGTC AATGACTACG ATCCTGTTAG CTATTCTAAT GATAGGCCTG
CCGTTGCTAG GTTTGCTCGG CGCCGTAGCT GCCGAAGCAA TTCAAATTAG CAATAGCATT
GCCCCCTGGA TAGAGAAAAA AATTCCTGAT CAGAACGCCT CCCCCCTCCA CGAATTTCCA
CAGTGGCTGC CGTTTGCTGA TCAGCTTGAG CCTTATAGAA CGCGGATTTT AGCTAAAGTA
GGGGAGTTTG CCGGTAATGC AGGCGCGTTT ATCGCAAGTG GAATTTCTAA GGCCACCCAA
GGCACGATCG GTTTCATAGT AAATTTTTTC ATTATGTTAT ATGCCATGTT CTTCTTTTTT
ATATGGGGGC CGGATTCGCT TATTAACTTA ATACGTTATC TTCCCCTTAC TGAAAAAGAC
CGTTCCCATA TTCTTGAAAA AGGACTTTCA GTTACAAAGG CGACCTTAAA GAGTATTCTC
ATCATTGGGG TATTACAGGG AATCCTAGTA GGGCTCGCCT TCTGGGTAGC TGGGATTAAA
GGGGCTATCT TTTGGGGTAC CATCACGGTA GTGCTTTCTG CGGTTCCCGG GCTCGGTGCC
CCCGTTGTTT GGATTCCAGC GGTAATTTAT TTGATAGCTA CGGATCAAAT AGGTTGGGCC
ATTGGGATGA CGTTATGGGG GATAATTATC GTAGGCTTGG TGGATAACAT CCTGCGTCCT
CGAATTGTGG GCAGCGAGGC CAAAATGCCT GATTTGCTGA TTTTGCTAGC TACTTTGGGT
GGTATTCTTA TGTTCGGAAT GGTGGGTGTT ATTGTAGGTC CTATTATTGC TGCCTTACTA
ATCACTGTGC TTGATATCTA TGGAAAAGTA TTTACTAATC TTTATTCCCA GGCGGAATGA
 
Protein sequence
MIQLDEAQNR YRKNFIMILL VSILAVFLVM IHEYLIAILL AIIFTALLYP VYAWILKKFN 
GRQVLSSMTT ILLAILMIGL PLLGLLGAVA AEAIQISNSI APWIEKKIPD QNASPLHEFP
QWLPFADQLE PYRTRILAKV GEFAGNAGAF IASGISKATQ GTIGFIVNFF IMLYAMFFFF
IWGPDSLINL IRYLPLTEKD RSHILEKGLS VTKATLKSIL IIGVLQGILV GLAFWVAGIK
GAIFWGTITV VLSAVPGLGA PVVWIPAVIY LIATDQIGWA IGMTLWGIII VGLVDNILRP
RIVGSEAKMP DLLILLATLG GILMFGMVGV IVGPIIAALL ITVLDIYGKV FTNLYSQAE