Gene Noc_0489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0489 
Symbol 
ID3706660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp524223 
End bp526220 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content39% 
IMG OID637736998 
Producthypothetical protein 
Protein accessionYP_342542 
Protein GI77164017 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2982] Uncharacterized protein involved in outer membrane biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000190696 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAA AATTATTTAT CCTATTCGCA ATTATCCTGA TATCCCTGCT GCTGCTGGTG 
ATACTGGCTG GAGTAATGAT TGATGAGAAT AGGGTTAAAA AATGGGTTTC TAACTATATT
GAACAGGAGT ATGACAGAGA ATTTGAGATA ACTGGCGAGC TCAAGCTAAA TTTATTATCC
TTGCAACCCA GCGTTACAGC AAAAGGAATA AAACTAGAAA ATGCAGAATG GGCGGATAAG
CCTAATATGG CGACTATCGG TAAAGTTTTT TTCCAGTTTG AGTTTTTTCC GTTGATCTTA
GGAAACATCA ACATCCTAGA TGTGGAAATT AGAGATGGCC GGATTTCCTT GCTTGAAAAA
CAGAATAAAA CTAATTGGGA TATTTTTTTA CCAGAAGAGA AGAAGCCCCC AGGTTATTTT
TCAGTAGAGA AAATCAGAAA TTTTAGGCTA AACGATTTCA CGGTGACTTA CCAAAGTAGG
GTAAAAAAAG AAACCCATAC TCTGCTCGCA AAAAATATTC AGCTGCAAAA TTTTTATTCT
ACTAAAAGGA AAGGTTCTTT TCAGGGAAAG GTTGATTCCA TTCCAATTGG TTTTCAACTT
AACCAAAAGC CAATCAATGC AGAAAATGTG GACAAACTAC TCCTGTTGAC AGGACATATC
GGTACGATAG AACTTTATTT TACCGGCGAT ATTTCCGATG ATTTTAATCT GATAAGGGGT
AAATCCCGTG TAAAAGGATC CGGAGAAACA TTATTGCAGC TGGCCGAATT GGCTGGTTTA
AAAATTAAGA ATTTTCGCTC TTATGATAGT TTTGCCACTA TTGATGCGAA TCTAGAGAAG
ATGTTGATAA ATGTGAATCA GATGGATATT ACCTATGGTC AGAGTCAAAT GACAGGTACG
TTGAAGATTG ATTCGAACGA GAAGCGAACG CTTGTTAGAG GCGATTTACT ATTTCCGGTG
CTGTCTAGCC CTGATTTTAG ATCCCTAACT AAGAAAAGCG TAGCCGAGCT AGAAGAAGAA
ACCACGGATA TTGATGTTAA GGCGCAAGAA AAACAAAGCG AGAAAAAAAA GCTCTTCGAT
GATGATCCTC TCGAATCTTT TATCCCAGAA AAACTGGAAC TCTATCTGCA TATGAAAGTG
GATAAATATG TTGGCGGTGA TTGGGATAAA CTTATTCAGG GGGGCAAGCT TAATATTGCG
ATCAAGGAAA AACAGTTAGC GCTTCATCCA ATAAAAGTAA GGATGTTAGG AGGAGGGATC
TCTCTGGAGG GATTGATCGA TCAATCAAAG AATAACTTAA CAGAAGCTGA TCTTAGGCTA
AACGTTAATA ATTTTGAGCT CAAGCAAATC TCTGAAAGCT TAGCCGATCT TGATAATAAG
CTTAATATTG ATGACATCAT TGGAGGAGAT TTTAATGCTG TGGTTAACTT GCATGCTAAT
GGTGGTTCAC CGCAAGACAT GGCTTCAACT CTGAGTGGGG ATCTTAATTT CGTAGTAGAG
GACGGCTATA TCGGTAGCTT GCTGGTAGAA GCCTTGCAGA TTGATGTGAC GGAAGCCTTT
GTTTCTTGGC TGGCTGATAA TCCAAAGACA GAAATGAACT GCCTGGTGGG ATTATTTGAT
ATTGATTCAG GACGCGTCGA AACCGAGTCC CTGCTCATCA ATACCGATGA TTCCAACGTA
GTGGGTAATG GGGTTGTGAA TCTGGATAAG GAATTAATAA AGTACATGTT GGTTGCAAGG
GCAAAAGATT TCTCTTTAAC GGGAGCGCCC ATTAAAATAG TTATGGAGGG GGATCTTTTA
AATCCTATGC TTGATATTGG TGGTGAAGAT TCATTGCTTG AAGTGGCTGC TGAAGTGCTT
GTGACGCCTA TTGCTGAAGG TTTTAAAAAT ATCTTTAGCG AAAATAGCGA AAGTGAAGAA
TTAACGGGTT GTAAGAAATT TATGGACGAT ATTGCTCGTA TCCAGAAAAA AGCAGACACG
GTGGAAAAAC ACCAGTAA
 
Protein sequence
MLKKLFILFA IILISLLLLV ILAGVMIDEN RVKKWVSNYI EQEYDREFEI TGELKLNLLS 
LQPSVTAKGI KLENAEWADK PNMATIGKVF FQFEFFPLIL GNINILDVEI RDGRISLLEK
QNKTNWDIFL PEEKKPPGYF SVEKIRNFRL NDFTVTYQSR VKKETHTLLA KNIQLQNFYS
TKRKGSFQGK VDSIPIGFQL NQKPINAENV DKLLLLTGHI GTIELYFTGD ISDDFNLIRG
KSRVKGSGET LLQLAELAGL KIKNFRSYDS FATIDANLEK MLINVNQMDI TYGQSQMTGT
LKIDSNEKRT LVRGDLLFPV LSSPDFRSLT KKSVAELEEE TTDIDVKAQE KQSEKKKLFD
DDPLESFIPE KLELYLHMKV DKYVGGDWDK LIQGGKLNIA IKEKQLALHP IKVRMLGGGI
SLEGLIDQSK NNLTEADLRL NVNNFELKQI SESLADLDNK LNIDDIIGGD FNAVVNLHAN
GGSPQDMAST LSGDLNFVVE DGYIGSLLVE ALQIDVTEAF VSWLADNPKT EMNCLVGLFD
IDSGRVETES LLINTDDSNV VGNGVVNLDK ELIKYMLVAR AKDFSLTGAP IKIVMEGDLL
NPMLDIGGED SLLEVAAEVL VTPIAEGFKN IFSENSESEE LTGCKKFMDD IARIQKKADT
VEKHQ