Gene Noc_0550 details


Gene Information

Locus tag: Noc_0550
Symbol:
ID: 3706742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 592101
End bp: 594401
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 54%
IMG OID: 637737058
Product: hypothetical protein
Protein accession: YP_342600
Protein GI: 77164075
COG category:
COG ID:
TIGRFAM ID:
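
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed gene length agrees with that convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 592101
end_bp = 594401

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2301
print(protein_length)  # 766
```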


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CATTCTGTTT ACTTGCCATT GTTGCGTTTC CCGCCATGGC CGAAGTCACC 
GTGACTGAAC CGACGAAATT GACATCCATT GAAAGCCAAA TTTCGGCGCA AGAGTCCAAA
AGCCTGGACC TCAAGGCGCG CTCAAGCGCA CTGGAAGCCG AGGAGCACCT CGCCGCCGCC
ATTGACGGCT TTGTGTACGC GAGAGTGAAG CGCACGACGG GGAATTTTGT CGTCAAGGGG
AAAACCTATC ACTCTATTTC TGCTACCGAC CGGCTGATTG ATGTCACCTT TCAAGGCACA
ACCGGGGACT TTTTGACCGG CCCCGGCGAT TTGGTCTGGC GGAAGGGCGA TGGCACGGAA
GTTATCCTCC ACCATTGCCC GGAAGTCGCG CTAGGTGAGA ACACATGTAT TGTGATTGAC
CCGCGAGTGA GCTTCGACGG CAAGCAAGTG GCCTATACGG TGCTGGAGGG TCGGTATGGG
ACCAACCCTG ATGTGAAAAT GCAGCATGGG ATTGATGCGC AGAAGTCCTC CCTTCATTTC
GTCAATCTCG AAACGATGGA AAAGACCCAA TGGGCAGCCG TGCAAGGGGT ATTTGATATG
GCCCCACAAT GGTTGCCCGA TGGCACCCTG ATGTTTACCT CCAACCGGGA CCATATCAAA
GCAGGAAGCA TCAGCGGCTT TAGCCCGGAA GGGGAAGTCT TGCAGTTGTG GGTGGCGGAC
AGAAATGGCA GCCACGCCCG TAACGTCAGT CCGGACATTC TCGCCGATGC GCTCCATCCC
ATCCTTCATC CTTCTGGTAA GGTGCTTTTT TCAAGCTGGA AGCGGGACTT GGAAAGCGTT
CAGAACAAAA CCCTGGATAA TCTGTGGTAT ATCGCCGAAA CCCGGCAAGA TGGTTCGGAG
CATAATGCTA TCTGGGGCGC CCATGCGCGG TTTCTCCCTA ACCCAGACAA TATCACCATT
AAAGCCCTTC ATTTTACGGG CGTGCGTAGC AATGGGGACG TGTGTACGAC GAACTATTAT
CGCCAAAATA ATAAAGGCGG CGGTAACATT CAGTGCTTTA CCTATCCCAC CCCGAATCAT
CCCGAGGGCA AATTCCCTTT CAAGGCCAAA GAGGGAAGAT ATTTAGGGTT GTGGGGGCAT
TCCGGCGATC AGTTCAAAAC CGATGGGCTG GGGCGCACCC GCGATCCCAT GGGGATGCCT
AAAGGGGGGC TGGTGTTTTC CGCCAGCCCA AAAGGCGTGT GCCATTTCTT CCATAAAATT
GCCCAGCTAC CGCTCAATGA TAGCGGCTGT GATTTCGGTA TCTATGTACA AGGCACCGTG
CCAGAGAAAA GCCTGGAAAG CCGTATCCTT ATCGTCAATA GCCCCGACTG GCACGAGTTC
CAACCTCAGC CCGTGATGCC CTACGAGGAT ATTTACGGCA TTAAGAAGCC GCCAGTGGTA
AAGATGGACC ATCCCAACCC GGACGGCAAA TGCATCCTGC GCTCGGCTAG CAAGCGGATG
CAAGTAGACA ATGAACAGGG CTATCGCGGC CCCGGCAACG CTAATCCGTG TTATGAACAA
GGCTGTTCCA TGGCGGGCGT GGATAAGCAA GCGCTCATGA CAAGCATCAA GTTCTGGCGG
CCCGTTCCGT TCAATTCACG GGTTTTGCAT CAGGATTTTG ATGGATTTGG TCCATTCAGG
ATGGAAGTGC TCGGCACGGT TCCATTACGG TCTGATGGTT CATTTACCGC TGAAGTGCCT
TGCGATACCC CGTTCTATAT GGCTGGAGTC GATAGCGAAG GCAGAACGCT CGCCAGAGAC
AAGATCGCCA TGTCTCTGCG GCCCGGTGAA ACCCGTACCT GCTCGGGGTG CCACAACCAC
GACGACGACA ACCCCCCTCA GTTCAAGGAC TCAGAGGCGG CCAAAGTGGC CCCCACCCCA
GTCCCCGCAA GCGGAACGCT GTTGAATTGG ACCACGGATG TCTGGCCCCT GTTGCGTGAC
AATTGCGATG AAAACTTTCC CGAGTTTAAC CTTTCAGCCT CCACCGAGGA AAAAGCTTTT
AATAACTTTA CCCATTACGC CAAAGGGGCC GCAAAGCCAC CGTGGCGGAG CTACTTTATC
AACTGGTTGT TTGCCCGCGA GAGTCTCCTG TACTGGCGCA GCGCTGGAGA ACGTTTGGAT
GGACGCACCG ATGCCAGCCG CACCGACGAT ATCGACTACG GCACCGTAGC ACCCAAGCAA
GCCTGCCTTA ACGCCGCGCA GCTTAAAATC CTTGCAGACT GGATTGAAAG CGGTGCTTAT
CGGAAAAATA ATAAGTGGTG A
 
Protein sequence
MKKTFCLLAI VAFPAMAEVT VTEPTKLTSI ESQISAQESK SLDLKARSSA LEAEEHLAAA 
IDGFVYARVK RTTGNFVVKG KTYHSISATD RLIDVTFQGT TGDFLTGPGD LVWRKGDGTE
VILHHCPEVA LGENTCIVID PRVSFDGKQV AYTVLEGRYG TNPDVKMQHG IDAQKSSLHF
VNLETMEKTQ WAAVQGVFDM APQWLPDGTL MFTSNRDHIK AGSISGFSPE GEVLQLWVAD
RNGSHARNVS PDILADALHP ILHPSGKVLF SSWKRDLESV QNKTLDNLWY IAETRQDGSE
HNAIWGAHAR FLPNPDNITI KALHFTGVRS NGDVCTTNYY RQNNKGGGNI QCFTYPTPNH
PEGKFPFKAK EGRYLGLWGH SGDQFKTDGL GRTRDPMGMP KGGLVFSASP KGVCHFFHKI
AQLPLNDSGC DFGIYVQGTV PEKSLESRIL IVNSPDWHEF QPQPVMPYED IYGIKKPPVV
KMDHPNPDGK CILRSASKRM QVDNEQGYRG PGNANPCYEQ GCSMAGVDKQ ALMTSIKFWR
PVPFNSRVLH QDFDGFGPFR MEVLGTVPLR SDGSFTAEVP CDTPFYMAGV DSEGRTLARD
KIAMSLRPGE TRTCSGCHNH DDDNPPQFKD SEAAKVAPTP VPASGTLLNW TTDVWPLLRD
NCDENFPEFN LSASTEEKAF NNFTHYAKGA AKPPWRSYFI NWLFARESLL YWRSAGERLD
GRTDASRTDD IDYGTVAPKQ ACLNAAQLKI LADWIESGAY RKNNKW
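
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python; it is not the method used to produce this record. It assumes that translation table 11 uses the standard codon assignments for elongation (it differs from the standard code mainly in which start codons it permits, and this gene starts with ATG), and that the 10-base groups and line breaks in the listing are only formatting. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon assignments enumerated in T, C, A, G order.
# Assumption: table 11 shares these assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence listed above. The 38 rows before
# the last one hold 2280 bases, so the last row starts in frame.
first_row = "ATGAAAAAAA CATTCTGTTT ACTTGCCATT GTTGCGTTTC CCGCCATGGC CGAAGTCACC"
last_row = "CGGAAAAATA ATAAGTGGTG A"

print(translate(first_row))  # MKKTFCLLAIVAFPAMAEVT
print(translate(last_row))   # RKNNKW
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.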