Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1251 |
Symbol | |
ID | 3706363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1372390 |
End bp | 1375767 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637737753 |
Product | hypothetical protein |
Protein accession | YP_343282 |
Protein GI | 77164757 |
COG category | [S] Function unknown |
COG ID | [COG3002] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCAC ACCACGCCGA ACAGGCGGAA TCGCCCTTAG ATAACCGAGA AAAACTGACT ACCGCGGTAG ACAAGTTAGA ACTCGTGCTG CCATGCGCAG CCCCCCTGTA TGACTTCATG CACTTAAACA CCATGCAGGG CTACCACCAT ATCTCCTTCG CCGAAGCCAT GGCGGCTCAC TTTGAGCTGA CTGGCATCCG AGGTTACTTG CCTGAAGAAG ACTTCCGTAA GCATTATGCC CGGGGCCGTA TTGACGATGC TGACCTTAAC GAATCCCTAG CAAACAATAC CCATGGCCGT AATCGCGAAA TTGTACTGAA AGTTAGTGGC CGTCCTATCA ATAAACAGGA CATTTGGCGT ATCAGCCTAA TCCAGGACAT TAACCCTCTT TCTCCCAGCC GTTTTCGTTG GCAAATCAAG GAATATGATG CTCTGGAGCG TTTTCAGAAC GGCGTGCCAA AATCTGCCCG CGACACACTG CTTAACGCTA CCCAAGAGAC AGATCAAAAC CGCCGTACGG AAAGCCAGGC CATCCGAGAT CTATGGGAAG CATGTTTAAG GGTTTTCCAG TTAGAAAATC CTAATCTGCA TTCGGAAGAG TTAGGGGAAC TGGTAGACCT AGAAGAATTT TCCCGCAGCC AGGCCGAAGG TAAACAGAGT AAATTCCGCA CAAGCCAACC CACGCTTATC CCCCAAGAAG AAATGCTTGC TGAAGCACGA AAAGATCTTC ATCATCTAGT CGATAAGGTA GGTGAGGAAC TCTCCCTGCG TGGTCTATTA CAAGCACTGA CTGGTGAGGA TCTTTTAGAT CAGGTTCGTC CCATCTTGAT TCGCTTCTGT GCTTCCCATC TTGATGAAGG CTTCACAGCC TGGAGTCTAC CGGAGCGAGG GCAAGGGCTC TATGCCGCAT GGCGAAAGTG CCCCTTCGCT GAACTCGGAC TAGACTTGGC TAGGCTACCG GATTGGCAGT CGTTCCATGC CGAGTTACCA GAGCACTCAG TAGATGCCGT TATTGCCTGC CTTGAGCGAT TGAAAATCCC AGAATCCCGT TGGGAAGGTT ACCTGAAACG GATAGCTGTG GAACTTCCAG GCTGGTCCGG TCTTATTAAC TGGCGCCACC ATCGGCCCAA ATATAAGCCG AATCGGAAGG CCCCTACTTC GCTTATGGAT TATCTGGCTA TCCGCCTCTT CTTGGATGTG ATTCATATTG AACAGGTTAC CCAAAACACC TGGGGTATCG CCGGTAATCT AGAGGAACTC AAAACTTACT TCGAGAATTA CCTTTGGGAG TTCTCTGGGC GGTATGCGCT ATTTTCAAAC ACGCTCCCAG AATATTTGGC CATCAGAGCA CAAGAACTGA TAGCTCTGCC CAGAACTGCC CAAAAGGATC GGGAAAATTG GCGCACGGTC TCCAATATGA TTCATAACTG GAAGCACAAT CCTTCGGCGG AAAGGACAAA ACGTCAAACC GTGCATTCTC ATGTCTGGCG TTTATTCTGC CTAGCACAGC ATCTAGGTCT GCCGGGAAAT GAGGTGGGCA AACTTTCCTC CAGCGAGGCT GAACAACTGC TCGCCATCCT GGATGAATTG ACCACCTCAG AACGGGGGTA TATCTGGCTC TGTGCTTATG AGTACCATTA CCGGGAGGAT TATTTCGCTG CGCTGACTCA GAATCATGGG CGTGGCCGCT GGGCCAACCG GAATGAACGC CCAGAGGCAC AACTGATCTT CTGTTTCGAC GACCGGGAGG AAGGTATTCG CCGCCATTTA GAGGAAGTCA ACCCTAACCT TGAAACCTTG GGGGCGCCAG GCTTTTTCGG TGTACCCATT CAGTGGCGCG GGTTAGATTA CCCTGATACC ACCCCCCATT GTCCGGTTGT GGTAACCCCT GTTAATGAAC TTCATGAAGA ACCTCGTCCG GAAGCAAAAA AACGGTACGG CCTCCATAAA CGCCTCTATA ACTTCAAGCA ATTTCTACTT CGCGTACTTC ATAATAAAAC CCGCCGCGAC CTATTGACTT CAAAGGTATT AATCGATGTC CTCTTCCCTG GAATGCTAGC TGTGCTAGCA GGCAAGGTTT TCTTTCCCTT CCAGCAAGCA TCGCTAAAAC GCAAAGCTAC CGCTGCGCTT GTGCCACCCG TACCGACCCA ATTGAAACTT ACTGTACCGG ACGACGGCAC GGAAGCCACA CCCGATAATC TACGTGTAGG ATTCACCGAT GCTGAGCAAG CTGAACGATT AGCAGCCTTT CTACGCGCTA TCGGGTTTAC CTCTGGCTTT GCGCCGCTGG TGGTGCTATC CGGGCATGGC TCCATGAGCG AAAATAACCC CCAGCTAGCT TCTTATGACT GCGGCGCCTC CGGCGGGCGC CATGGCGGAC CCAATGCCCG CGCTTTTGCG GCCATGGCTA ACCGGCCGGA AATACGGGCA CGGCTCGCTG AACAAGGTAT CCACATTCCT GACGACACCT GGTTCATCGG CACGGAACAT GATACTTGTT CCGAATCGTT CCCCTGGTTC GATCTGGATA AAGTGCCCGC TAATTTCGCG CCCGCCCTGA AGAAATTGAA AGCCGAAGTT GATCAGGCCC TCCTGCTGTC AGCCCATGAA CGTTGTCGGC GCATGGCTTC GGCACCCCGT AAACCTAGTC TACAGCAGGC CAGGAGACAT GTTGCCGAAC GTGGCACGGA CTTCAGCCAG GCCCGGCCAG AGCTAGGCCA TGCCACGGTT GCCTCGGCGC TTATCGGGCG CCGTTCCGTC ACCCGGGGGA TATTTCTAGA CCGCCGCTGC TTCGTGCTTT CTTATGATCC AACTATTGAT GACGCAGAAG GCACTATTCT TGAAGGCGTC CTTAAGAACG CTGGCCCGGT AGGCGTAGGT ATTAATCTGG ATTATTACTT TTCGGCCGCC AATAACCAGG GATTTGGTAG TGGCTCAAAG GTAGCCCAAA ACGTAACCGG TCTATTTGGC GTCATGCAGG GTATTGACGA TGATTTACGC ACCTGGTGTT CCTATCAAAT GGTTGATGTT CATGAGCCCA TGCGCGTTCT AACCGTAGTG GAAGCAACCA CGGAAACCCT GACCGCTATC TACAAGCGCC AACCCTCTTC CCAAGAGCCC GCCGGAGGTA GCTGGTTGCT GCCACCTTTG CGCCAGCTCA TTGATGGCGG TTGGCTGCTG TTAGCAGCTA TCCACCCAAA AACCGGCAAA ATTTCGGTGT TCGATCCTAA GCAAGGATTT ATTCCCTGGA AAAGCTATCG GGAACCCTCA CCCTTGCCGG TGGTGGAGCG CTCCATGGAC TGGTATGACG GTTATAGCGA TCCTAGGCCA CCTGCGCTAG TTGAACCCAA ACAGACGGAG ACCCATCATG CTGCCTGA
|
Protein sequence | MSAHHAEQAE SPLDNREKLT TAVDKLELVL PCAAPLYDFM HLNTMQGYHH ISFAEAMAAH FELTGIRGYL PEEDFRKHYA RGRIDDADLN ESLANNTHGR NREIVLKVSG RPINKQDIWR ISLIQDINPL SPSRFRWQIK EYDALERFQN GVPKSARDTL LNATQETDQN RRTESQAIRD LWEACLRVFQ LENPNLHSEE LGELVDLEEF SRSQAEGKQS KFRTSQPTLI PQEEMLAEAR KDLHHLVDKV GEELSLRGLL QALTGEDLLD QVRPILIRFC ASHLDEGFTA WSLPERGQGL YAAWRKCPFA ELGLDLARLP DWQSFHAELP EHSVDAVIAC LERLKIPESR WEGYLKRIAV ELPGWSGLIN WRHHRPKYKP NRKAPTSLMD YLAIRLFLDV IHIEQVTQNT WGIAGNLEEL KTYFENYLWE FSGRYALFSN TLPEYLAIRA QELIALPRTA QKDRENWRTV SNMIHNWKHN PSAERTKRQT VHSHVWRLFC LAQHLGLPGN EVGKLSSSEA EQLLAILDEL TTSERGYIWL CAYEYHYRED YFAALTQNHG RGRWANRNER PEAQLIFCFD DREEGIRRHL EEVNPNLETL GAPGFFGVPI QWRGLDYPDT TPHCPVVVTP VNELHEEPRP EAKKRYGLHK RLYNFKQFLL RVLHNKTRRD LLTSKVLIDV LFPGMLAVLA GKVFFPFQQA SLKRKATAAL VPPVPTQLKL TVPDDGTEAT PDNLRVGFTD AEQAERLAAF LRAIGFTSGF APLVVLSGHG SMSENNPQLA SYDCGASGGR HGGPNARAFA AMANRPEIRA RLAEQGIHIP DDTWFIGTEH DTCSESFPWF DLDKVPANFA PALKKLKAEV DQALLLSAHE RCRRMASAPR KPSLQQARRH VAERGTDFSQ ARPELGHATV ASALIGRRSV TRGIFLDRRC FVLSYDPTID DAEGTILEGV LKNAGPVGVG INLDYYFSAA NNQGFGSGSK VAQNVTGLFG VMQGIDDDLR TWCSYQMVDV HEPMRVLTVV EATTETLTAI YKRQPSSQEP AGGSWLLPPL RQLIDGGWLL LAAIHPKTGK ISVFDPKQGF IPWKSYREPS PLPVVERSMD WYDGYSDPRP PALVEPKQTE THHAA
|
| |