Gene Information |
Locus tag | Noc_1321 |
Symbol | |
ID | 3706258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1470781 |
End bp | 1471881 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637737820 |
Product | squalene/phytoene synthase |
Protein accession | YP_343349 |
Protein GI | 77164824 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | |
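The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon. Both assumptions are inferred from the reported numbers rather than stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1470781
end_bp = 1471881

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1101 366
```

Under these assumptions the results match the reported Gene Length (1101 bp) and Protein Length (366 aa).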
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGCT CTTATCATAC TAGTTACGAT TCAGCTGATA AAAACTACCA GGATTATATT CTTCCTGGCG TCTCCCGTAC CTTTGCCCTC ACTATTCCCC AACTCCCTCC CCCCTTGCAA GAGGTGGTAG CCAATGGCTA TCTCCTGTGC CGCATAGCCG ATACCATTGA GGATGAACCC ACCCTCAGCA TCGCCCAGAA AAAATACTTT TCCAACCTTT TCGTTACCGT AGTCGCGGGG CAAACCTCGG CTGAATCTTT TGCCTGCTCC CTATACCCTC TCCTATCGGA GCATACCCTA GCTGCCGAAC GGGAACTTAT TCAAAATGCG CCACGGATTC TCCGGATCAC TTACAGCTTC AACCCCCGCC AACGAGCTGC CTTGGAACGC TGTGTATGCA TTATGTGCGA CGGTATGCCT CGCTTTCAGA ATACCGCCAG CTTACGAGGC CTAGCCGATA TGGAGGCCAT GGACCAATAT TGCTACTTTG TCGCCGGCGT CGTCGGCGAA ATGCTGACGG AACTTTTCTG TGATTACTCT CCTGGGATCA ACCGCAACCG TGAAGCTCTA CGCAATCTCA TGGTCTCTTT TGGCCAGGGT TTGCAGATGA CCAACATCCT CAAGGACATC TGGGACGATA GAAAAAGGCG GATTTGCTGG CTGCCGCGCA CCGTCTTCGA ACAAGCAGGC TTTAATTTGG ATAACCTTGA GCCCGGCCAC TATCAATCTG CCTTTGGCGA TGGTCTCCAA CACCTCATTG GCGTCACCCA TGCCCATCTC CGCAACGCCT TAACCTATAC CCTACTCATT CCCCCGGAAG AAGGAGGTAT CCGGCGTTTT TGTCTATGGG CTATTGGCCT TGCCATGCTG ACATTGCGCA AACTTCATCG GCGTCGGAAT TTTTCGGCTA GCTGGCAAGT CAAGATCTCC CGCCGTAGCG TAAAAACAAC AATACTCTTA ACAAGTATTG CAGCAAATCA CGATAAGGTA TTAACATTCC TGTTTAATCT TGCGTCAAAA GGTGTACCTT TTATACCACT AAAGGTAAAT AGCAACGACA GTAAACAAAC ATCTATGCCG CAAGCGTCGC CAGAGAAATA G
|
Protein sequence | MMGSYHTSYD SADKNYQDYI LPGVSRTFAL TIPQLPPPLQ EVVANGYLLC RIADTIEDEP TLSIAQKKYF SNLFVTVVAG QTSAESFACS LYPLLSEHTL AAERELIQNA PRILRITYSF NPRQRAALER CVCIMCDGMP RFQNTASLRG LADMEAMDQY CYFVAGVVGE MLTELFCDYS PGINRNREAL RNLMVSFGQG LQMTNILKDI WDDRKRRICW LPRTVFEQAG FNLDNLEPGH YQSAFGDGLQ HLIGVTHAHL RNALTYTLLI PPEEGGIRRF CLWAIGLAML TLRKLHRRRN FSASWQVKIS RRSVKTTILL TSIAANHDKV LTFLFNLASK GVPFIPLKVN SNDSKQTSMP QASPEK
|
| |
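The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the database's own method: the function name translate_cds is hypothetical, and the codon table is the standard assignment, which translation table 11 shares for elongation codons (table 11 differs mainly in which start codons it permits). The sequence is displayed in space-separated blocks of 10, so whitespace is removed first. Although the record lists the strand as "-", the gene sequence appears to be given in coding orientation already, since its leading codons match the start of the protein sequence; no reverse complement is applied here.

```python
# Build the standard codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the blocks-of-10 spacing in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and last blocks of the gene sequence from the record.
print(translate_cds("ATGATGGGCT CTTATCATAC TAGTTACGAT"))  # MMGSYHTSYD
print(translate_cds("CAAGCGTCGC CAGAGAAATA G"))           # QASPEK
```

The two fragments reproduce the first block (MMGSYHTSYD) and the last block (QASPEK) of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural way to check the whole record.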