Gene Noc_1321 details


Gene Information

Locus tag: Noc_1321
Symbol:
ID: 3706258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1470781
End bp: 1471881
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 50%
IMG OID: 637737820
Product: squalene/phytoene synthase
Protein accession: YP_343349
Protein GI: 77164824
COG category: [I] Lipid transport and metabolism
COG ID: [COG1562] Phytoene/squalene synthetase
TIGRFAM ID:
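
The length fields in this record can be cross-checked from the coordinates. The sketch below is illustrative only and not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1470781
END_BP = 1471881
REPORTED_GENE_LENGTH = 1101      # bp
REPORTED_PROTEIN_LENGTH = 366    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under those assumptions both checks agree with the reported Gene Length and Protein Length.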


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGGCT CTTATCATAC TAGTTACGAT TCAGCTGATA AAAACTACCA GGATTATATT 
CTTCCTGGCG TCTCCCGTAC CTTTGCCCTC ACTATTCCCC AACTCCCTCC CCCCTTGCAA
GAGGTGGTAG CCAATGGCTA TCTCCTGTGC CGCATAGCCG ATACCATTGA GGATGAACCC
ACCCTCAGCA TCGCCCAGAA AAAATACTTT TCCAACCTTT TCGTTACCGT AGTCGCGGGG
CAAACCTCGG CTGAATCTTT TGCCTGCTCC CTATACCCTC TCCTATCGGA GCATACCCTA
GCTGCCGAAC GGGAACTTAT TCAAAATGCG CCACGGATTC TCCGGATCAC TTACAGCTTC
AACCCCCGCC AACGAGCTGC CTTGGAACGC TGTGTATGCA TTATGTGCGA CGGTATGCCT
CGCTTTCAGA ATACCGCCAG CTTACGAGGC CTAGCCGATA TGGAGGCCAT GGACCAATAT
TGCTACTTTG TCGCCGGCGT CGTCGGCGAA ATGCTGACGG AACTTTTCTG TGATTACTCT
CCTGGGATCA ACCGCAACCG TGAAGCTCTA CGCAATCTCA TGGTCTCTTT TGGCCAGGGT
TTGCAGATGA CCAACATCCT CAAGGACATC TGGGACGATA GAAAAAGGCG GATTTGCTGG
CTGCCGCGCA CCGTCTTCGA ACAAGCAGGC TTTAATTTGG ATAACCTTGA GCCCGGCCAC
TATCAATCTG CCTTTGGCGA TGGTCTCCAA CACCTCATTG GCGTCACCCA TGCCCATCTC
CGCAACGCCT TAACCTATAC CCTACTCATT CCCCCGGAAG AAGGAGGTAT CCGGCGTTTT
TGTCTATGGG CTATTGGCCT TGCCATGCTG ACATTGCGCA AACTTCATCG GCGTCGGAAT
TTTTCGGCTA GCTGGCAAGT CAAGATCTCC CGCCGTAGCG TAAAAACAAC AATACTCTTA
ACAAGTATTG CAGCAAATCA CGATAAGGTA TTAACATTCC TGTTTAATCT TGCGTCAAAA
GGTGTACCTT TTATACCACT AAAGGTAAAT AGCAACGACA GTAAACAAAC ATCTATGCCG
CAAGCGTCGC CAGAGAAATA G
 
Protein sequence
MMGSYHTSYD SADKNYQDYI LPGVSRTFAL TIPQLPPPLQ EVVANGYLLC RIADTIEDEP 
TLSIAQKKYF SNLFVTVVAG QTSAESFACS LYPLLSEHTL AAERELIQNA PRILRITYSF
NPRQRAALER CVCIMCDGMP RFQNTASLRG LADMEAMDQY CYFVAGVVGE MLTELFCDYS
PGINRNREAL RNLMVSFGQG LQMTNILKDI WDDRKRRICW LPRTVFEQAG FNLDNLEPGH
YQSAFGDGLQ HLIGVTHAHL RNALTYTLLI PPEEGGIRRF CLWAIGLAML TLRKLHRRRN
FSASWQVKIS RRSVKTTILL TSIAANHDKV LTFLFNLASK GVPFIPLKVN SNDSKQTSMP
QASPEK
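
The gene and protein sequences above are related through translation table 11, and the GC content is a simple base count. The following is a minimal sketch of both calculations, not the method used by the database. It assumes the sequence is pasted as space-separated 10-base blocks as shown above and contains only A, C, G and T. The names `clean_sequence`, `gc_fraction` and `translate` are hypothetical helpers. The amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons), so one 64-entry lookup is used.

```python
from itertools import product

# Codon order T, C, A, G at each position; the amino-acid string follows that order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean_sequence(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = "ATGATGGGCT CTTATCATAC TAGTTACGAT TCAGCTGATA AAAACTACCA GGATTATATT"
    print(translate(first_line))
    print(round(gc_fraction(first_line), 3))
```

Applied to the first line of the gene sequence, `translate` yields the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would be the way to compare against the reported GC content.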