Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2556 |
Symbol | |
ID | 3786282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2924736 |
End bp | 2926697 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637812647 |
Product | squalene cyclase |
Protein accession | YP_413237 |
Protein GI | 82703671 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.511022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TCGGAGGTAT GGCGAGAACT TCATTGCAAG CTCAATCCCC GGGCTCAAAC AATACTCCCT CAATGGACGA AAAAATGCTG AAAGCGGGAC TGGAAGCGGC ACGCGGTGCG CTGCTGGCTC AGCAGAGGGA GGATGGCCAC TGGTGCTTTC CATTGGAGGC CGATTGCACG ATTCCCGCCG AGTATATACT GATGATGCAC TTCATGGATG AAGTGGACCT GGATCTGGAG GTCAGGATTG CCCGTTTTAT TCGTGAAAAA CAGGATGTAG CGCACGGAGG CTGGCCGCTC TACTACGGCG GTGAATTCGA TCTGAGTTGT TCGGTCAAGG CCTATTATGC TCTGAAAATT GTGGGGGACT CCCCGGACGC GCCCCACATG GTGCGTGCGC GTGCAGCAAT TCTGAAACAT GGCGGCGCAG CGAGAGCAAA TGTATTTACA CGCCTGCTGC TTGCCATGTA TGACCAGCTT CCCTGGCGAG GCGTGCCATT TGTGCCGGTG GAAATCATAC TGTTCCCCAA GTGGTTTCCA TTCCATACCA GCAAGGTCGC ATACTGGTCG CGGACCGTAA TGGTTCCGCT ATCTATCCTG TGCAGCCTGA AGGCGCGCGC GGCGAATCCG CGCAAAGTCG CCATTCGTGA ATTGTTTACG GTACCTCCGG GGGAGGAGCG GAATTATTTT CCGGTACGTA CTGCGCTCAA TCGCGTGTTT CTGCTGATCG AGCGCACTCT CTCCCTGCTC GAACCCTTCA TACCCCAAGG GGTGCGCAGA TTGGCGCTGC GGCGGGCGGA AAGCTGGATC GTGGAAAGGC TGAACGGCGA CTCAGGCCTG GGGGCGATCT TTCCTGCCAT GGTGAATGCC GGCGAAGCTC TGGCATTGCT TGGATACCCA TATGATCATC CCGCGCGGGA GCAATGCCGC AAGGCGCTGC GCCTGCTGCT GGTGGAGGAA GGCGAGCGTA CCTGGTGCCA GCCCTGTGTT TCCCCAGTAT GGGATACCGT CCTGACCTGT CTCGCCTTTC AGGAGGATAC GGAGGTCGAT CAGAAGCCCA TCCGGAAAGC GCTCGACTGG CTGGTTCCCT GTCAGGTGCT CGATGCGCCG GCGGACTGGC AAGAAGATCA TCCCGGATTA CCGGGCGGGG GCTGGGCTTT TCAGTATGCA AATCCCCACT ATCCCGATCT CGACGACACG GCTGCGGTGG CGTGGGCACT GTACCAGGCC GACCCTAAGG CTTATCAGGA AAGCATCAGC CGGGCCGCCG ACTGGCTGGC GGGGATGCAG TCCAGCAATG GGGGGTTTGC GGCTTTCGAC AGCGACAATA CGTATTACTA TCTCAACGAG ATCCCGTTTG CCGATCATGG GGCGCTCCTT GATCCTCCCA CCAGTGATGT TTCCGCCCGC TGTGCAGGTT TTCTCGCCTT GTACGGCCAG TCCAGGCATA AACAGGCGCT GGAACGGAGT CTGGCATACC TCTTCAATGA ACAGGAGGCC AGTGGCGCCT GGTTCGGCCG CTGGGGCAGC AATTACATCT ATGGAACGTG GTCTGTGCTG GAGGCCTTCC GTCTGGCCGG GATAGATGCC GGTCATCCTG CCATCCGACG CGCGGTGCAC TGGCTCAAAT CCGTGCAGCG GGAGGATGGC GGATGGGGAG AAAGTAACGA CAGCTATCTT TCCCCCCAGC AGGCAGGCCA GTTTCATACC AGTACTTCTT TTCATACCGC ATGGGCACTG CTTGCACTGA TGGGAGCCGG CGAATGGCGA AGTCATGAGG TTCACCGGGG AATTGCCTAC CTCTTGCGGG AACAGGACAG CGACGGGCTC TGGCATGAGC CCTGGTTTAC CGCTCCCGGC TTCCCGCGGG TTTTCTACCT CAAGTACTAC GGGTATACGA AATATTTCCC GGTATGGGCA TTGACCCGAT TTCATGCATT GAACCGGAAG TTCCCCGGGT GA
|
Protein sequence | MKKFGGMART SLQAQSPGSN NTPSMDEKML KAGLEAARGA LLAQQREDGH WCFPLEADCT IPAEYILMMH FMDEVDLDLE VRIARFIREK QDVAHGGWPL YYGGEFDLSC SVKAYYALKI VGDSPDAPHM VRARAAILKH GGAARANVFT RLLLAMYDQL PWRGVPFVPV EIILFPKWFP FHTSKVAYWS RTVMVPLSIL CSLKARAANP RKVAIRELFT VPPGEERNYF PVRTALNRVF LLIERTLSLL EPFIPQGVRR LALRRAESWI VERLNGDSGL GAIFPAMVNA GEALALLGYP YDHPAREQCR KALRLLLVEE GERTWCQPCV SPVWDTVLTC LAFQEDTEVD QKPIRKALDW LVPCQVLDAP ADWQEDHPGL PGGGWAFQYA NPHYPDLDDT AAVAWALYQA DPKAYQESIS RAADWLAGMQ SSNGGFAAFD SDNTYYYLNE IPFADHGALL DPPTSDVSAR CAGFLALYGQ SRHKQALERS LAYLFNEQEA SGAWFGRWGS NYIYGTWSVL EAFRLAGIDA GHPAIRRAVH WLKSVQREDG GWGESNDSYL SPQQAGQFHT STSFHTAWAL LALMGAGEWR SHEVHRGIAY LLREQDSDGL WHEPWFTAPG FPRVFYLKYY GYTKYFPVWA LTRFHALNRK FPG
|
| |