Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0727 |
Symbol | |
ID | 4243175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1175476 |
End bp | 1177374 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638106019 |
Product | squalene-hopene cyclase |
Protein accession | YP_720632 |
Protein GI | 113474571 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.548053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.146033 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGAAA AAAACAAAGT AAAACAGTCT ATATTAGCTA GTCAAAAGCA CTTATTAAGT TTACAAGAAA CAGAAGGATA CTGGTGGGGT CAACTAGAAT CAAATGTCAC AATAACAGCA GAAATAATTT TACTACATAA AATTTGGCAA ACTGATAAAA AAATACCATT GAATAAGGCA AAAAATTACC TAATATCTCA ACAACGAGAA CATGGTGGTT GGGAACTATT TTATGGGGAT GGAGGAGACT TAAGCACTTC TATTGAAGCT TATATGGCTT TGAGATTATT GGGTGTTTCA AGAACAGATC CAATTATGAT TGAAGCACAA AATTTTATTA TTAAAAAAGG TGGTATTAGT TGCAGCAGAA TTTTTACTAA ATTACATTTG GCTTTAATTG GATGCTACAG CTGGCAGGGT ATTCCTTCTA TTCCTTCTAG CATAATGCTA CTTCCTGAAG ATTTTCCATT TACTATTTAT GAAATGTCAA GTTGGGCAAG AAGTAGTACT GTTCCACTAT TAATTGTGTT TGATAAAAAG CCGATTTTTT CTGTTAACCC CACTATTAAT CTTGATGAGC TTTATGCCGA AGGAATTAAC AATGCTAGTT TTGAATTACC TCGTAAATAC GATCTGACAG ATTTATTTTT GGGACTAGAT AAGGCTTTTA AATTTGCAGA AAATTTGAAT TTGATGCCTT TGCAACAGGA AGGATTAAAA GCAGCAGAAA AATGGATTTT AGAAAGGCAA GAGGTTACGG GTGACTGGGG GGGTATTATT CCGGCTATGT TAAATTCTAT GTTGGCATTA AAATGCTTGG AATATGATGT GGCTGACCCT GTGGTGGTGC GGGGACTGGA GGCTATAGAT AGGTTTGCTA TAGAAAATGA GGATAGTTAT CGGGTACAAG CCTGTGTATC TCCAGTGTGG GATACCGCTT GGGTAATACG TTCGTTGGTT GATTCTGGTA TTTCTCCTAG TCATCCGGCA ATGGTTAAGG CTGGACAATG GTTGTTGCAA CAGCAAATTT TGGATTATGG TGATTGGGTG TTTAAGAATA AATTTGGTAA ACCGGGGGGT TGGGCGTTTG AATTTATGAA TCGTTTTTAT CCAGATATAG ATGATACGGC GGTAGTGGTT ATGGCTTTGG ATGTTGTAGA GTTGCCTGAT GAGGATCTGA AAGGTAAGGC GATCGCTCGT GGCATGGAGT GGATAGCTTC GATGCAATGT GAAGCTGGTG GTTGGGCAGC TTTTGATGTA GATAATAATC AAGACTGGTT GAATGCTACT CCTTATGGAG ATTTAAAGGC GATGATCGAT CCGAATACGG CAGATGTGAC GGGGCGGGTA TTGGAAATGG TTGGTTGTTG TGGGTTAGCT ATGGATAGTT GGCGGGTGAA ACGGGGGATC GATTTTTTGG TAAGGGAGCA GGAAGAGGAA GGTTGTTGGT TTGGTCGGTG GGGAGTTAAT TATATATACG GTACGAGTGG AGTTATTTTG GCTTTGGCTG TTATGGCACG GGAAAGTCAC CGAGGTTATA TAGAGAGGGG GGCAAGTTGG TTGGTTGGTT GTCAAAATTC GGATGGTGGA TGGGGTGAAA GTTGTTGGAG TTATAATGAT CCGTCGTTGA AGGGGAAGGG GAAAAGTACG GCTTCTCAGA CTGCTTGGGC GTTAATTGGT TTATTGGCTG CGGGGGAGGG GACTGGTAAT TTTGCTAGGG ATGCTATTGA TGGGGGTGTG GGTTTTTTAG TTTCGACTCA GAATGATGAT GGGAGTTGGC TGGAGGATGA GTTTACTGGT ACTGGGTTTC CTGGTCATTT TTATATTAAG TATCATTTTT ATTCACAGTA TTTTCCTTTG ATGGCTTTGG GAAGGTATGA GAGTTTGTTA AGTGGTTGA
|
Protein sequence | MIEKNKVKQS ILASQKHLLS LQETEGYWWG QLESNVTITA EIILLHKIWQ TDKKIPLNKA KNYLISQQRE HGGWELFYGD GGDLSTSIEA YMALRLLGVS RTDPIMIEAQ NFIIKKGGIS CSRIFTKLHL ALIGCYSWQG IPSIPSSIML LPEDFPFTIY EMSSWARSST VPLLIVFDKK PIFSVNPTIN LDELYAEGIN NASFELPRKY DLTDLFLGLD KAFKFAENLN LMPLQQEGLK AAEKWILERQ EVTGDWGGII PAMLNSMLAL KCLEYDVADP VVVRGLEAID RFAIENEDSY RVQACVSPVW DTAWVIRSLV DSGISPSHPA MVKAGQWLLQ QQILDYGDWV FKNKFGKPGG WAFEFMNRFY PDIDDTAVVV MALDVVELPD EDLKGKAIAR GMEWIASMQC EAGGWAAFDV DNNQDWLNAT PYGDLKAMID PNTADVTGRV LEMVGCCGLA MDSWRVKRGI DFLVREQEEE GCWFGRWGVN YIYGTSGVIL ALAVMARESH RGYIERGASW LVGCQNSDGG WGESCWSYND PSLKGKGKST ASQTAWALIG LLAAGEGTGN FARDAIDGGV GFLVSTQNDD GSWLEDEFTG TGFPGHFYIK YHFYSQYFPL MALGRYESLL SG
|
| |