Gene Tery_0727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0727 
Symbol 
ID4243175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1175476 
End bp1177374 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content40% 
IMG OID638106019 
Productsqualene-hopene cyclase 
Protein accessionYP_720632 
Protein GI113474571 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.548053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.146033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGAAA AAAACAAAGT AAAACAGTCT ATATTAGCTA GTCAAAAGCA CTTATTAAGT 
TTACAAGAAA CAGAAGGATA CTGGTGGGGT CAACTAGAAT CAAATGTCAC AATAACAGCA
GAAATAATTT TACTACATAA AATTTGGCAA ACTGATAAAA AAATACCATT GAATAAGGCA
AAAAATTACC TAATATCTCA ACAACGAGAA CATGGTGGTT GGGAACTATT TTATGGGGAT
GGAGGAGACT TAAGCACTTC TATTGAAGCT TATATGGCTT TGAGATTATT GGGTGTTTCA
AGAACAGATC CAATTATGAT TGAAGCACAA AATTTTATTA TTAAAAAAGG TGGTATTAGT
TGCAGCAGAA TTTTTACTAA ATTACATTTG GCTTTAATTG GATGCTACAG CTGGCAGGGT
ATTCCTTCTA TTCCTTCTAG CATAATGCTA CTTCCTGAAG ATTTTCCATT TACTATTTAT
GAAATGTCAA GTTGGGCAAG AAGTAGTACT GTTCCACTAT TAATTGTGTT TGATAAAAAG
CCGATTTTTT CTGTTAACCC CACTATTAAT CTTGATGAGC TTTATGCCGA AGGAATTAAC
AATGCTAGTT TTGAATTACC TCGTAAATAC GATCTGACAG ATTTATTTTT GGGACTAGAT
AAGGCTTTTA AATTTGCAGA AAATTTGAAT TTGATGCCTT TGCAACAGGA AGGATTAAAA
GCAGCAGAAA AATGGATTTT AGAAAGGCAA GAGGTTACGG GTGACTGGGG GGGTATTATT
CCGGCTATGT TAAATTCTAT GTTGGCATTA AAATGCTTGG AATATGATGT GGCTGACCCT
GTGGTGGTGC GGGGACTGGA GGCTATAGAT AGGTTTGCTA TAGAAAATGA GGATAGTTAT
CGGGTACAAG CCTGTGTATC TCCAGTGTGG GATACCGCTT GGGTAATACG TTCGTTGGTT
GATTCTGGTA TTTCTCCTAG TCATCCGGCA ATGGTTAAGG CTGGACAATG GTTGTTGCAA
CAGCAAATTT TGGATTATGG TGATTGGGTG TTTAAGAATA AATTTGGTAA ACCGGGGGGT
TGGGCGTTTG AATTTATGAA TCGTTTTTAT CCAGATATAG ATGATACGGC GGTAGTGGTT
ATGGCTTTGG ATGTTGTAGA GTTGCCTGAT GAGGATCTGA AAGGTAAGGC GATCGCTCGT
GGCATGGAGT GGATAGCTTC GATGCAATGT GAAGCTGGTG GTTGGGCAGC TTTTGATGTA
GATAATAATC AAGACTGGTT GAATGCTACT CCTTATGGAG ATTTAAAGGC GATGATCGAT
CCGAATACGG CAGATGTGAC GGGGCGGGTA TTGGAAATGG TTGGTTGTTG TGGGTTAGCT
ATGGATAGTT GGCGGGTGAA ACGGGGGATC GATTTTTTGG TAAGGGAGCA GGAAGAGGAA
GGTTGTTGGT TTGGTCGGTG GGGAGTTAAT TATATATACG GTACGAGTGG AGTTATTTTG
GCTTTGGCTG TTATGGCACG GGAAAGTCAC CGAGGTTATA TAGAGAGGGG GGCAAGTTGG
TTGGTTGGTT GTCAAAATTC GGATGGTGGA TGGGGTGAAA GTTGTTGGAG TTATAATGAT
CCGTCGTTGA AGGGGAAGGG GAAAAGTACG GCTTCTCAGA CTGCTTGGGC GTTAATTGGT
TTATTGGCTG CGGGGGAGGG GACTGGTAAT TTTGCTAGGG ATGCTATTGA TGGGGGTGTG
GGTTTTTTAG TTTCGACTCA GAATGATGAT GGGAGTTGGC TGGAGGATGA GTTTACTGGT
ACTGGGTTTC CTGGTCATTT TTATATTAAG TATCATTTTT ATTCACAGTA TTTTCCTTTG
ATGGCTTTGG GAAGGTATGA GAGTTTGTTA AGTGGTTGA
 
Protein sequence
MIEKNKVKQS ILASQKHLLS LQETEGYWWG QLESNVTITA EIILLHKIWQ TDKKIPLNKA 
KNYLISQQRE HGGWELFYGD GGDLSTSIEA YMALRLLGVS RTDPIMIEAQ NFIIKKGGIS
CSRIFTKLHL ALIGCYSWQG IPSIPSSIML LPEDFPFTIY EMSSWARSST VPLLIVFDKK
PIFSVNPTIN LDELYAEGIN NASFELPRKY DLTDLFLGLD KAFKFAENLN LMPLQQEGLK
AAEKWILERQ EVTGDWGGII PAMLNSMLAL KCLEYDVADP VVVRGLEAID RFAIENEDSY
RVQACVSPVW DTAWVIRSLV DSGISPSHPA MVKAGQWLLQ QQILDYGDWV FKNKFGKPGG
WAFEFMNRFY PDIDDTAVVV MALDVVELPD EDLKGKAIAR GMEWIASMQC EAGGWAAFDV
DNNQDWLNAT PYGDLKAMID PNTADVTGRV LEMVGCCGLA MDSWRVKRGI DFLVREQEEE
GCWFGRWGVN YIYGTSGVIL ALAVMARESH RGYIERGASW LVGCQNSDGG WGESCWSYND
PSLKGKGKST ASQTAWALIG LLAAGEGTGN FARDAIDGGV GFLVSTQNDD GSWLEDEFTG
TGFPGHFYIK YHFYSQYFPL MALGRYESLL SG