Gene Glov_2233 details


Gene Information

Locus tag: Glov_2233
Symbol:
ID: 6367430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter lovleyi SZ
Kingdom: Bacteria
Replicon accession: NC_010814
Strand:
Start bp: 2382488
End bp: 2384542
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 56%
IMG OID: 642677648
Product: squalene-hopene cyclase
Protein accession: YP_001952469
Protein GI: 189425292
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
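
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the stated gene length) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2382488
END_BP = 2384542
GENE_LENGTH_BP = 2055
PROTEIN_LENGTH_AA = 684


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 2055, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))    # 684, matches Protein Length
```

Both values agree with the record: 2384542 - 2382488 + 1 = 2055 bp, and 2055 / 3 = 685 codons, of which one is the stop codon.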


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000414929
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTCAA GAAAATACCC GATTTCCCAC GCTCTGACAT CGTTTAATCA TACCACGGTA 
GCACCAGTGG AAGCTCCGGC CCCTATCAGT GTCAAGAGCC CCGCAAAAGT CCACCGTCTG
CCCTCTTCGA TCTGGAAGAA GATGGAGGGC AGCGCAGGCA ATCCTTTGGA TAAGGCCGTT
GAACTGACCC GCGACTTCTT TTTCCGGGAG CAATTGCCGG ATGGTTACTG GTGGGCAGAG
CTTGAATCAA ACGTCACCAT TACGGCAGAG TACATCATGC TGTTCCACTT CCTTGGCATG
GTTGACAAGG ACAAAGAGCG CAAGATGGCC AACTATCTGC TGCGCCAGCA GACTGAAGAA
GGCTACTGGA CGGTCTGGCA CAACGGACCG GGTGACCTTT CCACCACCAT TGAAGCCTAC
TTCGCCCTGA AGCTGGCCGG CTATCATGCT GATCACATTG CCCTGCGTAA AGCCCGCGAC
TTTATCCTGG CCAATGGCGG CATCCTGAAG TCACGGGTTT TCACCAAAAC GTTTCTGGCC
ATGTTCGGTG AGTTCTCCTG GCTGGGGGTT CCCTCCATGC CGATCGAGCT GATGCTGTTG
CCTGACTGGG CCTACCTGAA TGTCTATGAA TTCTCCAGCT GGGCCCGTGC CACCATCATC
CCGATGTCTG TGTTAATGGC CAATCGTCCG GTCTATAAAC TACCGCCCCA TGCACGGGTG
CAGGAGCTGT ACGTGCGTCC GCCCCGGCCC ACCGATTACA CCTTTACCAA GGAAGACGGC
ATCTTCTCCC TGAAGAACTT CTTTATCGGT GTAGATCACC TGCTCAAGAT CTACGAATCA
AGCCCGATCC GCCCTTTCAA GAAGCGGGCA ACCGAAAAGG TGGAACAATG GATACTTGAA
CACCAGGAGA AAACCGGTGA CTGGGGCGGC ATCCAGCCCG CCATGCTGAA CGCCATCCTG
GCGCTGCACT GCCTGGGCTA TGCCAATGAC CATCCGGCCG TGGCAAAAGG CTTGGAGGCC
CTGGCCAATT TTACGATTGA AGACAGTGAC TCCTTGGTGC TGCAATCATG CATCTCACCG
GTCTGGGATA CGGCACTGGT GCTGCAAGCC ATGCAGGAGG CCAGTGTTCC TTTGGATCAC
CCGTCTCTGA TCAAGGCGTC CCAGTGGCTG CTCGATCGTG AGGTACGGAT CAAGGGCGAC
TGGAAGATTA AATCACCTGA TCTGGAACCT GGCGGCTGGG CCTTTGAATT TCAGAACGAC
TGGTACCCCG ACGTGGACGA CTCGACAGCC GTGATGATCG CTATCAAGGA TATCAAAGTC
AAGAACACCA AAGCCCGTCA GGATGCCATC CGGCGCGGCA TTGACTGGTG TCTGGGGATG
CAGAGCGAAA ACGGCGGCTG GGCAGCCTTT GACAAAGACA ACACCAAGCA TATGCTGAAC
AAGATCCCGT TTGCTGATCT GGAGGCATTG ATCGACCCCC CTACTGCTGA TCTGACCGGG
AGGATGCTGG AGCTGATGGG CAATTTTGGC TACACCAAAG ACCATCCCCA GGCAGTCAGC
GCCCTGGAGT TTCTAAAAAA CGAACAGGAG CCGGAGGGCC CCTGGTTCGG CAGATGGGGC
GTCAACTACA TTTACGGCAC GTGGTATGTG TTGATCGGGC TGGAGGCGAT CGGCGAGGAC
ATGAACAGCC CCTATATCAA GAAGTCGGTC AACTGGATCA AGTCCCGGCA GAACCTGGAT
GGAGGTTGGG GTGAAGTCTG TGATTCCTAC TGGGACCGCA CCCTGATGGG CTGCGGCCCC
AGCACCGCCT CCCAGACATC CTGGGCCCTG ATGGCCCTGA TGGCCGCCGG TGAGGTCGGC
TGCCAGGCGG TTGAGCGGGG GATTCAGTAT CTACTGGCCA CCCAGAACAG TGACGGCACC
TGGGATGAAG AGGCATTTAC CGGCACCGGT TTCCCCAAGT ATTTCATGAT CAAGTACCAT
ATTTATCGCA ACTGCTTCCC GCTGACCGCC TTGGGCAGAT ATCGCCGGCT GACGGCGGGG
ACGCACGCAC AGTAA
 
Protein sequence
MKSRKYPISH ALTSFNHTTV APVEAPAPIS VKSPAKVHRL PSSIWKKMEG SAGNPLDKAV 
ELTRDFFFRE QLPDGYWWAE LESNVTITAE YIMLFHFLGM VDKDKERKMA NYLLRQQTEE
GYWTVWHNGP GDLSTTIEAY FALKLAGYHA DHIALRKARD FILANGGILK SRVFTKTFLA
MFGEFSWLGV PSMPIELMLL PDWAYLNVYE FSSWARATII PMSVLMANRP VYKLPPHARV
QELYVRPPRP TDYTFTKEDG IFSLKNFFIG VDHLLKIYES SPIRPFKKRA TEKVEQWILE
HQEKTGDWGG IQPAMLNAIL ALHCLGYAND HPAVAKGLEA LANFTIEDSD SLVLQSCISP
VWDTALVLQA MQEASVPLDH PSLIKASQWL LDREVRIKGD WKIKSPDLEP GGWAFEFQND
WYPDVDDSTA VMIAIKDIKV KNTKARQDAI RRGIDWCLGM QSENGGWAAF DKDNTKHMLN
KIPFADLEAL IDPPTADLTG RMLELMGNFG YTKDHPQAVS ALEFLKNEQE PEGPWFGRWG
VNYIYGTWYV LIGLEAIGED MNSPYIKKSV NWIKSRQNLD GGWGEVCDSY WDRTLMGCGP
STASQTSWAL MALMAAGEVG CQAVERGIQY LLATQNSDGT WDEEAFTGTG FPKYFMIKYH
IYRNCFPLTA LGRYRRLTAG THAQ
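
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI-style base ordering `TCAG`. The amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ only in the permitted initiation codons), so one mapping serves here. The helper name `translate` is hypothetical, and the sketch ignores alternative start codons and ambiguous bases.

```python
from itertools import product

# Codon order follows the NCBI genetic-code layout: bases T, C, A, G.
BASES = "TCAG"
# Amino acids for table 11 (same assignments as the standard code); '*' = stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is removed
    before translation. Trailing bases that do not fill a codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGTCAA GAAAATACCC GATTTCCCAC"))  # MKSRKYPISH
# Last 15 bases of the gene sequence, ending in the TAA stop codon.
print(translate("ACGCACGCAC AGTAA"))                  # THAQ
```

The two outputs match the first ten and last four residues of the protein sequence listed in this record.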