Gene Plim_1904 details


Gene Information

Locus tag: Plim_1904
Symbol:
ID: 9138606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 2481617
End bp: 2483734
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: squalene-hopene cyclase
Protein accession: YP_003629933
Protein GI: 296122155
COG category:
COG ID:
TIGRFAM ID:
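
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the stop codon while Protein Length does not. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2481617, 2483734)
print(length_bp)                  # 2118
print(protein_length(length_bp))  # 705
```

Both values agree with the Gene Length and Protein Length fields reported for Plim_1904.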


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTCAG GTACTTTTGG TGCGAAACGA GTCGACCTGC TGGCAGCATT CGAACATTCG 
GCACCTGCTG AAAAAACGCG AGAAACCTGT GTGGGATTAC AGACCGCGAT TGCGAGAACT
CGCCAGTATC TTCTCGATCA GCAACATTCC GAGGGATTCT TCGTTGCTGA GCTTGAAGGA
GATACGATCC TCGAATCCGA GTACATCCTC CTGCTGGCCT TTCTGAATGA AGGGCAATCG
CCTGATGCAC AGGCAGCAGC CAGGTATCTG CTGACAAAGC AGAATACAGA TGGCAGTTGG
AGCAACTTCC CAGGTGGCCC CATCGATGTC AGTTGTGCTG TCAAAGCTTA CCTGGCCTTG
CGAATTACAG GTCATGCAGC CGATGAACCC GCTTTAATCA GGGCTCGTGA AGCCATTCTG
CAGGCAGGTG GCGTCGAGCG TGTCAACAGT TTTACGAGAT TCTACCTCGC CATGCTGGGG
CTTATTCCTT ATTCGCTGTG CCCGGCTGTT CCGCCAGAAG TGGTTCTCTT GCCCGATTGG
TTCCCGATCA ATCTTTCACA AATGTCAGCA TGGTCGCGGA CGATTGTCGT CCCACTGAGC
TTGCTTTGGG CCTTTCAACC CGCAGTGGAA TTGAACGACG CGGACGGCCA TCAGATCACC
ATCGAGGAGT TGTATGCCTC TCCTGAGAAG CAGTTGCCCC GGTTTATTCG CGGTGTGAAT
CATGAGTCGA ACTCCAACGG CTGGATGAAC TGGAGTCGAT TTTTCTTTCG CGTCGATCAA
TGCCTGAAGT CCATCGAAAG TTATGGAATC AAACCTTTGC GGTCGCGTGC AGTCCGCAAA
TGTGTGCAGT GGATCCTGGA TCGGCAGGAG ATGAGCGATG GATTGGGAGC GATCTTTCCT
CCGATCGTCT GGACGTTGAT CGGGCTTAAG TGTGCCGGGT TTGACGATCA GCATCCAATG
GTTCAAAAAC AGCGGGACGA ATTGAATCGA CTGATGCTCC GCGAGCAGGA TGCGTTACGC
CTGCAGCCTT GCTTGTCTCC CGTATGGGAT ACTGCAATTT CGATCATTGC CTTGAGGGAG
TCGGGAGTCG AGCCAGATCA TCCTGCACTA TCTAAAGCCC GGAACTGGCT GCTGAGTAAA
GAAGTCCGCC ATGCCGGTGA CTGGTCGAAA GCGCATCCCG AGACCCCTGT CTCCGGATGG
TATTTCGAGT TCAATAATGA GTTTTATCCC GATGTTGATG ATACCGCGAT GGTGCTGATT
GCTCTGGCTT CCACATTACC AGAAGAGGCG ACGCCGTTAG CGATTTCCCA TGGAGTTCTT
CCTGTCCAGA CCGGTTGGAG TGCAGAAAGT ACCTCTCGCG TACAGGCACT CAAGCAACTG
GAAAATCATC GGCCAGTCTT AGAAGCCATG GGGCGCGGTG TGCAGTGGCT TAAGGCACTT
CAGTCCAAAG ATGGTGGATG GGGAGCTTTC GATTCGGATA TCAACAAGGA ACTACTGACA
AAAGTTCCTT TCGCTGACCA TAACGCCATG CTTGATGAGA CGAATGCCGA TATTTCGGCT
CGCGTCCTCG AAGCGTATGC AGCCGTGGGG ATCAGTTTCA ATGATCCATC TGTGCAAAGA
GCGCTGGAGT TCATCTGGAA TGATCAGGAG GACGATCATG CCTGGTATGG TCGCTGGGGC
GTTAACTACA TCTACGGCAC ATGGCAAGTT CTGGTGGGAC TGACTGCCAT TGGTATTTCC
GCCCATGATC CTCGCTTAGT ACGTGCGGCG GGTTGGCTCA AGAGCAAGCA GCAGGCCTGT
GGTGGCTGGG GTGAAACTCC CGCCACTTAT GATAATCCGA CCCTCAGGGG ACAAGGCACA
CCGACTGCCT CACAAACGGC ATGGGCTGTA CTGGGTTTGA TTGCAGCCGG TGAGCAGAAC
TCGATTGAAT GCCAGCGGGG CGTGGAATTC CTGCTGAAAA CTCAGAAACA TAATGGCACA
TGGGACGAAG AAGAATTTAC GGGGACAGGC TTTCCCAGGG TCTTCTACCT GCGATACCAC
TATTACCCCC TCTACTTCCC ACTCATGGCA CTGGGGCGTT TTGCCAGAGC TGGTGGAAGA
GTAAATTTTG CAGGATGA
 
Protein sequence
MTSGTFGAKR VDLLAAFEHS APAEKTRETC VGLQTAIART RQYLLDQQHS EGFFVAELEG 
DTILESEYIL LLAFLNEGQS PDAQAAARYL LTKQNTDGSW SNFPGGPIDV SCAVKAYLAL
RITGHAADEP ALIRAREAIL QAGGVERVNS FTRFYLAMLG LIPYSLCPAV PPEVVLLPDW
FPINLSQMSA WSRTIVVPLS LLWAFQPAVE LNDADGHQIT IEELYASPEK QLPRFIRGVN
HESNSNGWMN WSRFFFRVDQ CLKSIESYGI KPLRSRAVRK CVQWILDRQE MSDGLGAIFP
PIVWTLIGLK CAGFDDQHPM VQKQRDELNR LMLREQDALR LQPCLSPVWD TAISIIALRE
SGVEPDHPAL SKARNWLLSK EVRHAGDWSK AHPETPVSGW YFEFNNEFYP DVDDTAMVLI
ALASTLPEEA TPLAISHGVL PVQTGWSAES TSRVQALKQL ENHRPVLEAM GRGVQWLKAL
QSKDGGWGAF DSDINKELLT KVPFADHNAM LDETNADISA RVLEAYAAVG ISFNDPSVQR
ALEFIWNDQE DDHAWYGRWG VNYIYGTWQV LVGLTAIGIS AHDPRLVRAA GWLKSKQQAC
GGWGETPATY DNPTLRGQGT PTASQTAWAV LGLIAAGEQN SIECQRGVEF LLKTQKHNGT
WDEEEFTGTG FPRVFYLRYH YYPLYFPLMA LGRFARAGGR VNFAG
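
The protein sequence can be recovered from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method: it builds the codon assignments shared by the standard code and translation table 11 (the two are understood to differ in permitted start codons, not in codon meanings) and translates the first and last blocks of the gene sequence shown above. The names `CODON_TABLE` and `translate` are hypothetical and introduced only for this example.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


first_block = "ATGACTTCAG GTACTTTTGG TGCGAAACGA"  # first 30 bases of the gene
last_block = "GTAAATTTTG CAGGATGA"                # final 18 bases of the gene

print(translate(first_block))  # MTSGTFGAKR
print(translate(last_block))   # VNFAG
```

The two outputs match the first ten and last five residues of the protein sequence listed above. The final 18 bases are in frame because the 2100 bases preceding them are a multiple of three.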