Gene Plim_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_0903 
Symbol 
ID9137588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp1157184 
End bp1159277 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content56% 
IMG OID 
ProductSqualene cyclase-like protein 
Protein accessionYP_003628946 
Protein GI296121168 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGATC TCACCCAAAA ACTTCAGCAA GCCTTACAGC TCGCCTCCCG AGCACTGCTG 
AATGAGCGTG TGCGACCTGG CCTGGCCCAT TGGGAAGGCG AACTCTCGAC CTCGGCATTA
TCCACAGCCA CAGCTGTAAT GGCCTTGTTT CAGTATGCCA AATGCCAACA GGCCAGTGGG
CGTCTGCAAA AGGTGTTCGA TGGGAAGTCT GAAGGCGACT GGCGACTGAT CGAACAAGGT
CTGGCGTGGC TCCTTCAGCA TCAGCTGGCC GACGGCGGCT GGGGCGATAC TGACAAAAGT
ATCTCGAACA TCAGTACCAC CATGCTCGCC CATGCCACGC TGGTCGCTTG CCGGGAAGCT
GTCAGACAAA AAAGTCTGGT GCTGAACGCC AGCGATATCG ACGCCGCCAT TGAGCGGAGT
GGCCGATTGA TCGAAGAGTT GGGGGGCATT CAGGCGATTC GTGATCGATA CGGGAAAGAT
CACACGTTTT CAGTCCCTAT CCTGACCCAT GCGGCACTGG CTGGGCTGGT GTCATGGAAT
GAGATCCCCG CACTCCCTTA TGAACTGGCA CTGTTGCCGC ATCGCTTTTT CGAGGTCATT
CAGCTCCCGG TCGTTTCGTA TGCCTTACCC GCTCTGATTG CGATCGGGCA GACGCTGCAT
TTGAGGCAGC GCACGTGGAA CCCGTGGTGG TGGGTGCGGC GAGCCGCCAT TCCGGGCACA
TTGCAGAAAT TGCAGAGTAT TCAGCCGGAA AGTGGAGGTT TTCTCGAAGC GACTCCACTG
ACGAGTTTCG TCACGATGTG CCTAGCGAGT GTCGGACGTG TCGATCATCC CGTCACACAG
GCAGGGCTTA AGTTTATTCG TGATTCCGTC CGGCCCGATG GAAGCTGGCC CATCGATACC
AATCTGGCCA CGTGGGTCAC CACACTTTCG ATCAATCATC TGGGGGCTGA GGCCTTTTCG
TCGGACGAGC GTGAAGCTCT GATGCGCTGG TTGTTACAGC AGCAGTATCG AACTATGCAC
CCCTATACGA ATGCTGCTCC CGGCGGATGG GCCTGGACGA ATCTTTCGGG GGGTGTTCCC
GATGCTGACG ATACCCCCGG AGCCATGCTG GCTCTGATGG AACTCGACCG GGTTTCTGTT
TCCTCGCAAG AGAGTCTTTC GATTGAACAG GCCCTCTATC AGGCTGCGCT GTGGCTGATC
AAGCTGCAGA ATCGCGATGG TGGCTGGCCG ACTTTCTGCC GAGGTTGGGG AGCACTCCCT
TTTGACCGCA GTTCGAATGA TATCACTGCC CATTGCCTGC GGGCGCTGAT TCAATATGAA
CGCAGGCTCA ATGACGTCAC TGTTGATGCC ACTGGCGATA CCACTTCCAG ACCACTCGCA
GTCGAAGTGC CATCCCCAAA GTTGCGAGAG CAGATGCAGC GAAGCATTCA GCAAGGCTTT
GAGTATCTCG AAAAGACGCA GCGGGAAGAT GGGTCTTGGC TGCCATTATG GTTTGGCAAT
CAGCACAGCC CCGATGATGA AAACCCCTTG TACGGCACGG CACGAGTGCT CCTGGCCTAT
GCTGATGCCG GTCTTGAAGG AAGCTCGGCA GCTTTACGCG GCTGTGACTG GCTGGTGAGA
CATCAACATG CGGATGGTGC CTGGGGGCCC GGTACATCCA TAGAAACTGC TGATACCTCC
GATGCCGAGT CCGATGTCGA AGGAGAACCC GCGAGTATCG AAGAGACGGC TCTCGCCTTG
ATGGCTCTTT GTCGCTTCGA CGCTACTCAT AACGTCCTGC ATCGCGGCGC TTCGTGGCTC
ATTACAAAGG TTGAAAACGA GACCTGGCGC GAACCGACGC CGATTGGTTT TTACTTTGCA
AAACTCTGGT ACTACGAAAA ACTCTATCCG CAGGTCTTTA CGGTCGGTGC ACTCAAAGCT
CTGGCACTGC GACTGGGTTC AGCGTTGACG ACAGTCTCGG AGAATGAACC TGCTCCCAGT
TCTGCGGAAC CACCGATTCC GCCCATTGCG ACTGATCGAG TGGCTGACTC AATGCATCTT
CAGCGAACAT CACCTTCGAT CAATCTAGCG AATGGGGGCA TCACCCTGGC TTGA
 
Protein sequence
MEDLTQKLQQ ALQLASRALL NERVRPGLAH WEGELSTSAL STATAVMALF QYAKCQQASG 
RLQKVFDGKS EGDWRLIEQG LAWLLQHQLA DGGWGDTDKS ISNISTTMLA HATLVACREA
VRQKSLVLNA SDIDAAIERS GRLIEELGGI QAIRDRYGKD HTFSVPILTH AALAGLVSWN
EIPALPYELA LLPHRFFEVI QLPVVSYALP ALIAIGQTLH LRQRTWNPWW WVRRAAIPGT
LQKLQSIQPE SGGFLEATPL TSFVTMCLAS VGRVDHPVTQ AGLKFIRDSV RPDGSWPIDT
NLATWVTTLS INHLGAEAFS SDEREALMRW LLQQQYRTMH PYTNAAPGGW AWTNLSGGVP
DADDTPGAML ALMELDRVSV SSQESLSIEQ ALYQAALWLI KLQNRDGGWP TFCRGWGALP
FDRSSNDITA HCLRALIQYE RRLNDVTVDA TGDTTSRPLA VEVPSPKLRE QMQRSIQQGF
EYLEKTQRED GSWLPLWFGN QHSPDDENPL YGTARVLLAY ADAGLEGSSA ALRGCDWLVR
HQHADGAWGP GTSIETADTS DAESDVEGEP ASIEETALAL MALCRFDATH NVLHRGASWL
ITKVENETWR EPTPIGFYFA KLWYYEKLYP QVFTVGALKA LALRLGSALT TVSENEPAPS
SAEPPIPPIA TDRVADSMHL QRTSPSINLA NGGITLA