Gene Plim_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2089 
Symbol 
ID9138792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2710862 
End bp2712208 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotein of unknown function DUF1501 
Protein accessionYP_003630115 
Protein GI296122337 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.488843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAT TTTTGAATCG TCGTGATTTC AACCAGTACA CTGCACTGGG TGGGGCAGCA 
GCTTTATCGG CAGGGTTACC CTTTGGATTG TCAGCAGGAA TACAAGGGGT TGGATCTGCG
TCTTCTCTAT TGGCTGAGGA GCATGCAAAA GTCAGCTTTC CGATGGGGAA AGCCGAGCAT
TGTGTCATGA TCTGGCTCGG TGGTGGAGCC GGTCAGATCG ATACCTGGGA TCCCAAAGTG
AAAGGCGATC CCAAAGCGAA TAAAGCGGGT TCGTATTATG GCAAGATTCA GACGGCTATC
CCAGGGGTGG AAGTCTGCGA ACACCTCTCG CGCTGTGCAC CGATCATGGA TCGATTCACA
TTATTTCGAA CCGTGCATCA CGATGTGATT GACGAGCATG CGGCAGCGAC GAATCGTATG
CATACAGGCC GCCCTGTCAG TGAAACGGTG ATTTACCCTT CGGTCGGTTC TGTCATTGCT
CATCAGCGTG GTGCGGCAGG TGATGGTGTG CCTGCGTATG TTCTCATTGG GTATCCCAGC
ACGACGAGAG GCCCGGGATT TCTGGGCAGC AAAGGGAACT ATGTCTATCT GACAGATACC
GAAAGTGGCC CCCAGGGCTT TCAGCCAGCT TCGGTGATTC GCCAGGAGCG GCAAGCACGT
CGTAACGAAT TGCTGAAGAA AGTTCGCCAG CTCAATACCA CTGAAGAAAA ACAGGCTTTA
CTGAAAAATT ACGAGTCGAT GATTGATGAA GCTCAGCGGC TGGCTGGCCC GCAGTTCATG
CGGATCTTTG ATTTGAAATC CGAATCTGCC GACCTTCGTA ATGAATATGG TGGCGAGTTT
GGGCAGCGCT GCCTGCTGAC CCGCCGCTTA CTGCAGTCAG GGGTGCGGTT TGTCGAAGTC
TCGCATAACT TGAACTTTCT CAACGGTACT GGCTGGGATG TCCATAATGA TGGGATCGTC
CAGCAGCATC GACTGATTCA GGAACTCGAT CAGGCGCTCG CAGCCCTCGT GCTCGACCTG
GAGCGAAACA AACTTCTCGA TAAAACGTTG ATTGTGGTTT CAACAGAGTT TGGACGACCT
GCCAAATTCG ATGGTGGCGG CGGGCGCGGG CATCATGGCA AATGCTTTTC GGTCGCTTGT
GCAGGTGGCG GGATCAAGAC CGGCGTGGCG ATTGGTGAGA CTGATGATCT GGCGATGAAC
ATCGTCACTA GACCAGTTTC CGTACCCGAC CTGCATGCCA CCATGTACGC AGTATGTGGC
GTGAATCCTC GGGAAGAACT GTATGCAGGT GAGCGTCCTG TTCCTATTAC AGATGGTGGT
ACCCCCGTGC TGGAACTCTT CTCGTGA
 
Protein sequence
MNPFLNRRDF NQYTALGGAA ALSAGLPFGL SAGIQGVGSA SSLLAEEHAK VSFPMGKAEH 
CVMIWLGGGA GQIDTWDPKV KGDPKANKAG SYYGKIQTAI PGVEVCEHLS RCAPIMDRFT
LFRTVHHDVI DEHAAATNRM HTGRPVSETV IYPSVGSVIA HQRGAAGDGV PAYVLIGYPS
TTRGPGFLGS KGNYVYLTDT ESGPQGFQPA SVIRQERQAR RNELLKKVRQ LNTTEEKQAL
LKNYESMIDE AQRLAGPQFM RIFDLKSESA DLRNEYGGEF GQRCLLTRRL LQSGVRFVEV
SHNLNFLNGT GWDVHNDGIV QQHRLIQELD QALAALVLDL ERNKLLDKTL IVVSTEFGRP
AKFDGGGGRG HHGKCFSVAC AGGGIKTGVA IGETDDLAMN IVTRPVSVPD LHATMYAVCG
VNPREELYAG ERPVPITDGG TPVLELFS