Gene Plim_4054 details


Gene Information

Locus tag: Plim_4054
Symbol:
ID: 9140774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 5205947
End bp: 5208076
Gene Length: 2130 bp
Protein Length: 709 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: protein of unknown function DUF87
Protein accession: YP_003632064
Protein GI: 296124286
COG category:
COG ID:
TIGRFAM ID:
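The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5205947
end_bp = 5208076
gene_length_bp = 2130
protein_length_aa = 709

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions the computed span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.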


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0459328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCAACTT CACCAGTCGA ACATCTCTCT GGCCTCGTAG TCGGTGTCGT AGAGTCAGTT 
GCTCCTGATC AAGTGCGGGT GATGCTTGAA CTCGACACTC CGCACACGAC GGCATTGAAC
ACTGGGTCGC CTGTCGCGTT TCCGCGCCTG AACGGTTATG TGCTGATTCC CCATGAAGCG
GGGGCGACGG TTGCTTACAT TTCCTGGATA GGAATTGAGC GTTCACCATT CCCGAAACGA
TCTGGCCTCA AGGACTTTGG CTTGATCGAC CTTCCATTTC CGTTGCGAAA AATGGCAGTT
TCACCAGTCG CAACGCTGAC ATGCAAACGC GACAAGACCT CGCGCACCCA GTATGTGCTC
TCCCGTGGCG TTGTCGCCTT TCCCTCTGTC GGGGATCAGG TGTTGATTCC CACTGCCGAA
CAAATTGAAG CAATCGTTGG GGCGAAGGAT ACGGATAGGC GCGTCAAGAT CGGCGTGTCG
CCCCTCGGGG CAAGTACGAA AATCATGGTC GACCCAGACA AGCTGTTCGG GCGGCACCTC
GCGGTGCTGG GGAATACCGG TAGCGGGAAG TCATGCACGG TTGCCGGATT GATTCGCTGG
TCGATGGATG CAGCCAAAAA GCAGATGTCC GAAACCGGCA AGACGGGACG TCCCAATGCC
CGCTTTATCG TTCTCGATCC TAACGCTGAA TACTCGAATG CTTTTCGGGA CGACCCTCAA
AATGTCCGTC TTTTTAAAGT TCCGCCAGTC TCCGGAGATG ATCGTGCTTT GCAGGTGCCA
GCCTGGATGT GGTGCGGACA TGAATGGACG GCCATTTCCA ACGCTCAACC CGGTGCCCAA
CGGCCTTTGT TGATGCAAGG GTTAAGAGAT TTGAAGAGTG GCTCTGTTTC GCGGGGATCG
CGTGAAGCAA TATTACGACG CTATGTGATC TCTTACATGG TGCGAGTTTC GGAGATGTTG
AGCCGTGGGA CCATAGCGTT TGCTGGTTCT CCTCGACCTC GTTTTGAATG CGCTGGTTTG
CTGAATGGGA TAGCGAAGGA CTGTCAAGCA TGGTCGGGTG ATCTTGAAGG TCAAGCACAA
ACGTTAATGC AGAACGCTGC TTCGGCTGCA TTACAAATCG AGCAATCTAG GAAATCGGGC
CAATACTACA ACGACTTCCT CGTATCAGAC TTGGAGAGTA TTCGATCTTC GCTAGAGGAT
TGTGCAAAAG TATTGCCCGA TGTTGCTCCG GAAGGGCCTA TCAGCGAAGA CTCGCCAAGC
TACTTCGACG TGAATATTCT CGCTGACCAT CTTGAACGTA TAGCCGTTGA GCAAGGTGCC
GGTGTGGCGG GCTTCGTTGC GACTCTCGGT TTGCGTATTC GAGCAATGTT GGCGGATCAG
CATCTCGGGG CTGTCGTCAA TGGAAATCCT ACGTTTGAGG CGTGGCTTGA AGAGTATGTG
GGGGCAGATA ATGCGTCGAA TGGAAATGTG GCGATCATTG ATCTTTCTCT CATCCCAAGT
GAAGTTGTCC ATATCGTAGT CGCCGTGTTG GGCCGACTTG TGTTTGAGTC ACTTCAGCGT
TATCGGCGCG ATAATGCCGC CGGTGAGTCA CTTCCGACCG TCCTAGTACT TGAAGAGGCG
CATACATTCG TGCGTAAAGG GCATGAAGAA TCCTCCGGCA CGGCAACAGC TACTGCACTA
TGTCGGGAGA CATTTGAGAA AATTGCCCGC GAAGGACGCA AGTTCGGACT CGGACTCGTT
GTGTCGTCAC AAAGGCCCTC AGAATTGTCA GCTACCGTCC TGGCGCAGTG CAACACATTC
ATTTTGCATC GCATCGTAAA TGATGCTGAC CAGCACCTCG TTGGCAAGCT TGTGCCTGAC
AATGTCGCAG GATTGTTGGC CGAGCTTCCA AGTCTGCCCT CGCGTCAAGC GATTTTGCTT
GGCTGGGCAA CACCAATTCC GATTCTTGTA GAAATTGACG AGTTGCGCGC GGATCAGCGG
CCACATTCAT CTGACCCTGA TTTCTGGGAT GTATGGACGC ACGAAAAGCC TCGCGATTTG
GATTGGAAAG AAGTGGTTGG GGACTGGGTA GGACAGGTCA AAGTCGATGA AACTGAAGAT
GGAGAGTTGC TAGAAAATGT AGGCGAGTGA
 
Protein sequence
MATSPVEHLS GLVVGVVESV APDQVRVMLE LDTPHTTALN TGSPVAFPRL NGYVLIPHEA 
GATVAYISWI GIERSPFPKR SGLKDFGLID LPFPLRKMAV SPVATLTCKR DKTSRTQYVL
SRGVVAFPSV GDQVLIPTAE QIEAIVGAKD TDRRVKIGVS PLGASTKIMV DPDKLFGRHL
AVLGNTGSGK SCTVAGLIRW SMDAAKKQMS ETGKTGRPNA RFIVLDPNAE YSNAFRDDPQ
NVRLFKVPPV SGDDRALQVP AWMWCGHEWT AISNAQPGAQ RPLLMQGLRD LKSGSVSRGS
REAILRRYVI SYMVRVSEML SRGTIAFAGS PRPRFECAGL LNGIAKDCQA WSGDLEGQAQ
TLMQNAASAA LQIEQSRKSG QYYNDFLVSD LESIRSSLED CAKVLPDVAP EGPISEDSPS
YFDVNILADH LERIAVEQGA GVAGFVATLG LRIRAMLADQ HLGAVVNGNP TFEAWLEEYV
GADNASNGNV AIIDLSLIPS EVVHIVVAVL GRLVFESLQR YRRDNAAGES LPTVLVLEEA
HTFVRKGHEE SSGTATATAL CRETFEKIAR EGRKFGLGLV VSSQRPSELS ATVLAQCNTF
ILHRIVNDAD QHLVGKLVPD NVAGLLAELP SLPSRQAILL GWATPIPILV EIDELRADQR
PHSSDPDFWD VWTHEKPRDL DWKEVVGDWV GQVKVDETED GELLENVGE
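The two sequences are related through translation table 11, the bacterial, archaeal and plant plastid code. The sketch below illustrates this on the first and last lines of the gene sequence. It assumes that table 11 uses the same amino-acid assignments as the standard code and that the initiator codon (GTG here) is read as methionine. The helper names `join_blocks` and `translate` are hypothetical.

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks between the 10-character blocks."""
    return "".join(text.split()).upper()


def translate(dna, initiator=True):
    """Translate codon by codon, stopping at the first stop codon.

    If initiator is True, the first codon is reported as M regardless of
    its usual amino acid (assumption: it is the start codon of the CDS).
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if initiator and pos == 0 else amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "GTGGCAACTT CACCAGTCGA ACATCTCTCT"
last_line = "GGAGAGTTGC TAGAAAATGT AGGCGAGTGA"

print(translate(join_blocks(first_line)))                   # MATSPVEHLS
print(translate(join_blocks(last_line), initiator=False))   # GELLENVGE
```

The first ten residues and the last nine residues agree with the start and end of the protein sequence listed above. The same functions can be applied to the full gene sequence after joining all of its lines.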