Gene Plim_1444 details

Gene Information

Locus tag: Plim_1444
Symbol:
ID: 9138139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1856139
End bp: 1859132
Gene Length: 2994 bp
Protein Length: 997 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003629477
Protein GI: 296121699
COG category:
COG ID:
TIGRFAM ID:
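
The start, end, gene length, and protein length values above agree with each other if the coordinates are read as 1-based and inclusive and the terminal stop codon is not counted in the protein length. A minimal sketch of that arithmetic follows. Both conventions are assumptions, not something the record states, and the function names are illustrative rather than part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a single
    terminal stop codon that is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1856139, 1859132)
print(length)                     # 2994
print(protein_length_aa(length))  # 997
```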


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.513817
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGATT ACACCTATGC CGATGCCTCG TCACCCGGTT CTGATCCCAC GGGCGATCAC 
GGCTACTCCG AATACGAAGG CGTCTTTGAA AGTGGCACCG CTCACGATCT GGCGGAATGG
AATGTCACCT TCTCGAATCC CATGAGCCTG ATCAATCAGG CACTGGCCCC GTACAACTAT
TTCAACCTGC CGTTCATCCC TCAGGCCTTC ACCAATGCCA TGGGTGGCTA TGAACCGTCG
TATCTGGGTG ATTGGCACTT GTCGAGTGAA GGCTGGGCCA GTGATACGCA GTATTACAAA
TACTACGAGA ACGACCGGAC TGGGGATTGG TTTATTCAGT ATTGGGATCA CGAGCCAGAC
CACGATGATC CACCACTTTA CACTGAGTAT GGAGGGCCAA GCCATGATGC GCTGTCTCAG
GCTATGGGCC AAGCCCAAGC GGCTGCGCAG AAAAAGGCGA ATCAAACACC TCCAGACGAA
GCTCCTCCAG CTCAAAGTGC TGATCAGAAC CGACTTGAAG AAGAAGGTTC CGGATCGGGT
GGAGAATATA GTTCGAGTGA GCAGTCTGAA CCTCCGTCGC CGGGGGATGA GACGGGGCAG
GCTCAAAATA ACGCCGAAAA CCGAGCCCAA GAGCGCCTTG AGTCGGCTGC ATCAGGAAAT
AGTACCGAGG TCTCACCATC GAGTCAGGGT GGTGTCAGCA TCCCCTCAAC TCCAGATACA
CCTGTATCAG GAGTGTCACA GCCAACGGCA CCTATTGCCC CTTCTGGTAA CGCCGGAGCG
CCTGCACAGT CGGCTTGGGA TCAATTTGTA GGAGTTGCTA GTCAGGTAGC AAATGCACAC
GCTCAATTGT CCGCAGGTGT TGTTCAGGCG TTCGTGGTCG ATGGATTCAT GGGTAGTGTT
CAGGGAATCT ATGATCTCGC TAAAGGGGCC TACAATGCTG CGACACAGTA TGCAAGCTGG
CAAATGAATC GGCTTTGGTC AAGTCCGTGG AAGGTTCTCG TCGATCCTTG GGGAGTCGGT
GAAGGAATAG GAATTGCCGC AAAGAAAGTG TCAGAACTGA CTGCAAAGAT GCAGCCTTAT
GTTGATGAAA TCTCATCACT ACCCTCTGAA ACTATTTGGA AGTTGTTTAT TGGAGACTTT
GATGCTGTTC GAGGTCAAGT TTCCCCCACA CTCCTTCGAG CCGCTGAACT TGCTCACGGT
TTATCTAGCA GTGTCTATGA ATCCATTGCG AGCAATCTGA CGATGGACAA GGCCCCTGGC
CTGCTTGGTC GAGTTTTTGG CATGGTCTTG TATGAGATCG TTGAAGGTCT CGTCATAACG
GGGATCACTG CAGGAGCAGG AGCGGCTGCA GTAGGGGCAA AGATTGCTAA GAAACTGACA
CGTTTGTCAG ATCTTCCAGG GCTCGATTCG CCAAAGGTTC AAAAAGCAAT CAATGACGTG
GTCGGTTTTC TGCAGTCCGG TGGAAAAGTC GATGGGCCCT CAAAGCCACC AGCGACACCC
CACATGCCTA AATCCTCAGC TGAACTCGCA GACGAAGCGG CTGCACTCGC AAAGAAAAAA
TGTGATGAAG CACGTGCTCT TGGAAAGGAG GGCGGCTGCT TTGTACCGGG CACATTAGTC
TCTGTGACAA GCGTTGCTCT TCATGACAAA TCCATTTCAG ACTTTCTACT CGTAACGACA
AACGGAATTT GCCCCTCTTC AGAACCTGGG GTAAGTGAGC AATCATTCTC ACGTTCAGTC
GATGCTAAAG AATATGCATC ACTACCAATT GAAACGATTT CTTTAGGCTC ACGAGTAGTG
GCGAGTAATC CCCATCCATG GGAGTTTGAT TCTCGCTGGG AAGAGCCTGA ATCTGACAGT
TGGCGCACAA TACATCTGGA AATGTACAAA CAGGATGGCT CGGTTATTGA CTCACAGTTA
CTGCGTCCGG TGGAGGTTGT TGAGTCGTTA AATCTGCATC CTGGACGCAT GATTCTCATT
CATGCAGACG AACTGGATGT CGCAGGAATG GCCCTCATTA AATCTGTGAG CAATGCTCCA
TCCATTGCCC AGGGAACTGG CCGAGTGGTA ATTGGTCGCT TTATGACCCG TGAGGTCAAC
GAACTCATTC GGCTCACACT CTCCACTGGA GATGTCGTGG AGGGAACACC TAACCACCCA
CTTTGGTCAG TCGATTTGAA TGACTGGCAA GCAATGGAGA TGTTCGAAGC AGGAGATTAT
CTCGCTGGCC ATGAAGAAAA TGTGCTGGTT CTTTCAAAAG AACGTGTCGA GCACACCACT
CCAGTCTACA ACCTCGAAGT TCACGGCGAA CACGTCTACC ATTTGACACA AGCGGGAATT
CTGGCTCACA ATACATATCC AGAAGACACA CCAAATCCAC CTCCAACCAA AGACATAGTA
CCGGAACCAC CAAGCCCACC CAAAGCCGAC GATATAGTGC CGGAGGTGAA GGGTGTCGAA
CCGACGACAC GATATACACT CAAAGACAGA TTGAATTCGC CGGATGATGG CGCTCATAAA
GAAGTCTTTA GTGTTATGGA AACAGATAAA ATTGCAGTTG GAATACTCAA AAAGGGAGCG
TCTCCAGCAA TACTTGCTAA AGAAAGAGCT CTACTTAAGC AACTTGAAGA AGCGGGGCTT
CCTGTTATGA AAAATCACGA AATCACAGAA TACAATGGGA GGCCGGCAAT TGTCATGGAT
CGTTATGCGC TTGGTTCAAA GAAAATCGTA AAGTACAACT CTAAGATTCG GGATCTTGAG
TCTATTGGAG GAAGTGTTCT TCTAAATGAA AAAAGCATCT CTGATCTAAT GGCGATAAAG
TCAAAGATGA TGGAAAAGAA CATCAAGATC GATGAGCTCC AGTTTCTGAT TGGAGAAGAT
GGATCGGTTG TCATCGCCGA CCCTCTCGAT GTGTATCCTC AGCCGCCTAC CAGTCGCAAT
AAGAAAATGA TTGACATGCT GATTAGAAAG GCGATTGAGA ATACTAGGAT TTAA
 
Protein sequence
MVDYTYADAS SPGSDPTGDH GYSEYEGVFE SGTAHDLAEW NVTFSNPMSL INQALAPYNY 
FNLPFIPQAF TNAMGGYEPS YLGDWHLSSE GWASDTQYYK YYENDRTGDW FIQYWDHEPD
HDDPPLYTEY GGPSHDALSQ AMGQAQAAAQ KKANQTPPDE APPAQSADQN RLEEEGSGSG
GEYSSSEQSE PPSPGDETGQ AQNNAENRAQ ERLESAASGN STEVSPSSQG GVSIPSTPDT
PVSGVSQPTA PIAPSGNAGA PAQSAWDQFV GVASQVANAH AQLSAGVVQA FVVDGFMGSV
QGIYDLAKGA YNAATQYASW QMNRLWSSPW KVLVDPWGVG EGIGIAAKKV SELTAKMQPY
VDEISSLPSE TIWKLFIGDF DAVRGQVSPT LLRAAELAHG LSSSVYESIA SNLTMDKAPG
LLGRVFGMVL YEIVEGLVIT GITAGAGAAA VGAKIAKKLT RLSDLPGLDS PKVQKAINDV
VGFLQSGGKV DGPSKPPATP HMPKSSAELA DEAAALAKKK CDEARALGKE GGCFVPGTLV
SVTSVALHDK SISDFLLVTT NGICPSSEPG VSEQSFSRSV DAKEYASLPI ETISLGSRVV
ASNPHPWEFD SRWEEPESDS WRTIHLEMYK QDGSVIDSQL LRPVEVVESL NLHPGRMILI
HADELDVAGM ALIKSVSNAP SIAQGTGRVV IGRFMTREVN ELIRLTLSTG DVVEGTPNHP
LWSVDLNDWQ AMEMFEAGDY LAGHEENVLV LSKERVEHTT PVYNLEVHGE HVYHLTQAGI
LAHNTYPEDT PNPPPTKDIV PEPPSPPKAD DIVPEVKGVE PTTRYTLKDR LNSPDDGAHK
EVFSVMETDK IAVGILKKGA SPAILAKERA LLKQLEEAGL PVMKNHEITE YNGRPAIVMD
RYALGSKKIV KYNSKIRDLE SIGGSVLLNE KSISDLMAIK SKMMEKNIKI DELQFLIGED
GSVVIADPLD VYPQPPTSRN KKMIDMLIRK AIENTRI
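
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation such as re-deriving the GC content field. A minimal sketch, assuming the blocks are separated only by whitespace and that GC content means the percentage of G and C bases over the full sequence; the helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Join a whitespace-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input, not the gene above: three blocks across two lines.
toy = clean_sequence("ATGC GGCC\nTTAA")
print(toy)              # ATGCGGCCTTAA
print(gc_percent(toy))  # 50.0
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared against the GC content field in the record.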