Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_1444 |
Symbol | |
ID | 9138139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 1856139 |
End bp | 1859132 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003629477 |
Protein GI | 296121699 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.513817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGATT ACACCTATGC CGATGCCTCG TCACCCGGTT CTGATCCCAC GGGCGATCAC GGCTACTCCG AATACGAAGG CGTCTTTGAA AGTGGCACCG CTCACGATCT GGCGGAATGG AATGTCACCT TCTCGAATCC CATGAGCCTG ATCAATCAGG CACTGGCCCC GTACAACTAT TTCAACCTGC CGTTCATCCC TCAGGCCTTC ACCAATGCCA TGGGTGGCTA TGAACCGTCG TATCTGGGTG ATTGGCACTT GTCGAGTGAA GGCTGGGCCA GTGATACGCA GTATTACAAA TACTACGAGA ACGACCGGAC TGGGGATTGG TTTATTCAGT ATTGGGATCA CGAGCCAGAC CACGATGATC CACCACTTTA CACTGAGTAT GGAGGGCCAA GCCATGATGC GCTGTCTCAG GCTATGGGCC AAGCCCAAGC GGCTGCGCAG AAAAAGGCGA ATCAAACACC TCCAGACGAA GCTCCTCCAG CTCAAAGTGC TGATCAGAAC CGACTTGAAG AAGAAGGTTC CGGATCGGGT GGAGAATATA GTTCGAGTGA GCAGTCTGAA CCTCCGTCGC CGGGGGATGA GACGGGGCAG GCTCAAAATA ACGCCGAAAA CCGAGCCCAA GAGCGCCTTG AGTCGGCTGC ATCAGGAAAT AGTACCGAGG TCTCACCATC GAGTCAGGGT GGTGTCAGCA TCCCCTCAAC TCCAGATACA CCTGTATCAG GAGTGTCACA GCCAACGGCA CCTATTGCCC CTTCTGGTAA CGCCGGAGCG CCTGCACAGT CGGCTTGGGA TCAATTTGTA GGAGTTGCTA GTCAGGTAGC AAATGCACAC GCTCAATTGT CCGCAGGTGT TGTTCAGGCG TTCGTGGTCG ATGGATTCAT GGGTAGTGTT CAGGGAATCT ATGATCTCGC TAAAGGGGCC TACAATGCTG CGACACAGTA TGCAAGCTGG CAAATGAATC GGCTTTGGTC AAGTCCGTGG AAGGTTCTCG TCGATCCTTG GGGAGTCGGT GAAGGAATAG GAATTGCCGC AAAGAAAGTG TCAGAACTGA CTGCAAAGAT GCAGCCTTAT GTTGATGAAA TCTCATCACT ACCCTCTGAA ACTATTTGGA AGTTGTTTAT TGGAGACTTT GATGCTGTTC GAGGTCAAGT TTCCCCCACA CTCCTTCGAG CCGCTGAACT TGCTCACGGT TTATCTAGCA GTGTCTATGA ATCCATTGCG AGCAATCTGA CGATGGACAA GGCCCCTGGC CTGCTTGGTC GAGTTTTTGG CATGGTCTTG TATGAGATCG TTGAAGGTCT CGTCATAACG GGGATCACTG CAGGAGCAGG AGCGGCTGCA GTAGGGGCAA AGATTGCTAA GAAACTGACA CGTTTGTCAG ATCTTCCAGG GCTCGATTCG CCAAAGGTTC AAAAAGCAAT CAATGACGTG GTCGGTTTTC TGCAGTCCGG TGGAAAAGTC GATGGGCCCT CAAAGCCACC AGCGACACCC CACATGCCTA AATCCTCAGC TGAACTCGCA GACGAAGCGG CTGCACTCGC AAAGAAAAAA TGTGATGAAG CACGTGCTCT TGGAAAGGAG GGCGGCTGCT TTGTACCGGG CACATTAGTC TCTGTGACAA GCGTTGCTCT TCATGACAAA TCCATTTCAG ACTTTCTACT CGTAACGACA AACGGAATTT GCCCCTCTTC AGAACCTGGG GTAAGTGAGC AATCATTCTC ACGTTCAGTC GATGCTAAAG AATATGCATC ACTACCAATT GAAACGATTT CTTTAGGCTC ACGAGTAGTG GCGAGTAATC CCCATCCATG GGAGTTTGAT TCTCGCTGGG AAGAGCCTGA ATCTGACAGT TGGCGCACAA TACATCTGGA AATGTACAAA CAGGATGGCT CGGTTATTGA CTCACAGTTA CTGCGTCCGG TGGAGGTTGT TGAGTCGTTA AATCTGCATC CTGGACGCAT GATTCTCATT CATGCAGACG AACTGGATGT CGCAGGAATG GCCCTCATTA AATCTGTGAG CAATGCTCCA TCCATTGCCC AGGGAACTGG CCGAGTGGTA ATTGGTCGCT TTATGACCCG TGAGGTCAAC GAACTCATTC GGCTCACACT CTCCACTGGA GATGTCGTGG AGGGAACACC TAACCACCCA CTTTGGTCAG TCGATTTGAA TGACTGGCAA GCAATGGAGA TGTTCGAAGC AGGAGATTAT CTCGCTGGCC ATGAAGAAAA TGTGCTGGTT CTTTCAAAAG AACGTGTCGA GCACACCACT CCAGTCTACA ACCTCGAAGT TCACGGCGAA CACGTCTACC ATTTGACACA AGCGGGAATT CTGGCTCACA ATACATATCC AGAAGACACA CCAAATCCAC CTCCAACCAA AGACATAGTA CCGGAACCAC CAAGCCCACC CAAAGCCGAC GATATAGTGC CGGAGGTGAA GGGTGTCGAA CCGACGACAC GATATACACT CAAAGACAGA TTGAATTCGC CGGATGATGG CGCTCATAAA GAAGTCTTTA GTGTTATGGA AACAGATAAA ATTGCAGTTG GAATACTCAA AAAGGGAGCG TCTCCAGCAA TACTTGCTAA AGAAAGAGCT CTACTTAAGC AACTTGAAGA AGCGGGGCTT CCTGTTATGA AAAATCACGA AATCACAGAA TACAATGGGA GGCCGGCAAT TGTCATGGAT CGTTATGCGC TTGGTTCAAA GAAAATCGTA AAGTACAACT CTAAGATTCG GGATCTTGAG TCTATTGGAG GAAGTGTTCT TCTAAATGAA AAAAGCATCT CTGATCTAAT GGCGATAAAG TCAAAGATGA TGGAAAAGAA CATCAAGATC GATGAGCTCC AGTTTCTGAT TGGAGAAGAT GGATCGGTTG TCATCGCCGA CCCTCTCGAT GTGTATCCTC AGCCGCCTAC CAGTCGCAAT AAGAAAATGA TTGACATGCT GATTAGAAAG GCGATTGAGA ATACTAGGAT TTAA
|
Protein sequence | MVDYTYADAS SPGSDPTGDH GYSEYEGVFE SGTAHDLAEW NVTFSNPMSL INQALAPYNY FNLPFIPQAF TNAMGGYEPS YLGDWHLSSE GWASDTQYYK YYENDRTGDW FIQYWDHEPD HDDPPLYTEY GGPSHDALSQ AMGQAQAAAQ KKANQTPPDE APPAQSADQN RLEEEGSGSG GEYSSSEQSE PPSPGDETGQ AQNNAENRAQ ERLESAASGN STEVSPSSQG GVSIPSTPDT PVSGVSQPTA PIAPSGNAGA PAQSAWDQFV GVASQVANAH AQLSAGVVQA FVVDGFMGSV QGIYDLAKGA YNAATQYASW QMNRLWSSPW KVLVDPWGVG EGIGIAAKKV SELTAKMQPY VDEISSLPSE TIWKLFIGDF DAVRGQVSPT LLRAAELAHG LSSSVYESIA SNLTMDKAPG LLGRVFGMVL YEIVEGLVIT GITAGAGAAA VGAKIAKKLT RLSDLPGLDS PKVQKAINDV VGFLQSGGKV DGPSKPPATP HMPKSSAELA DEAAALAKKK CDEARALGKE GGCFVPGTLV SVTSVALHDK SISDFLLVTT NGICPSSEPG VSEQSFSRSV DAKEYASLPI ETISLGSRVV ASNPHPWEFD SRWEEPESDS WRTIHLEMYK QDGSVIDSQL LRPVEVVESL NLHPGRMILI HADELDVAGM ALIKSVSNAP SIAQGTGRVV IGRFMTREVN ELIRLTLSTG DVVEGTPNHP LWSVDLNDWQ AMEMFEAGDY LAGHEENVLV LSKERVEHTT PVYNLEVHGE HVYHLTQAGI LAHNTYPEDT PNPPPTKDIV PEPPSPPKAD DIVPEVKGVE PTTRYTLKDR LNSPDDGAHK EVFSVMETDK IAVGILKKGA SPAILAKERA LLKQLEEAGL PVMKNHEITE YNGRPAIVMD RYALGSKKIV KYNSKIRDLE SIGGSVLLNE KSISDLMAIK SKMMEKNIKI DELQFLIGED GSVVIADPLD VYPQPPTSRN KKMIDMLIRK AIENTRI
|
| |