Gene Information |
Locus tag | Plim_4003 |
Symbol | |
ID | 9140723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 5138069 |
End bp | 5141140 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | protein of unknown function DUF1355 |
Protein accession | YP_003632013 |
Protein GI | 296124235 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
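The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Plim_4003.
length_bp = gene_length(5138069, 5141140)
print(length_bp, protein_length(length_bp))
```

With the values in this record the check gives 3072 bp and 1023 aa, matching the Gene Length and Protein Length fields.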
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0902905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCAAC AAGCACAGTA CCTGCTGATG CGACTTTTTC CACCGGCTCG CAAGCCAGTG ACGCCCTGGA CTGTTGCGCC ATTACTCATC TCGATGGTCT TGACCGTGGG CATTGCCTTC TGGCTGGAGG CCAGTCGGGT GTTACTGTTG ACCCGGCCAT GGTTACTCTC GCTGGTGGTC TTGAGTGTGT GGGTCTGGTG GCTGCACATG GCGGGGATGC CGGGCTTACC GTGGCTGCGA TCCTGGATGG CACTGTGGGT TCGTCTGGCG ATGGTCGGAA TCCTCGCATT TCTGCTGGCT GAACCCCGCG CTGTTCGTGA GAACGACCGA CAATCACTGA TGTATGTGCT GGATACGTCC GATTCGATCG GCCGCTCAGC CAAGGATCAG GTGCTGCGCT ATATTGCAGA AACTGTCACG AAAAAACCGG CTCGCGATGA AGCCGGGCTG TCAGTCTTCG GGCGTAACGC GGCTGTGGAA TTACCACCTC GTACCACGTT TCTGGCCGAG GCACTCAATA CCGATATTCG TGGTGATGCC ACCAATATCG AGCAGGCACT TTCCCTTTCC AGCGCCATGC TGCCGGATGA TCAGGCGGGG AAGATTGTGC TGTTTTCCGA TGGATCTCAG ACTGAAGGGA GTCTCGACCG TATTCTCGAT GAACTGAAAT CCCGTAAGAT CTCTGTCGAT GTGGTGCCTA TTGAATATGA CTACGAACAT GAGGTCTGGG TCGAGCGAAT TGATCTTCCG AGCAACGTCA AGATTGGCGA GACCTATGAA GCGGCGGTGA TCGTCTCGGC CTTGTCGGCT GGTCAGGGCA AGCTGGTTGT TCGCGAGAAC GGCCAGCCGA TTGTGGAAGA AACGATTTCG TATCGCGAAG GGAAGACACG CCTGGCAGTC CCGCTCGCCT TACGCCGGCC CGGATACTAT GAATATACCG CCACCATTGA ACCCGAGGCA GAAGCTGACA GCCTGGCGCA AAACAATATG GCCATGGGTG GGATTGTCGT CGAAGGGGAG GGGAAAATCC TCGTTGTTTA CGATCCCACG GGGAATCCAC TCGATTGGGA ACCACTGGTC GAGTCTCTCA ATAAGGCCAA AAAACAAGTT GATGTGATGG CCGGAGTCGA CTTTCCGCGA GACCCTTCCT CACTGATTCC TTACGACTCG ATTTTGTTTG TGAATGTCCC TGCCAATGAG TTCGATGGCG TTCAATTGCA GGCACTCAAA GACAGTGTTT TCGATCTGGG AACTGGCTTT CTGATGGTCG GTGGGCCGGG GAGCTTTGGC CCCGGGGGAT ATCACCGGAC GGCTGTCGAA GAGATTCTTC CCGTCACGAT GGATATCACA CAGAAGAAGG TGCTTCCTAA GGGAGCACTG GCCATCATTC TGCATACCTG TGAATTCCCG GAGGGCAATA CCTGGGGCAA GCGAATCACC AAGCAGGCCA TTAAGGTTCT GGGCGAACAG GATGAAGTGG GCGTTCTGGC CTATGACTAC AACGATGGTG AGAAATGGAT TTTTGAACTC ACACCCGCAG GAAAGTACGA AGAGCTGTCG TTACTGATTA ACTCAGCTGA GATTGGGGAT ATGCCCAGTT TTCAGCAGAC GATGCAGATG GGTATCGATG GACTCGAAGC GAGCGATGCT TCGTCGAAAC ATATGATCAT CATTTCCGAT GGAGATCCTT CACCAGCCTC GCCCGATCTC TTGAAGCGAT TTATTGACGC GAAGGTGACC ATCAGCACGG TCGCTGTCTT TCCACACGGA GATGTGGATA CGCCGACGAT GACATCGATC 
GCACAGATTA CTGGCGGGCG TTATTACAAG CCGACCAATC CGAATCAGCT ACCAGCGATC TTCATCAAAG AATCGAAGAC ACTCCGCCGG TCGATGCTTC AAAACCGCGA TTTCTTCCCG GAAGTTGCTT CGAGTTCGCC AGTTTTGAAA GGCATCAGTT CATTACCGGA GTTGAAAGGG TATGTACTCA CGACCGCCAA GCCCGATGCT CAGGTTGTGC TCAAAGTTCC GCCCGGTTCG AAAGAGGAAG AGTCGCAGCT GGATCCACTT CTGGCGATTC GCCAGCACGG GTTAGGGAAG ACGGCGGCTT TCACTTCGGA ACTTGGCAAG AACTGGGGAA AGGACTGGGT GGCATGGGGC AAGTATGAGG ATTTCCTCAA TCAGCTCACC ACGGATATCG CCCGCATCCG CAAAGACACA CAACTCCGCT TGAGCACGTA TGTCGAAGGA GCGCAGGGAG TCGTTATTGT CGAAGATTTT GCCCCGGAAG AGGGCTTTCT GGAAATCTCC GGACGCGTCG GTGGCCCGAA CGATCGTTCG GAAAGCCTCA CTTTCCGGCA GGTGGGGCCG CGTCGCTATC AGGCGCTGGT TCCGCTGTGG GGGCAAGGCC GCTACTACGT TTCGGTGGCA GGTGCGGGAA CAAAAATCGG CGTGGACGGT CAACCGGCTG AGCGGAAGGA ATCGACGTTT GGCGGATTCG TGCTGGCCTA CTCGCCGGAA TATCTGCGGT TTGGATCGAA CCGGCAGTTA CTCGAAGAGA TTGCCCAAAG GACAGGTGGG CGCGTCTTGA CGGGTGATCC AGAAAGTGAC GAACTCTTCC CGAAAGAGCG CGAACCCCGC CAGAGTTCAC GTCCGATTTT TGACTGGTTT CTTGTGGCGC TGGCCTGTCT TGTCCCTCTC GATGTCGGTT TGAGGCGCAT TCAGTGGGAT TGGTCTGTCG TGGCAGGCTG GTTCAGACCC CGCCGGGAAG TCACCTCGAC AATGTCAACT TTGCTCGATC AGAAAAAGTC CGGTTCGCAG CAGACAACCA CTGAGACCGG CAAACCTGCC GCTGAGGCAT CGTCATCGCG GAAAACACCA CCACAACGGC CACCCGTCAT TCGCAAGCCA CCGATGACTC TACCGCCATC ACCATCTGCA AAGACTCCCC CCACTTCAGA AAAAACGCAA ACCGAAAAGC CGGCACCAGG TGCTGCCAAA TCGACTTATG AAAAACTGCT GGAGATCAAG CGACAACAGC AGAAGAAAGA CGAACCACCC GCGAAAGATT AA
|
Protein sequence | MWQQAQYLLM RLFPPARKPV TPWTVAPLLI SMVLTVGIAF WLEASRVLLL TRPWLLSLVV LSVWVWWLHM AGMPGLPWLR SWMALWVRLA MVGILAFLLA EPRAVRENDR QSLMYVLDTS DSIGRSAKDQ VLRYIAETVT KKPARDEAGL SVFGRNAAVE LPPRTTFLAE ALNTDIRGDA TNIEQALSLS SAMLPDDQAG KIVLFSDGSQ TEGSLDRILD ELKSRKISVD VVPIEYDYEH EVWVERIDLP SNVKIGETYE AAVIVSALSA GQGKLVVREN GQPIVEETIS YREGKTRLAV PLALRRPGYY EYTATIEPEA EADSLAQNNM AMGGIVVEGE GKILVVYDPT GNPLDWEPLV ESLNKAKKQV DVMAGVDFPR DPSSLIPYDS ILFVNVPANE FDGVQLQALK DSVFDLGTGF LMVGGPGSFG PGGYHRTAVE EILPVTMDIT QKKVLPKGAL AIILHTCEFP EGNTWGKRIT KQAIKVLGEQ DEVGVLAYDY NDGEKWIFEL TPAGKYEELS LLINSAEIGD MPSFQQTMQM GIDGLEASDA SSKHMIIISD GDPSPASPDL LKRFIDAKVT ISTVAVFPHG DVDTPTMTSI AQITGGRYYK PTNPNQLPAI FIKESKTLRR SMLQNRDFFP EVASSSPVLK GISSLPELKG YVLTTAKPDA QVVLKVPPGS KEEESQLDPL LAIRQHGLGK TAAFTSELGK NWGKDWVAWG KYEDFLNQLT TDIARIRKDT QLRLSTYVEG AQGVVIVEDF APEEGFLEIS GRVGGPNDRS ESLTFRQVGP RRYQALVPLW GQGRYYVSVA GAGTKIGVDG QPAERKESTF GGFVLAYSPE YLRFGSNRQL LEEIAQRTGG RVLTGDPESD ELFPKEREPR QSSRPIFDWF LVALACLVPL DVGLRRIQWD WSVVAGWFRP RREVTSTMST LLDQKKSGSQ QTTTETGKPA AEASSSRKTP PQRPPVIRKP PMTLPPSPSA KTPPTSEKTQ TEKPAPGAAK STYEKLLEIK RQQQKKDEPP AKD
|
| |
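The protein sequence can be reproduced from the gene sequence with the codon assignments of translation table 11, which are the same as those of the standard code. The following is a minimal sketch, assuming the gene sequence is already given in coding orientation (as it appears in this record, starting with ATG) and ignoring the alternative start codons that table 11 permits; `translate` is a hypothetical helper, not a tool used by the database.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, paired with the
# amino acid string for NCBI translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base groups used in the record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGTGGCAAC AAGCACAGTA CCTGCTGATG"))
```

Applied to the first three 10-base groups of the gene sequence, this yields `MWQQAQYLLM`, the first ten residues of the protein sequence listed in the record.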