Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_4099 |
Symbol | |
ID | 9140819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 5256441 |
End bp | 5258186 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | NHL repeat containing protein |
Protein accession | YP_003632109 |
Protein GI | 296124331 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.649178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACTT GTTCTAACCG AGCCTGGTTT CGACATTTTG AGAGTTGGCC GCAAAGATGC GGAGTCTGGC GATGGGATGG GAATGCTCTG GTAGGGTGCC TGGCTGCGAT CTTACTGTGT CTATTTTTCG TGCCTGCGGT TGCTCAGGAA ACGTCTGACG GGCAATCACC GGCAGCGAGT ACGCCTAAGC CAGCACCAGA AGCCGGCAAG CCAGAAAATC CGTTTCCGAA TCGCATTCCT GCCCCTTCGC TCGATGGGGG AATCGAGTGG CTCAACACCA GCCAGCCCCT GTCACTGAAG GATCTGCGCG GGAAAGTGGT GGTGCTCGAC TTCTGGACGT ACTGCTGCAT CAACTGCATT CATGTGCTGC CTGATCTGAA GTATCTCGAA AAGAAGTATG GCAAAGAGCT GGTGGTCATC GGTGTTCACT CCGCCAAGTT TGATAACGAG AAAGAGTCCG GGAACATTCG CAAAGCCATC TTGCGTTACG AGATTGAGCA TCCCGTCGTC AACGATGCGG AGATGACCAT CTGGCGGAAG TTCAGTATTC GGTCGTGGCC TTCTCTCGTG TTGATTGATC CTGAGGGGCA GTTTTGTGGT GTCGCTTCCG GCGAGGGAAA TCGCGAACTG CTGGATCAAG TGATTGCCAA AGTCATCGAT TATCATAGGG CGAAGGGGAC GCTGAACGAA AAGCCGATGG CTTTCGATCT CGAAAGCGGC AAAGAAGCAG CCACTCCTTT GCGATTCCCT GGCAAGCTGC TCGTTGATCC GGCCCATGAG AGAGTCTTTA TTTCAGACAG CAATCATAAT CGCATCGTCG TGGCATCGCT GGCCGGTCAA CTCCTCAAGG TGATTGGAAG TGGAAAAATT GGCGCCAAAG ATGGCCCGGC TGAATCGGCA CAGTTTGACC ATCCGCAAGG AATGGCACTG GACGGGAATA CGCTCTATGT GGCCGATACG GAAAATCATC TGCTGCGGAC GGTGAACCTG ACCACATGGG AAGTTTCGAC ACTCGCAGGG ACTGGTGAAC AGGCCCGCGG CCGCGATCGT GGGGGGGAGT TGCGAACCAC AGCGCTGAAC AGCCCGTGGG ATCTTTACAT CCAACAGGGC GTGCTGTACG TCGCGATGGC TGGGCCGCAT CAGCTCTGGT CGCATGCACT GGGAAGTAAG ACGATTCAGA ACTATGCGGG CTCTGGACGG GAAGATATTA CCAATGGAAG CCTGGCTCAA TCGGCACTGG CGCAACCTTC GGGAATCACC AGTGATGGCG AGTCGCTGTA TGTGGTCGAT AGCGAAGGTT CATCCATTCG CAAAATCACT ACTAGCGAAG CAGACAAACT GGAAGACCCG GAGGGCAAAG TCACCACAGT GGTGGGAGCT TCGGATCTGC CGCGAGGTGC GAGCCTGTTT GAGTTTGGCG ATATTGATGG CAAGGGATCA GCAGTTCGTC TGCAGCATCC GCTGGGGATT GTCTTTCACG AGGGGAAGCT GTTTGTCGCC GACAGTTACA ACCATAAGAT TAAAGTGATC GATCCGATCA AAAGAACATG CGAGAGCTGG CTGGGGAATG GAAAGCCGGG GGCTGCACTT GCTCCGGTCC AGCTATCGGA ACCTGCGGGG TTGGCAACTT ATGGCGGAGT TCTGTTCATT GCCGACACGA ATAACCATCG CGTCCTGAAG GTCGATTTGA AAACGAAAGC TGCCACCGAG TTGAAGATCG AAGGCCTGAC AGCCCCCAAG CCTTGA
|
Protein sequence | MMTCSNRAWF RHFESWPQRC GVWRWDGNAL VGCLAAILLC LFFVPAVAQE TSDGQSPAAS TPKPAPEAGK PENPFPNRIP APSLDGGIEW LNTSQPLSLK DLRGKVVVLD FWTYCCINCI HVLPDLKYLE KKYGKELVVI GVHSAKFDNE KESGNIRKAI LRYEIEHPVV NDAEMTIWRK FSIRSWPSLV LIDPEGQFCG VASGEGNREL LDQVIAKVID YHRAKGTLNE KPMAFDLESG KEAATPLRFP GKLLVDPAHE RVFISDSNHN RIVVASLAGQ LLKVIGSGKI GAKDGPAESA QFDHPQGMAL DGNTLYVADT ENHLLRTVNL TTWEVSTLAG TGEQARGRDR GGELRTTALN SPWDLYIQQG VLYVAMAGPH QLWSHALGSK TIQNYAGSGR EDITNGSLAQ SALAQPSGIT SDGESLYVVD SEGSSIRKIT TSEADKLEDP EGKVTTVVGA SDLPRGASLF EFGDIDGKGS AVRLQHPLGI VFHEGKLFVA DSYNHKIKVI DPIKRTCESW LGNGKPGAAL APVQLSEPAG LATYGGVLFI ADTNNHRVLK VDLKTKAATE LKIEGLTAPK P
|
| |