Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_2606 |
Symbol | |
ID | 9139317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 3379574 |
End bp | 3382726 |
Gene Length | 3153 bp |
Protein Length | 1050 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003630629 |
Protein GI | 296122851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.320925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCCG ACACCGAAAC ACAACTCCCC CCTCAAGCTG CCAGCCCCCT ACCTGTAGTG GATCCTTACC TCCAGGCGCT CCTCTCCCCC CGGGGCGAGG AAGTCCTCCA CGCCGTCTGC CAGAAGGATC AAATCTGGCG GCACGACAAC TTCGATGTCT TCGACATCCA CGAGACTGCC CGCCTGCAGT TTGACAACCT CCTTCGCCGC CTAAAGGCCG AAGGAGGTGC CCCTTACGGC CTGATCCAAC TGGTGCTGGG CGAGGCGGGA AGCGGCAAAA CGCACCTGCT CCGGTTTTTC AGAAACCACG TTCACTACAG CAACGATGGG TTCTGCGGCT ATTTGCAGAT GACGACGGGC ACCCAGAACT ATCCACAGTA TGTCCTTTCT AATCTCATCG ATTCGCTCGA CGAGATTTAC TTCGAACCCG ATGGCGACGA GACCGGATTG ATGCGGCTCG CCGACGCCGT TGGTCAACAA GTGGAACTGT TCCTCCCCAA AGAAATACAG CGTCTTCGCG AAGACGATTT GACCCACGAT GAACTGGCCC AGCTGACCAA TGAACTGGCC GACGCCCTCC AGTCCCGCCC GGGCAATCGC ACGTTGGATG CGGGTGTGAT TCGCGCCCTG CTCTCGCTGA TTCCCCGCCA TCCCAAAGTC CACACGCGGG TGCTCAAGTA CCTCCGCTGC GAGCCGTTCT CCTCGTTCGA CGAGAAGATC CTTCCGGAGA TGCCACGTTG GACAGCCGAC GATGCACCCA CCCACATGAT TGCCGCCCTC GGCAAGATCA TGCGCCAGCT CACCAACAAG ACGCTCGTCA TCCTCGTCGA TCAACTCGAG GAGATGGCCA ATTTCGACGA AGACCCCGCC CGACTGGAAC ACCGCTTCCG CCACGCCATG CAGGCGATCT CCTCGCTCGT GGGCGGGATT CCCGGCGCGC TCTGTGTCGT GGGCTGTCTC GCCGACCTCT ACCACCTCAT GGAGCCGCGA CTTCCCGCGC CGGTTCTCGC GAGGGTCGCC ACCGATCCCC GCTCGATCCA GCTCATCGCC AGCCGCACCC GCCACGAGGC GAAGCAGATC ATCGAGACCC GCCTGAAATC AGTCTACGAA CAAGCAGGTC TGGAGAACCC CGAAACCATC GCCCCCTTCA CGGAAGAATT CATCGACGGC CTCGAAGGAC TCACCGCCCG CGACATTCTC TTCGCCTGCC GCGCGTTCCG GGATCGACTA GCGACAGGAG AGCCTGTCGA GACGCCACCT CCTCCACCAC CACCGGCGAT CGCCTGGCCA CAGGAATGGA ATGACTTCCG CGCACGCCAC ACGCCCGTTA TCCCGGAAGA TGCTGAGCAG CAACTACGCT TGCTGCACTG GGCCATCGAA CAATGCACTC TCGAACGACA ACCCGCCGCC GAATGGTCGA CCGATCTCCA CGACGATTCG TTGCTCATTG GGGCCACCAT CGGCGGCGTC ACCCGCTCTT CCAAACTTCT GGCTGCGTTC TGCGATGAAA ACCCCCAGGG CGGAAAGCTG CAGAAGCGAA TCGAGTCCAT CGTCAAACTG GCGGGCGACC AATCGGCGGC CCTCTTGCGG AATACCGCTT TCCCGGCTGT CAAAAAGGGA ACCAAGATCG GCGAACTGCT CCTCTCGCTC CCCAAGGACA AGTTCAAGCG GATCATCTGG GAAGACAGCC ACTGGCGATT CCTCACCACT CTTCGCGCCT TCCACGAGCA GCATTCCATC CAGGGCCAGT ATCTGGAATG GCGCACTTCC AGCCGCCCAT TGAACGACGT GAAGCCTTTG GTCGATCTCC TCAACCTCGA CCAGATTCCG CTGACTCCCA CACCTCCGAA AATCAAGCCT CTTCCACCGG AGCCGCCAAT CCCACCCGCT GCGAAGTCGC CAGTCCCACC GGCAACCAAA ACGCCGTTCG AAATACCCCT GGGTTTTACC CAAGGTTTGC GACAAACTCC GGTCGTGCTC AACTCTGAAA TCTTCAAAAG GCATGCGGCA TTCGTTGGTG GTGCCGGGAG TGGCAAGACA ACGCTCGCCC TCAATGTCAT CGAGCAACTA TTGCTCGCCG GCATCCCGGC AATCCTCATT GATCGCAAGG GTGATCTCGC CACCTATGCT TCGGACGATT GGGGCACGCC CAACGAACCC GCCCTCCTCA AACGGGCCAC TATCCTCAAG AAGCAACTCG ACATCCGCGT CTACACCCCC GGCGACCCCA CCGGCCATAA CCTCCTGTTG CCGCTGATTC CCGAAGGCGT GGGGCAGATG CCGGACAATG AGCGTGAAGC CGCTCTGGAA GCAACGACAG CCAGCCTGTT GAAAATGCTC GGCTGCTCAG AAAACCAATT CAGGGAGTAC CAGCCGATCC TGTTGACTGC ACTGCAGGTG ATCCTTGATA CCACACAGGG CGAAGTCACG TTGGAGTGGC TGGAAGAGGT GATCGGCAAG AAGGATCAGG AACTGGTCAA ACATCACCGC TGGACACAAC CCACGGTCTA TCTGAAACTC GCCAGGGCAC TCAGCGACCT GCGAATCACC CGCAGGTTGC TCCTCTCCAC AGAGGGAACA CCGCTCAACA TCCCCCGGAT GCTCTCCCCC ACTGCCGATG GCCGTACACC CTTGTCGATC ATCAGCCTCA AGTCGCTGGT CGATATGTCG GCCATCCAGT TCTGGATGTC GCGGTTCCTC ATGACTCTGG GGCGCCACGT CAGTGCCACA CCCTCAGCGA CATTACAGGC GGTGATCCTC CTCGACGAGG CCGATATCTA CGTCCCCGCA ACCAGCAAAC CCTCCACCAA GGAGCCGCTG TTGAACCTGC TCAAGCGGGC TCGCTCCGGC GGACTGGGCG TCTTCCTGGC CACACAAACC CCGGGTGATC TCGATTCCAC TTGCCGCAGC AACTGTGCCA CCTGGGCCAT CGGTCGGCTC AACGACAATG TGTCGATCAA CAAAGTGAAA TCCATGTTCA GCGACACGCC GCAACTTCTG GATCGCGTTG CGCAGCAAGG TCAGGGCGAA TTCGCCCTGG CCTCCGATGG CCTGACGACC CAGTTCAAAG GCGACCGCAG CGCCATGAAC ACCCGCCAGC TCTCCGAACA GGAAATCTTG ACCCTGGCCC GCGCGAACCG AAGCGTCGAA TAA
|
Protein sequence | MPADTETQLP PQAASPLPVV DPYLQALLSP RGEEVLHAVC QKDQIWRHDN FDVFDIHETA RLQFDNLLRR LKAEGGAPYG LIQLVLGEAG SGKTHLLRFF RNHVHYSNDG FCGYLQMTTG TQNYPQYVLS NLIDSLDEIY FEPDGDETGL MRLADAVGQQ VELFLPKEIQ RLREDDLTHD ELAQLTNELA DALQSRPGNR TLDAGVIRAL LSLIPRHPKV HTRVLKYLRC EPFSSFDEKI LPEMPRWTAD DAPTHMIAAL GKIMRQLTNK TLVILVDQLE EMANFDEDPA RLEHRFRHAM QAISSLVGGI PGALCVVGCL ADLYHLMEPR LPAPVLARVA TDPRSIQLIA SRTRHEAKQI IETRLKSVYE QAGLENPETI APFTEEFIDG LEGLTARDIL FACRAFRDRL ATGEPVETPP PPPPPAIAWP QEWNDFRARH TPVIPEDAEQ QLRLLHWAIE QCTLERQPAA EWSTDLHDDS LLIGATIGGV TRSSKLLAAF CDENPQGGKL QKRIESIVKL AGDQSAALLR NTAFPAVKKG TKIGELLLSL PKDKFKRIIW EDSHWRFLTT LRAFHEQHSI QGQYLEWRTS SRPLNDVKPL VDLLNLDQIP LTPTPPKIKP LPPEPPIPPA AKSPVPPATK TPFEIPLGFT QGLRQTPVVL NSEIFKRHAA FVGGAGSGKT TLALNVIEQL LLAGIPAILI DRKGDLATYA SDDWGTPNEP ALLKRATILK KQLDIRVYTP GDPTGHNLLL PLIPEGVGQM PDNEREAALE ATTASLLKML GCSENQFREY QPILLTALQV ILDTTQGEVT LEWLEEVIGK KDQELVKHHR WTQPTVYLKL ARALSDLRIT RRLLLSTEGT PLNIPRMLSP TADGRTPLSI ISLKSLVDMS AIQFWMSRFL MTLGRHVSAT PSATLQAVIL LDEADIYVPA TSKPSTKEPL LNLLKRARSG GLGVFLATQT PGDLDSTCRS NCATWAIGRL NDNVSINKVK SMFSDTPQLL DRVAQQGQGE FALASDGLTT QFKGDRSAMN TRQLSEQEIL TLARANRSVE
|
| |