Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_1581 |
Symbol | |
ID | 9138281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 2035818 |
End bp | 2038727 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | PA14 domain protein |
Protein accession | YP_003629613 |
Protein GI | 296121835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCGCT GCAGTCTCAT GCTGTGGTGC TTCTCGACGA TTGCCACCCT GCCGGCAACA AGTTGGGCCA TCGAGAGTGC TGCGAAAACA GATCGCTCAC CGCGTGTTGC CGGTTATGAA CGCTACCATG GCCAGACTCT GGGCGATGCC GCCGATGAGG AAGAAGATCG TCAACCCGTT GATGAGTTAG CACTCGGCCG CCTGCTGGTG GGAGAACTTA ATTGTGTGGC CTGCCATGCC ACCAGCCATG AGTTGAATCA GGTTTTGCTT TCCAAGCAGG CACCCCGGCT GACGAACGTT TCACAAAGAG TTCAGCCCGA GTGGATTCTT GAGTATCTGG CAGATCCGCA TCAGGCCAAG CCAGGCACTA CGATGCCTCA CATTCTGGGT GCTCTCCCGG AGAATGAGCG AGCCCCGGCT GCTTTAGCAC TTACTCATTT TCTAGCTTCC GATAAAGCTC TGAGAAACTC GTTTGTGGAT CGCGGGCTGG CGAATAAAGG GGCCAATACG TTTCAGCGAG TTGGCTGTGC AGCCTGCCAC AATCTTCCCA AGGATGGGGC TGACGATTGG AAGACCTCTG TCAGTCTCTC ACATGTCAGC CAGAAGTACA CCGTCGAAAG TCTGACTCGG TTTCTGAAAG AACCTCACGA AGCCCGTCCT GCTGCACGCA TGCCGAGCCT CAATCTCAAA GATGAAGAGA TCCGCGAAAT CGTCGCCTGG TTTTTCCGCG ATCTCACATT CCCACCCAAT CTGACCTATG CCTACTACGA AGGGAGCTGG GAGAATCTCC CTGACTTTGC CGAACTCAAA GCGGTTTCCA ATGGTGAAAC ATCAGGTTTT GATGTCGAGA TTGCCCCTAA AACATCAAAC TATGGTCTCG TCTTCACAGG CACAATCGAT GTTCCCGCAG CCGGTGACTA CCAGTTCTGG ATTACTTCTG ACGATGGGAG CGAACTGAAA ATCGATGGTG CTTCTGTCGC CAAGATGGAT GGTATTCATC CCGCTCAAAC CAAAGCCGGT AAGACCAGGC TCGAAGCTGG CAAGCATCCT GTCGTAGTAG AGTTTTTCCA GCAAGGTGGT GGAGCCGAAT TGAAGGTCGA GGTTCAAGGG CCAAAGCTCA GCCGACGTCC TCTCGAATCA CTGCTAGTCG CGCTTCCTTC AAAAGCATCT GCCGGTCAAG AACTGGCCAG ACGTCCCTTC GAATTCGATC CCAATCTGGC GGCTCAAGGC AAGGAACTCT TTACCTCTTT GAATTGTGCT GCCTGTCATG AATACAAGCA GGGGAACGAC ACCCTGAAGC CACAACATCA GGCCACTGAA CTGGCAAAGA TCGATCTTGC GAAAGGTTGC CTCGCAGAAT CGGTTTCTGG TAAATCGGCC AATTACCGCC TGTCCGAGGC TCAGAGATCC TCAGTTCAGC AGGCACTCAA GGCATTGAAG GATATTGCTG GCCAACCAGA AGCACTGGCC GCGTTAACTT CTGGCCCGGC GAAAATTCAC CAGACGTTTC TTGCTTTCAA CTGCTATGCC TGCCATCAGC GTGGTCTCTA CGGCGGTGCT GAAAAGATTC GTGACAGCGT CTTCGAGACG ACAATGAAGG AAATGGGTGA TGAAGGACGA CTACCGCCTT CACTCAACGG TGTAGGCGAT AAACTACGAG ATGACTGGCT GGCCAAACTT TTGAATGAAG GTGCCAACAA CCGCCAGAAT TACATGCTCA CCCACATGCC TCGCTACGGG TCGGGAGCGA TTGGTCATCT GAAAGACCTG CTGATCGCCA CTGATCGTAA ACCTGAGCCG GCACCCGAGT CGATTGCGAA GTTCAACGAA CCGGATTATC GCATCAAGGC CGCTGGCCGA CACATGGTGG GGGGCAACTC GCTTTCCTGC ATCAAATGCC ATGATTTTGG CCAATATCCT TCCATCGGGA TTCGGGCCAT CAATCTGGCT TCGATGACCA GCCGCCTTCG CGAAGACTGG TTTGTTCGCT ATCTGGCTAA TCCGCAGGAG TATCGACCGG GAACACGCAT GCCGGCGGCC TGGCCCTTTG GAACGACCAG TTTGCCAGAT CTTTTAGAAG GCAAACCTGA CCTCCAGATT CGAGCTGTCT GGATGTATTT AAGCGACGGT GAGAAGGCCG CGATTCCGGC GGGTTTGATC CGGGAACCGA TTGAACTTGT GGCCACAGAT ACTCCACTGG TCTACCGCAA CTTCATTGAA GGGGCCGGCT TTCGCGGCGT TGGCATCGCT TTCCCTGAGA AGGTGAACCT CGCGTGGGAT GCAAACAACA TGCGTCTAGC CATGATCTGG CACGGAGCTT TCATTGATGC TTCAAGGCAT TGGACAGGTC GCGGCCAAGG TTTCACCGGG CCATCAGGTG ACGAAATACT ATCGATGCCC GATGGTCGCC CGGTCGCGTA TCTGACAACT CCCGATCAGG CATGGCCGGA AGGTCTGGCT CGCGAAAACG GCTATCGCTT CCAGGGTTAC TCGCTTCCCG TGGTTGCCAA GGATCAACCG GCGCTGACGG CTTTCCAGTT CACGGATGGT CGTCTGGATG TCATCGATCA ATTCTCCGCA TGGAAGTCCA GTCCCAACGA CACCACGACA GATTTGCAGC GGACAATCCT CACTCGTCCA TCCAAAGGAT CTACCATCAG TTCGACGGAC GGCACGCCGC AATTCCGTGT TCTGAAAGCC TCACGCATCG AGCAGGAAGA ATCGCTCTTA GGAAGTTCAT CCTCTGGCAC AACATGGTTG ATTGACGGGG TCTGGTGGCT CACCATTGAA CCACAGGCAG GTACACAAGG GCTGCGTCCA CAATTGCGAA CGATGGGCAA TTCCAAAGAA ATTCTCTTGC CGCTTCATCC CGACAAAGCC ACCGGCTGGG TGCTGAAATA CAACTGGTAG
|
Protein sequence | MSRCSLMLWC FSTIATLPAT SWAIESAAKT DRSPRVAGYE RYHGQTLGDA ADEEEDRQPV DELALGRLLV GELNCVACHA TSHELNQVLL SKQAPRLTNV SQRVQPEWIL EYLADPHQAK PGTTMPHILG ALPENERAPA ALALTHFLAS DKALRNSFVD RGLANKGANT FQRVGCAACH NLPKDGADDW KTSVSLSHVS QKYTVESLTR FLKEPHEARP AARMPSLNLK DEEIREIVAW FFRDLTFPPN LTYAYYEGSW ENLPDFAELK AVSNGETSGF DVEIAPKTSN YGLVFTGTID VPAAGDYQFW ITSDDGSELK IDGASVAKMD GIHPAQTKAG KTRLEAGKHP VVVEFFQQGG GAELKVEVQG PKLSRRPLES LLVALPSKAS AGQELARRPF EFDPNLAAQG KELFTSLNCA ACHEYKQGND TLKPQHQATE LAKIDLAKGC LAESVSGKSA NYRLSEAQRS SVQQALKALK DIAGQPEALA ALTSGPAKIH QTFLAFNCYA CHQRGLYGGA EKIRDSVFET TMKEMGDEGR LPPSLNGVGD KLRDDWLAKL LNEGANNRQN YMLTHMPRYG SGAIGHLKDL LIATDRKPEP APESIAKFNE PDYRIKAAGR HMVGGNSLSC IKCHDFGQYP SIGIRAINLA SMTSRLREDW FVRYLANPQE YRPGTRMPAA WPFGTTSLPD LLEGKPDLQI RAVWMYLSDG EKAAIPAGLI REPIELVATD TPLVYRNFIE GAGFRGVGIA FPEKVNLAWD ANNMRLAMIW HGAFIDASRH WTGRGQGFTG PSGDEILSMP DGRPVAYLTT PDQAWPEGLA RENGYRFQGY SLPVVAKDQP ALTAFQFTDG RLDVIDQFSA WKSSPNDTTT DLQRTILTRP SKGSTISSTD GTPQFRVLKA SRIEQEESLL GSSSSGTTWL IDGVWWLTIE PQAGTQGLRP QLRTMGNSKE ILLPLHPDKA TGWVLKYNW
|
| |