Gene Information |
Locus tag | Plim_4208 |
Symbol | |
ID | 9140929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 5379088 |
End bp | 5382093 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003632215 |
Protein GI | 296124437 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
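The coordinates and lengths in the table above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 5379088
end_bp = 5382093
gene_length_bp = 3006
protein_length_aa = 1001


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the reported gene length (3006 bp) and protein length (1001 aa) agree with the listed coordinates.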
Plasmid Coverage Information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0104266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTTG TCTTGGATGT CGGGTTGATC TCCGCGAAGT TTTTCTTCTG TCTTCTACTG GCTCTTTTCT GTGTGGGAGG AGTTGCCAGT CGGAATCTGT GCGCGGCCGA TGAACCATCC CCGACTGTCA CATGGCGCGA GGCCTACGAG CATCTTCAGA AAGGTCGCTA CGAAGAGGTC GAAGAAGCGT ATGAGTTCCT GAAAAAGCAG TCGGCCACAA AGGATGATCT CCCCGGCGAA TTCGCCAATA GCCCACATCC ACTCACAGGG CCACAGTATG CCGAGATCGC GCTGGCGCGC ATTGATCTCG AAACCGGACA TCGAGCCCGG GCCTACGAGC GGTTGGAAGC ACTCAAGAAA TTGCATTCCG AACAACCGCG GATTCTCGGG TTGCTCGCGT GGTATGCCTT TTTGAATGGC AAACTGGATG TCGCAGAAAC AAGCGCACAA GAAGCCATTC GGATAGAAGC GGATGAACTC TGGGCACGTC GAACGCTGGC AGAAGTTTAT CAGGCGACCG GTCGATTAAA GCAGGCGGAC GAAGGCTGGC GGTATTTCGT GAGGTATTAC AATCGCGTTC AGCCAGAAAA AGCGGAAGAG CTGTTGCTTG CTGCTGAAGG TTCGTTGGCG TATGCCCGCT GGCATACCGG GAAACAGATT TTTGATTTCG TGCTGAATAC GCTGTGCATC GATGCTCTCA AAGCCGATCC ACTTTTCTGG CAGGCTCATG AATTGAGTGG CCGGCTCTTG CTGGAGAAGT ACAACAAGCC GCAGGCTCAA CAGGAGTTCC AGGCTGCTCT CGCGATTAAC CCGCGGGCTG TGACTGTGTT GCTGGGCAGG GCACAAGCTG CGGCCCAGGA TTACGACTGG GATGAATCGA ATCGACTGGC CAAAGAAGTG CTGAAAAATG CCCCTGGCGA ACCCCTGGCA CATACGCTTC TGGCAAGATC CTTGCTGTTC TCGAATCAGC CCGAGGCAGC TCTCGAACAG CTGCAACTGG CAAAAGCGAT CTGTCCGACC GATCCCACCA CGACGGGCCT GATTGTCGCG GCCCAGATTC AACGTGACGG CATTCGTTCA TTGCCCCGCC TGCAGCTACT TCTCGAATCA ATTGATCATA TTGCTGATCT TCCGGCAGCG GCAGATGCTC CCGAGGCCTA CGAGACCACG TTGATTGCCG CAGCGAAGAT CAACAGTGCC TGCGGTGAAG TTTTAGCCAC AACGGGCGAA GCTCTCGAAA TGCATCGCAA GTTTGAACTG GCCGAAAAGT TTTATCGGGC GGCACTCGCC GTCATGCCTC AACTCACTGC TGCCAGGAAT AACCTGGGGA TGCTGACGCT GCAGATGGCG CGGGTGGATG AAGCCCGCAC CATGCTCGAT CAGGCATTCA AGAGCGATCC GTACCACATG CGTGTGAGCA ATATGCGTAA GGTCATTCGT CAACTGGATG GCTACGCGAC ACTTTCGACT GATCACTTTG TAATTCGCTA TGACAACGCG CAGGACGAGT TGCTCGCACG CTATATGTCG AAGTTTCTGG AAAACGAAGT CTATCCGCAA CTGGTGAAAC AGTTCGGATA CGAACCATCC ACACGAACCA CCATCGAGAT TTACAGCAAA GGGAATGGTC AGACGGCTCA TGAATGGTTC AGTGCCCGGA TGGTCGGTTT GCCCTGGGTA CAGACGATTG GTGCCTCCAC GGGGATGATG ATTGCCCTGG CTTCACCGAA CGGGCTGAAT GAACCCTATA ACTGGTCACG CGTCATTCGG CATGAGTATG TGCATGTGCT GACATTGCAG 
CAGACACAGT TCAACATCCC CCACTGGTAT ACCGAGGCTC TGGCAGTGAG GAACGAAGGT TATCCTCGTC CAGCCGAATG GAATGACATG CTTCGCAAGC GAGTTCCCCG GCGGGATCTG CGGAATCTTT CCAATCTGAA CCTGGGGTTC ATCAGTGCCA AAAATGGCGA TGACTGGAAT TTTGCGTATT GCCAGAGCGA TCTCTATGCG AACTATCTCG TCGAGCGGTT TGGTGAGCCT GCACTGGAGA AACTTTTGTT GGCCTATCGA GCCGGCAAAA CGACTGAGGT GGCTCTGAAG GAACTGTTTC AGACCGACAT CAAAGACTTT GAAGCGGGAT ACCTGGATTA CCTCGACAGG ATCGTGGCAC AGCTTCCGGG GCAGGCTGAG GCAAGTATTG AATATTCAAA AAGTGAGGTG GAGGCGGCTT TAGAGAAAGA ACCTCGGAAT GCCGAGTGGC TGGGCCGGGC CGCGATGTTG AAAGCCAAGG ATCGTCGCCG GGATGAAGCC CGTAAGCTGG CTCGCGAAGC TTTGGAGATC GATCCACATT GTGCGACAGC CGCCATCGCC CTGGCAGAAC TGGATCTTCG CGGAGAAGCA CCCGAGAAAG CCATTCAGAA GCTCGCGTCG GCTCTGGATA ACACTCAGCC AGATCATCAG CTCCTGCAGC GACTTCTCCC ATTATTGATT CAAACCAAGC GATGGGACGA AGCCCTCAAG TATGCGTCTC TGGCGGAGAC AACGTTTCCG AGTGACATTT CATCAACGGT AGCGCTGGCT GAAATTCTGC CACATTTCAT GGATCTGCCC AGATATAAAG CTGTTCTCGA AAAGCTGTCT ATGTATGAGA ATGATGAATC TGAGTACCGG CTCCAGCGAG CAGAAATTGC CTGGAAAGAG AGCGATTTGC CGAACGTGGC CCGGTATGCC GCCATGGTGC TGGAAATCGA TGTGACAGAA CCGAAAGCTC ATTGGCTGCT GGCCGAAGGG TTATCTGGCG TCAAGCCGGA GGAAGCTCTC GAAGAGTTTT CGATCGCCTG CAAACTCGAT GATGAACTGG CCGAAGCGTG GGCCGGCTGG GCGGTTCTGC TGCAAACGCG CGGCCAGGTA CAAGAAGCCC GCCAAAAGGC CGAAACCGCC CTGGCACTTG ATGCCAAGAA TGCACGGGCT CTCGAAGTCC TCAACAAGTC GCAGCCTGCT AAGTGA
|
Protein sequence | MRVVLDVGLI SAKFFFCLLL ALFCVGGVAS RNLCAADEPS PTVTWREAYE HLQKGRYEEV EEAYEFLKKQ SATKDDLPGE FANSPHPLTG PQYAEIALAR IDLETGHRAR AYERLEALKK LHSEQPRILG LLAWYAFLNG KLDVAETSAQ EAIRIEADEL WARRTLAEVY QATGRLKQAD EGWRYFVRYY NRVQPEKAEE LLLAAEGSLA YARWHTGKQI FDFVLNTLCI DALKADPLFW QAHELSGRLL LEKYNKPQAQ QEFQAALAIN PRAVTVLLGR AQAAAQDYDW DESNRLAKEV LKNAPGEPLA HTLLARSLLF SNQPEAALEQ LQLAKAICPT DPTTTGLIVA AQIQRDGIRS LPRLQLLLES IDHIADLPAA ADAPEAYETT LIAAAKINSA CGEVLATTGE ALEMHRKFEL AEKFYRAALA VMPQLTAARN NLGMLTLQMA RVDEARTMLD QAFKSDPYHM RVSNMRKVIR QLDGYATLST DHFVIRYDNA QDELLARYMS KFLENEVYPQ LVKQFGYEPS TRTTIEIYSK GNGQTAHEWF SARMVGLPWV QTIGASTGMM IALASPNGLN EPYNWSRVIR HEYVHVLTLQ QTQFNIPHWY TEALAVRNEG YPRPAEWNDM LRKRVPRRDL RNLSNLNLGF ISAKNGDDWN FAYCQSDLYA NYLVERFGEP ALEKLLLAYR AGKTTEVALK ELFQTDIKDF EAGYLDYLDR IVAQLPGQAE ASIEYSKSEV EAALEKEPRN AEWLGRAAML KAKDRRRDEA RKLAREALEI DPHCATAAIA LAELDLRGEA PEKAIQKLAS ALDNTQPDHQ LLQRLLPLLI QTKRWDEALK YASLAETTFP SDISSTVALA EILPHFMDLP RYKAVLEKLS MYENDESEYR LQRAEIAWKE SDLPNVARYA AMVLEIDVTE PKAHWLLAEG LSGVKPEEAL EEFSIACKLD DELAEAWAGW AVLLQTRGQV QEARQKAETA LALDAKNARA LEVLNKSQPA K
|
| |
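The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own pipeline: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). Only two short excerpts of the gene sequence are used here; pasting the full sequence into `gc_fraction` would let a reader compare the result with the GC content listed in the table.

```python
# Codon table built in NCBI order (bases T, C, A, G); amino-acid
# assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and last seven codons of the gene sequence.
first_blocks = "ATGCGTGTTG TCTTGGATGT CGGGTTGATC"
last_codons = "AAGTCGCAGCCTGCTAAGTGA"

print(translate(first_blocks))  # MRVVLDVGLI
print(translate(last_codons))   # KSQPAK
```

Both outputs match the start and end of the protein sequence listed above, and the final TGA codon is the stop that is excluded from the 1001 aa protein length.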