Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_3167 |
Symbol | |
ID | 9139881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 4096903 |
End bp | 4100043 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | protein of unknown function DUF1080 |
Protein accession | YP_003631181 |
Protein GI | 296123403 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.213021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTTCC ATGTGAAATC AGTGGTTGAG ACAGGTGGGC AGAGTCAGGT CTATTCGGAC AATCAAGTTC CGGCCTGCCT CCGCTTTGGG TCAAATGTCA TATCAACCTT TCTCGATTTG TACCGCAAAA GGTGTTTCAT GAGTTTCTGG ATCAGACCTC GCGGTTCGCA CCATCCAGGC CCTGTCCGCC ACAAGCCCGG CATGGGCATT CACTGGATGA TCCTGTGCTC TTTGACACTC TCGATCATTT CGGGATGTGG GCAAAGCCCA TCCAGCAGCC AGACTCCCGA GCAGGTGACA TCGAATGCAA GCTCTGCTTC CACTGAACCC ACTGCGAATG CCACCACAGC CGCACAGCCT CAGGCAGTTG CCAACTGGCT CGAAAATCCT CAGGCCATCG ACTTGTCCGG GCGGTATGGG CGTCGATTGC AGATCGTCTG GAAAGACGCC ACTCGCGATC CCATGGAGGG CGATGACGAC GACGACTCGA TGGCAGATCG CGATGTGACG TCCCCAATTG ATCCAGCAGA CGCGAACGAT CCCCTGATGG CCGAGAATTT GCCAGCGGGC TCGTCACAGG TCACCACAGG TAAAGACAAC GCTTTTGGTG CTGCCAACGA AGAGAAACCT GACGATGCCA GCGTGAGCCA ATCGTCTCAG GCCGATTGGT ATGTGCTGAT TGACGGTGAG CCCGTGCGAG ACGAGACCGG GATCCTGTTG TCGATCCCTT GTGAAGTCCG TGTCCCTCCT GGTCCTCATG AAGTGACTTT GGCTCAGCCG GGAATGGTTG ATCTATCGAA AAAAGTGGAG ATCCGGCAGG ATCGCAAAGT CAGTTTCGAA CGACCCGAGA AGCCGGTGCG TGGAGCCTCT TCGCTTTCGG GGCCACTCTT TGACCTGGCA AGAGGCGAGT TCCTTCCACT CGATGAACTC AACACTGCCG GGAAAGAGTA CGACCCCTGG CTTTCTGCTG ATGGACTCAC GCTGGTCTTT GCCGGTGATC GAGCGGAAGG CCGAGGCGTT TACCTGGCGA CACGTCCCAC AAGATACCAT GCCTTCTCGC CTGCCGAGTT AATTGAAATT ACACGGTCGG GCGAATTCGT GGCCACGCCC GTCCTCTCAC CGGATGGGTT GAGTTTGATC TATGTGCACC CCGCCCGCAC GCGGATCTGG CTGCTGGAAC GCAATTCCGT CGATGATCCC TTTGTCAAAC GGACAGCTCT TAGGAGCATC GATCAACCGG GGTTCGAATG GATCGGGGCT CAACTGATGT TCCCAGAGGC CATTTCACAG ACACCGCAAT TCAAAGCTTC GGCAAACTTC CAGTTAGCTT GGATCGAACG TAACGCCGCA GGTCAACAGC AGGTTTTTGT CGCCGAAGGG CCGAGTCTGG GGAGTTTGTC CAAAGCCCAG GGCAAAGCGA TCATTTTACC CGGAGACCGG CCCTGGTTTA CCGCTTCGAC GGATCGTCAA TTCAGTCTCG ATGGTCATGT TCTGCAGAGG TGGATTCGTG AACAATCTCT TGGGCCGGTC GCCTCGGCAC TCTATGGAAA TCCATCGCCC ATTGGAGAAT TTCCCGCTGG ATTTCCTCCT CCACTCGCCA ACGAACGATC GCTGTTTATT ACCGATGATG AGCAGTGGGC TGTCGCTGCC GTCACATCCG CAGCCAGCGA AGCCACACTA CAACAACAGC CGGGCGATCT CATGCTGCTT CGTCTCTCTG ATGGCCCGCA ATGGGGTTGG AAATTCCAGG GGCGCAGTCT GAAGCCAATC CCCTCCACCG AGACCACACC GCCGACATTA GTCGCTGATC AGTCAGCCTC GAAAGTCATG AATGCAGAAG ACCGTTCATC ACCAGCCACG ACGAATCCAT CAACTCCCGT TCGTGAGGAG ATGGATCCCT CCGTGAAGAC GAACGCCGAT CCCGCAGCGA CCGTCGCGAA CCAGCCTGTG CAACCAGTGG AAGTTCAAAC GGCCTATTCG ACCTATGAGA AATCTCTGGC TGAGTTTCGC AAGGCTCTGG AAGCTCGAAA TTATGAACAG GCGGCACAAA TCCTCCAGCA GCGGCGAGAA ACTTCGTTTG CGACCGCATT GAATCCACTG ACAGAACTCG ACCTGGCCTG GCTGAAAACA CTCAATGAGT TTCAGGTCAT GGTGAACGAT GGTGTTCGCC AGCTGGAGCC CGGCACCACA GTACGCGTCG GTTCTGCCAA GCTGGAGTTG ATTGGTCTCA AAGAGGGTGT TCTGAGTTTG AAGTCGCGAC TGAAGACCAT TGAAAAACCT TTGTGGGAAA TGTCGACGGG CGATCTGCTG GCACTGGCCG AATCTCTGCC GGGTGGGACG AATCAGGCAT CCGCGCTGAA GACGCTCGCT TTCGTAAAGG CAGATCCTGT TCTTCCAGCA CGAGTCATCG AATTGTGGCT GGGCCGTGCG GGCCCCGGCG GACAGGATTT TCTCGAAGCG TTCAGCACTC GCGAACTTGA AGAAGGCCGC CTGGCTCTGG CCGAAAATCG GTTAAGCAGT GCGATCGAGC ATTTTGGCAA GACGATTGCC GCCGGGCCGG AAAGACCCGC CGCACAGGCA GCCGAAAAGG AAAAAGCCCA GCTCTATGAT CGCACTCGCT GGAAGATTGT CGGCAAGCGT GACTGGGCTC GCGGCCCTGA CGGTGAATGG AGTGCTGATG CCCGGCGGAT CGACGGGGCT TATCTCGTTT CCGAAAGTGA TTACGAAAAC TTCGTATGCG AGTTTGAATG GAAAGCCGAT CAACCCGGTG CACAGGGTGG GCTTTACTTT CATTATGCCG GTGAAGGAAA CCCCTTTGAA TTTGGCTATA AAATCCATCT TGCTGGCGAC ATGGATCAGC AGGGAATGGA TCAATATTCA ACGGGTGCCC TCTTTGGATC GGATGCACCC AAAAAGAAGG TCGCCAAAAA GAATGCTTGG AACCGCTTCC GTTTGACAGT CGTCGGCCCT AAGACGACTG TCCAGATCAA CGATGAAGTC GTGCTCGAAA CCGATGTGCC TGTTTCCAAA AGCGAACCTC GTGGTTATCT GGCAATCGAC GGAGTGGGTG GCGCCTTCCG CTATCGCAAG ATTCTGGTTT ATGAACCGAG CAACTCTCCA GCTGCCAAGC CTCAAAACTA G
|
Protein sequence | MIFHVKSVVE TGGQSQVYSD NQVPACLRFG SNVISTFLDL YRKRCFMSFW IRPRGSHHPG PVRHKPGMGI HWMILCSLTL SIISGCGQSP SSSQTPEQVT SNASSASTEP TANATTAAQP QAVANWLENP QAIDLSGRYG RRLQIVWKDA TRDPMEGDDD DDSMADRDVT SPIDPADAND PLMAENLPAG SSQVTTGKDN AFGAANEEKP DDASVSQSSQ ADWYVLIDGE PVRDETGILL SIPCEVRVPP GPHEVTLAQP GMVDLSKKVE IRQDRKVSFE RPEKPVRGAS SLSGPLFDLA RGEFLPLDEL NTAGKEYDPW LSADGLTLVF AGDRAEGRGV YLATRPTRYH AFSPAELIEI TRSGEFVATP VLSPDGLSLI YVHPARTRIW LLERNSVDDP FVKRTALRSI DQPGFEWIGA QLMFPEAISQ TPQFKASANF QLAWIERNAA GQQQVFVAEG PSLGSLSKAQ GKAIILPGDR PWFTASTDRQ FSLDGHVLQR WIREQSLGPV ASALYGNPSP IGEFPAGFPP PLANERSLFI TDDEQWAVAA VTSAASEATL QQQPGDLMLL RLSDGPQWGW KFQGRSLKPI PSTETTPPTL VADQSASKVM NAEDRSSPAT TNPSTPVREE MDPSVKTNAD PAATVANQPV QPVEVQTAYS TYEKSLAEFR KALEARNYEQ AAQILQQRRE TSFATALNPL TELDLAWLKT LNEFQVMVND GVRQLEPGTT VRVGSAKLEL IGLKEGVLSL KSRLKTIEKP LWEMSTGDLL ALAESLPGGT NQASALKTLA FVKADPVLPA RVIELWLGRA GPGGQDFLEA FSTRELEEGR LALAENRLSS AIEHFGKTIA AGPERPAAQA AEKEKAQLYD RTRWKIVGKR DWARGPDGEW SADARRIDGA YLVSESDYEN FVCEFEWKAD QPGAQGGLYF HYAGEGNPFE FGYKIHLAGD MDQQGMDQYS TGALFGSDAP KKKVAKKNAW NRFRLTVVGP KTTVQINDEV VLETDVPVSK SEPRGYLAID GVGGAFRYRK ILVYEPSNSP AAKPQN
|
| |