Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_3972 |
Symbol | |
ID | 9140692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 5094540 |
End bp | 5095937 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | protein of unknown function DUF1501 |
Protein accession | YP_003631982 |
Protein GI | 296124204 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCATC ATCTACAGAA ATTGTTGAGT CAGAAACTTC CTCGCCGTCA GTTATTAACG GCCGGTGGCA TGGCTGGTTT TGGGTTGACC TTACCTCGGT GGCTGGCCGC TCAGGATCAA GCGGCTGCTG AGTTACCGGC CGCCATGTCG ACGGCGAAGT CTGTCATCTT TCTTTACCAG TTTGGTGGGC CAAGCCATGT TGATACCTTC GACATGAAGC CACTGGCACC GGATGGCACC CGCAGCCAGT TTGAAACGAT CTCGACATCC GTCCCAGGAC TGTCAATCTG CGAGCACCTG CCCCGAATGG CAGAGGTCAT GAATCGCGTT ACATTGCTCC GCACAGTGTG GCACACCATG AAGAACCACA ACAGTGCCTC CTACTATGCA CTCACCGGCC ATCCACCCGC TGTCGATGAT ATCCGCCTGC GCGACACGCT CGATCTCTTC CCGGCTTATG GTTCTGTGGT GGATCGATAT GCACCCAATA CCAATGGCAT GCCGACATTT GTGGCTTACC CACACGTCAT TCGCGATGGC GAAGTGACCC CCGGCCAGCA CGCGAGCTTT CTGGGGAAAG TGCACGATCC TCTTCTCGTC ACCGCTGACC CAAATGCCCC AGGCTTCGGC TTGCCGGAAC TCAGCCTGCC AGCCGGCGTT TCGACGGCAC GGCTCGAAAA TCGTCGGCAA CTGCAGCAGA TGATCAACGC TCAGGCCAAA CTCGGCGATG CAGCAGTCGC TGCACGCGGC CTCGAAGATT ATTACTCCCG TGCTGTATCA ATGCTGAACT CGCCAAAAAT CCGCCAGGCA TTTGCGATTG ATGAAGAGTC AGCAAGCGTT AGAGATCGCT ATGGTCGCAC GGAGTATGGC CAAGGCTGTC TCCTGGCACG CCGTCTCGTC GAGCGCGGCG TCAAGTTTGT CAGTGTCTAC TACTCGAAGA GTATTGGTGG CCGACGTAAA GAAGAGGGCT GGGATACCCA CGGATTTGAT AACACCCGCA TGTATCCCAT TCTCAAAGAT TATCACCTCC CCTTACTGGA TCAGACATTA CCGACATTGA TTCTCGATCT GGAAGAACGC GGCCTGCTCG ACCAGACGCT CATTGTCTGG ATGGGCGAAT TTGGTCGCAC GCCCCGGCTC AATGCCAATA TCAGCCGGGA TCACTGGCCG CAGTGCTATA GTGTGCTGCT GGCGGGTGGA GGGACGAAAA AAGGCTACGT CCATGGCACA TCCGATAAGA CCGGTGCTTT CCCTGAGAAA GATCCTGTGG CTTTGGACGA TCTCGCAGCG ACGATGTTCT CTGCGATCGG AGTTCCACCA GAGACAGAAC TTCGAGATCG CGGCAATCGA CCACTCGCTG CAGCGCTCGG TCACGTTGTC TCCGAAATCT TTGCTTAA
|
Protein sequence | MNHHLQKLLS QKLPRRQLLT AGGMAGFGLT LPRWLAAQDQ AAAELPAAMS TAKSVIFLYQ FGGPSHVDTF DMKPLAPDGT RSQFETISTS VPGLSICEHL PRMAEVMNRV TLLRTVWHTM KNHNSASYYA LTGHPPAVDD IRLRDTLDLF PAYGSVVDRY APNTNGMPTF VAYPHVIRDG EVTPGQHASF LGKVHDPLLV TADPNAPGFG LPELSLPAGV STARLENRRQ LQQMINAQAK LGDAAVAARG LEDYYSRAVS MLNSPKIRQA FAIDEESASV RDRYGRTEYG QGCLLARRLV ERGVKFVSVY YSKSIGGRRK EEGWDTHGFD NTRMYPILKD YHLPLLDQTL PTLILDLEER GLLDQTLIVW MGEFGRTPRL NANISRDHWP QCYSVLLAGG GTKKGYVHGT SDKTGAFPEK DPVALDDLAA TMFSAIGVPP ETELRDRGNR PLAAALGHVV SEIFA
|
| |