Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_0160 |
Symbol | |
ID | 9136814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 213763 |
End bp | 214851 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | protein of unknown function DUF1559 |
Protein accession | YP_003628211 |
Protein GI | 296120433 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000196505 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA ACCGCGGATT TACGTTGATT GAACTCCTGG TTGTGATTGC GATCATCGCC ATTTTGATTG CACTGCTTTT GCCGGCTGTG CAGCAGGCGC GGGAAGCGGC CCGTCGAACA CAGTGCCGCA ACAACCTCAA GCAGTTGGGG CTTTCGCTGC ACAATTATCA CGATGTTTTT GGCACGTTTG TCTTCCGCCG TGGTGGCACA GGTGGCCAGT GGGATGCAGT GCCCCGCAAC AACAACGAAC GCCGCAGTGG CGTGATCAGC CTGCTGCCTT ACATGGATCA GGCACCACTC TATAATCGAA TCGAAGCTGG CGACCTGACG GGAACCACCA ATGGTGGCAC AGCCGTGGGG CCTGGCGGTA ATCAGGCATG GACTTCCGGC CCAGGTGGTG GCTGGTCGGT TTGGAACGTG GCTGTGAATG GCCTGCAGTG CCCGTCGGAT TCCTTCTCGG GGAGTGTTGG TTGCAACAAC TACATGTTCT CACTGGGTGA TTCAGTCAAC AATGCGGAGA ATCTGCGAGA TGTTCGCGGC CTGTTCGGTT ATGCCAGCAC CTTTGGAGTC CGCGATTGCA CTGATGGCAC CAGCAATACG ATTGCGATGG CCGAGCGATG CAAGGGAAAT CAGGCTCCTG CAACGAATAG TAATCGCCGC GCAATTACTG GTACTGCGAT GAATAAGACA GGCATTGCTG CCAATCCTCT GCAGTGTCGT AATCTGGCAG TCAATGGTGT GTATGCCGCT GCGGAGAATG TCAAAGAACG CGCTGCGACA TGCTGGACCG ATGGTCGTCT CGAACGTTCC GGTTTTCAGA CAGTGTTGCC ACCCAACGGA ACATCCTGTG CCGAAGGTGG AGATACCAAT GCCGATTCCG CCACAGCCAT TATCACTCCC ACCAGTTTCC ACACAGGTGG TGTGCATGCC CTGATGGCTG ATGGCGCTGT TCGCTTTATC AGCGAGAACA TCGATACCGG TAACCTGGCG ACCGGCCCTG CGACGGGGAA TCCCAGCGGT CCCAGCCCTT ATGGTGTCTG GGGTGCTCTG GGAACACGCG CTGGTGGCGA AGTCACCAAC GAGTTCTAA
|
Protein sequence | MKKNRGFTLI ELLVVIAIIA ILIALLLPAV QQAREAARRT QCRNNLKQLG LSLHNYHDVF GTFVFRRGGT GGQWDAVPRN NNERRSGVIS LLPYMDQAPL YNRIEAGDLT GTTNGGTAVG PGGNQAWTSG PGGGWSVWNV AVNGLQCPSD SFSGSVGCNN YMFSLGDSVN NAENLRDVRG LFGYASTFGV RDCTDGTSNT IAMAERCKGN QAPATNSNRR AITGTAMNKT GIAANPLQCR NLAVNGVYAA AENVKERAAT CWTDGRLERS GFQTVLPPNG TSCAEGGDTN ADSATAIITP TSFHTGGVHA LMADGAVRFI SENIDTGNLA TGPATGNPSG PSPYGVWGAL GTRAGGEVTN EF
|
| |