Gene Information |
Locus tag | Plim_3817 |
Symbol | |
ID | 9140535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 4907048 |
End bp | 4908448 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | protein of unknown function DUF1501 |
Protein accession | YP_003631828 |
Protein GI | 296124050 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
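The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions fit the numbers in this record, but the record itself does not state them.

```python
# Values copied from the record above.
start_bp = 4907048
end_bp = 4908448

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1401 0 466
```

A remainder of 0 means the CDS is a whole number of codons, and 467 codons minus one stop codon gives the 466 aa listed under Protein Length.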
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGATC AGATTGCTGC CAGATTCTCC CGACGGACGG TCCTTTCCTC GCTGGCAGGA AGTCTGGCGG GATTAACACT GGGGACAGGT CTTTCCCGCT GGGCAGGTGC CAATAGCGAG GTTCAGAACG CAGCGTCTGT CGTGGGAGAA CCTCACTTTC CTCCCCGCGC GAAACGGGTC ATTTTCCTGT TCATGCATGG CGGTGTCAGT CAGGTCGATA CCTTCGATTA CAAGCCCGAA CTTTCCAAAC TGGATGGCAA AACGTTGCCC TTTCAGGCAG CAGCGAACAT CGATGCCAAG CCGGTGTTGA TGCAGTCTCC CTGGAAGTTC AACCAGTATG GAGAATCGGG GGCCTGGTGT TCGGAACTCT TCCCCCACAT CGTCCAGCAG ATCGACAGGC TGTGCATTAT CAAGTCGATG CACAGCCGGG GGCAATCGCA TGGTCAGGCG GTTTCGATGT TGCATACCGG AAGTGATAAT CTGGTGCGGC CTTCTGTCGG TGCCTGGGTC TCTTATGGTC TGGGCTGCGA AAACGAGAAT CTGCCCGCTT TTGTTTCAAT TGGCCCTTCG GCAGGTCATG GCGGGCCACG CAATTATGGC GCTGCCTTTC TGCCTGCCAT CCATCAGGCC ACAACGATTG GCAGACAGGG CCGGCTGGGA AATGGACAGA TTGATTTTCT GAGTCAGGCG ACACCTGAGC AGCTCGAACT TGTGCGTGCC ATTCAGAAGA TCAGTCAGAA ACATCTGGAT CGTGTGGGCC CGGATCCTCA ATTGCAGGGG GCGATTGAAA CTTACGACCT CGCTTATCGA ATGCAGGCCG CTGCACCGGA TGTGCTCGAT CTCTCGCATG AGACGGAAGC AACGAAGGTG GCTTACGGGA TAGGCGAAAA AGCGACCGAC GAGTTTGGCA GACAATGCCT GCTGGCCCGT CGACTGGTGG AATCGGGTAT TCGATATGTG GAGTTATCCA CAGGGAACGT CTGGGATCAG CATGGCGGGT TACGAGCGGG CCATGCCAAG AATTCGATGG CCGTCGATCA ACCGATTGCA GCTCTGTTGA ATGATCTTGA TCAACGGGGA TTGCTCGATG AGACACTTGT GGTGTGGGCG GGCGAGTTCG GTAGAACGCC GATCGTGCAG GGTAATGATG GACGCGATCA TAATCCGCAG GGGTTTACGG TCTGGCTGGC TGGTGGTGGT GTGAGAAGTG GATTCTCGTA TGGCGAAACC GATGAGGTGG GCTATTTCGC TGCCCAAGAT CGCGTGCATA TGCACGATCT GCATGCCACG ATGCTGCACC TTTTAGGGAT CGACCATGAG CGGCTGACGT ACAAATATGC GGGCCGCGAC TTCCGACTGA CCGATGTGCA TGGGCGAGTC GTCAAGGAGA TCCTGGTCTA G
|
Protein sequence | MYDQIAARFS RRTVLSSLAG SLAGLTLGTG LSRWAGANSE VQNAASVVGE PHFPPRAKRV IFLFMHGGVS QVDTFDYKPE LSKLDGKTLP FQAAANIDAK PVLMQSPWKF NQYGESGAWC SELFPHIVQQ IDRLCIIKSM HSRGQSHGQA VSMLHTGSDN LVRPSVGAWV SYGLGCENEN LPAFVSIGPS AGHGGPRNYG AAFLPAIHQA TTIGRQGRLG NGQIDFLSQA TPEQLELVRA IQKISQKHLD RVGPDPQLQG AIETYDLAYR MQAAAPDVLD LSHETEATKV AYGIGEKATD EFGRQCLLAR RLVESGIRYV ELSTGNVWDQ HGGLRAGHAK NSMAVDQPIA ALLNDLDQRG LLDETLVVWA GEFGRTPIVQ GNDGRDHNPQ GFTVWLAGGG VRSGFSYGET DEVGYFAAQD RVHMHDLHAT MLHLLGIDHE RLTYKYAGRD FRLTDVHGRV VKEILV
|
| |
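The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how such a field could be cleaned and checked against the protein sequence, the Python below strips the spacing, computes a GC fraction, and translates codons until the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, whose amino acid assignments are shared by translation table 11; alternative start codons are not handled.

```python
# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
gene_start = "ATGTACGATC AGATTGCTGC CAGATTCTCC"
print(translate(gene_start))  # MYDQIAARFS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.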