Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_3168 |
Symbol | |
ID | 9139882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 4100337 |
End bp | 4102322 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | Spore coat protein CotH |
Protein accession | YP_003631182 |
Protein GI | 296123404 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.308168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGTCG GGAGCGTTCT ATTATTGGTG ATTGCCATCG CGATTGAATC GGTCTGGCCA CAGACGGGTG ATCGCGCTCC GGAAAATGGC CCACCCCAAA ATGGCCCCCG AGAAAATGGC CCAGGGGGCG GCTTTCCACC CTTCGGCCCT CCGGGCGGAT TCCCTGGCTT TGGCGGACCA CCCGGCTTCG GCGGCCCGCC CGGAGGTGGA CAGGAACGCA AGCTGCTCTC GAAATTCGAT GCTGACAAAG ACGGCAAGCT GAATCTCGCA GAGCGACAAC TGGCTCGAAA AGAATCCGCT CAGGGAAGTG CTGGCTTCGG CGGTCCCAGA GGGCCTCGCG GCGGTATGCC TGGCATGGGA GCCAATCGTA CGCCACAAGC AGGGAAGAAA ATCCTGCCCG AGAATGTGAC GTCAGCCGGT GATACTGATC TGTACGATCC GTCGATCGTC CGGACAATAT TTCTGAACTT TGAAGGCAAA GACTGGGAAA CAGAACTCTC GGACTTTCAC AACACGGATG TCGAAGTTCC CGCCATGATG CAGGTTGATG GTAAAGACTA CCCCGACGTC GGCGTCAGCT TCCGCGGCAT GTCATCCTAC GGTATGGTGC CAGCCGGCTT TAAGCGGTCA TTCAATGTTT CCATCGATGC CTTTAACGAC CAGCAAAAGC TCGGCGGCTA CAAAACGCTG AATTTACTTA ACTGCAATGG CGACACTTCA TTTTTAAGAG GTTTTGTCTA CTCTCAGATC GCCACGGAAA TGATCCCCGT CCCTCGCGTG AACTTTGTTC GTGTCGTTGT AAATCACGAA GACTGGGGTG TCTTTGCAAA CGTCGAACAA TTCAACAAAG ATTTTATCAA AAGACACTTT GAGAACAGCA ATGGCTATCG GTGGAAGGTT CCAGGAAGTC CTATGGGTCG CGGTGGGCTG GAGTATTTAG GGGATGATAC CAACGCCTAC AAGCGAATTT ACGAGATCAA AAGTAAAGAT ACTCCTGAGG CCTGGGAGCG ATTGATTTCT TTATGCCGCA TTCTGAATGA GACACCTGCG GAGCAACTGG TCGAAAAGCT GGAGCCAGTC CTCGATATTG ATGAGACCCT GACGTTTCTG GCGCTGGATG TCGCGCTCTG CAATAGTGAT GGCTACTGGA CTCGCGCGAG TGATTACAGT CTCTACTGCA CACCCGAAGG GAAATTTACA CTGGTTCCGC ACGACTTCAA TGAGATCTTT CAATCGGGTG GCCCTGGCGG CCCACCAGGT GGCGGACCGC CGGGTGGATT TGGGCCCCCA TCATTTGGTT TTCCTCCATT TGGTCCACCA CCCGAAGGCC AACCCCCATT CGGACCGCCG GGTGGTGCAC CCAATGGATT CGGGCCACCT CCCAATGGGA ATAGCACTCC TCCTCAGGGA CGAGTTGCCG GTAATCCCAA TGGCCCTCCT CAAGGGCCGG GTGGCGGGCA AAATCCGCGG GGTGGCCCAA GACGAGGCCC CGGTGGTGGT GGGCCTGGTG GCGGTGGTCC CGGTGGCGGT GGGCCAGGTC ATGGTGGGCC GACACTCGAT CCTCTGGTTG GGCTCAACGA TTCGACAAAG CCACTGCGCA GTAAACTCCT TGCTGTTCCA GAACTCAAGG CTCGCTATCT GAAGTATGTC GGACAGATTG CCGATCAGTA CCTGGCAGCC GAATTCCTCA AACCTCGGAT GCAGCAGGAG TTTGAACTGA TTTCACCACT GGTGGCTCAG GATCAGAAGA AGCTCTTCAC CACAGCCGAC TTTGTGCGCG AGTCGAAGTT TATTGAGATT CAGAACTCCT CAGAAAATGC TCGCTCGACG CTGTGGGATC AGATTCAGAA ACGACGGGAG TTTCTGATGA AACACGCGGA AGTCCGAGCC GCCCTTGGCA AGCAGAACAC CGATGGACGA ACATCGGCGA TCAAGAATCA GCGATCCTCC AACCGACAAC CGGCTCCCTT GTCGTCAGCC CGTTAA
|
Protein sequence | MVVGSVLLLV IAIAIESVWP QTGDRAPENG PPQNGPRENG PGGGFPPFGP PGGFPGFGGP PGFGGPPGGG QERKLLSKFD ADKDGKLNLA ERQLARKESA QGSAGFGGPR GPRGGMPGMG ANRTPQAGKK ILPENVTSAG DTDLYDPSIV RTIFLNFEGK DWETELSDFH NTDVEVPAMM QVDGKDYPDV GVSFRGMSSY GMVPAGFKRS FNVSIDAFND QQKLGGYKTL NLLNCNGDTS FLRGFVYSQI ATEMIPVPRV NFVRVVVNHE DWGVFANVEQ FNKDFIKRHF ENSNGYRWKV PGSPMGRGGL EYLGDDTNAY KRIYEIKSKD TPEAWERLIS LCRILNETPA EQLVEKLEPV LDIDETLTFL ALDVALCNSD GYWTRASDYS LYCTPEGKFT LVPHDFNEIF QSGGPGGPPG GGPPGGFGPP SFGFPPFGPP PEGQPPFGPP GGAPNGFGPP PNGNSTPPQG RVAGNPNGPP QGPGGGQNPR GGPRRGPGGG GPGGGGPGGG GPGHGGPTLD PLVGLNDSTK PLRSKLLAVP ELKARYLKYV GQIADQYLAA EFLKPRMQQE FELISPLVAQ DQKKLFTTAD FVRESKFIEI QNSSENARST LWDQIQKRRE FLMKHAEVRA ALGKQNTDGR TSAIKNQRSS NRQPAPLSSA R
|
| |