Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3139 |
Symbol | |
ID | 5712195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3305602 |
End bp | 3306732 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641269066 |
Product | cupin domain protein |
Protein accession | YP_001534473 |
Protein GI | 159045679 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.557804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGA TCACCCAGAC CGAAGCCGAG GCCCGCCGCG TCAAACGCAG CGACTACACC TCCTGCACCG TCGCCTTCAT CGACTGCAAG AAGCCCGGCT CCCACCTCAA ACAGAACTAC TCGATCATCG GCCCCGGCGT GACCTCGTCC TCCGAGCAGG TGATCAACCT GCCCGAAGCC CACGGGTTCA ACATCGGGGC TGCCGCCATG CCCACCGGCA TCACCAACAA CCTGCACATC CATTTCACCG CCGAGGTGTT CATGGTCTAT GACGGGGAAT ACACCTTCCG CTGGGGCTCG AACGGCGAGC ACGAGATGGT CGGCCGCCCG GGCGACATCC TGTCGGTGCC CACCTGGATC TTCCGCGGCT TCACCAATAC CGGCGCGGGC GAAGGCTGGA TCTTCACCAC GCTGGGCGGC GACAACACCG GCGGCGTGAT CTTCCATCCC GACATCCTCC GCGAAGCGGC CGATTACGGG CTCTACCTGT CGCGCAACAA CGTGCTGATC GACACCACCC GGGGCGACCC CACCCCCGGC GAGGGCGAAA CCCTGCCGCC GATGAGCGAT GCGGAGGTCG CCAAGCTCCG CCATTACAGC CCCGAGGAGA TGCGCGAGCG CCTCGTCAGC ACCGCCGAGC GCGACTTCCG CCCCGCCTTC GTCGACCGGG CGCTGGACGG TTGCGGCGCC GAGATCGCTC CGGTGATCGG CCACGGCATC AGCCAGAACC GCGACCACGC CGCCAAGATC CGCAACCCCC ACGGGTTCTC CATGGAATGG CTCCGGGTCG CACCGGGGCA GACCGTCTCC CCCTTCACCC TCGACGACAA GATGGTGGTG ATCCCGCGCA CGGACGGGCT GCGGGCGAGG CTCAACCCCG AGGGCGACGT GACCCTGGAC CTCGGCGGGT GGGAGACGTT CTCGATCCCC GCCGAAATCA CCCGCGCCTT CGAGAATACC TCCGACAGCC CGGTCGAAGC CCTGATCATC GTCTCCGGCG ACCACCAGAA ACGGCCCCGC TTCGCGCCGC AGGTGCTCGC GGCGGCGGCG GAAAAGGACC TCGCCCTGGA TGCCCAGGGC TACGTGGCCG CCGCCCACCT GATCCCGAGC TATGGCCTCG TGGCCGAGTA A
|
Protein sequence | MTRITQTEAE ARRVKRSDYT SCTVAFIDCK KPGSHLKQNY SIIGPGVTSS SEQVINLPEA HGFNIGAAAM PTGITNNLHI HFTAEVFMVY DGEYTFRWGS NGEHEMVGRP GDILSVPTWI FRGFTNTGAG EGWIFTTLGG DNTGGVIFHP DILREAADYG LYLSRNNVLI DTTRGDPTPG EGETLPPMSD AEVAKLRHYS PEEMRERLVS TAERDFRPAF VDRALDGCGA EIAPVIGHGI SQNRDHAAKI RNPHGFSMEW LRVAPGQTVS PFTLDDKMVV IPRTDGLRAR LNPEGDVTLD LGGWETFSIP AEITRAFENT SDSPVEALII VSGDHQKRPR FAPQVLAAAA EKDLALDAQG YVAAAHLIPS YGLVAE
|
| |