Gene Information |
Locus tag | Dshi_1014 |
Symbol | |
ID | 5710530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1048493 |
End bp | 1049401 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641266925 |
Product | CBS domain containing protein |
Protein accession | YP_001532357 |
Protein GI | 159043563 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0266413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACA CCTCTGACGG ATCGTCTCCC GCGGCGCAAC GCGCGCAAAC AGACACGGAT CACCCAGCCA GACCGAACGT GTTCCGACGC GTTCTGGAGG CCCTGAAACC CCCGAAAACA GCTGAGCACG GCTCCGACGA GCATGGCATG CACCCGGGCG CCACGCCCAG CAGCAACGCC TTCGGCATCG GCAATCTGCG CCGGATGCGC GTGGTCGACG TGGCGATTCC CCGCGCCGAC ATCGTGGCGG TGCCCTCGGA CATCACCCTG CCCGAGCTGG TCCAGGTTTT CCGCGACAGC GGCATGACCC GCCTGCCGGT CTATCGCAAC ACGCTGGATA CGCCCCTGGG CATGGTCCAC CTGAAGGATC TCGCCCTGCA ATACGGCTTC AACGGCGCGA CCCCGAAATT CAGCCTCAAG GGTAATTTGC GCCCGGTGCT CTACGCGCCC CCGTCCATGC CCATCGGCGT GCTGCTGCAA AAGATGCAGA AGGACCGCAT GCACATGGCG CTGGTGATCG ACGAATACGG CGGCACCGAC GGGCTGGTGA CCATCGAGGA CCTGATCGAA CAGGTCATCG GCGAGATCGA GGACGAGCAT GACGAACCCG AGGACCGGCT CTGGTCGCGG GAGAAACCGG GCGTCTACCT GGTGCAGGCC AAGGCGCCCC TGGACGAGCT GGAGCGCGAG CTCAGCGTCT CGCTGATCTC CCCCGGCGAC GACGAGGAGA TCGACACCCT CGGCGGCCTG GTCTTCAAGC TGACCGGCCG GGTCCCAGTG CGCGGGGAGG TCATCCCCCA CGAGACCGGG GTCGAGTTCG AAGTGGTCGA CGCCGATCCG CGCCGGATCA AGCGCGTCCG CGTCCGCAAG AGCCCGCCCC CCGCGCCCCT GGCCCCCGCG GCGGAGTAA
|
Protein sequence | MADTSDGSSP AAQRAQTDTD HPARPNVFRR VLEALKPPKT AEHGSDEHGM HPGATPSSNA FGIGNLRRMR VVDVAIPRAD IVAVPSDITL PELVQVFRDS GMTRLPVYRN TLDTPLGMVH LKDLALQYGF NGATPKFSLK GNLRPVLYAP PSMPIGVLLQ KMQKDRMHMA LVIDEYGGTD GLVTIEDLIE QVIGEIEDEH DEPEDRLWSR EKPGVYLVQA KAPLDELERE LSVSLISPGD DEEIDTLGGL VFKLTGRVPV RGEVIPHETG VEFEVVDADP RRIKRVRVRK SPPPAPLAPA AE
|
| |
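The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database's own tooling. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and unspliced with a single terminal stop codon. The helper names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical.

```python
# Values copied from the record above.
START_BP = 1048493
END_BP = 1049401
GENE_LENGTH_BP = 909
PROTEIN_LENGTH_AA = 302


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS ending in a single stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; spaces and other characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # Both checks compare the recorded lengths with the values derived from coordinates.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence field (with its spacing intact) to `gc_fraction` gives a value that can be compared with the reported GC content of 67%.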