Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0896 |
Symbol | |
ID | 5710586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 914230 |
End bp | 915495 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641266806 |
Product | hypothetical protein |
Protein accession | YP_001532242 |
Protein GI | 159043448 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.08458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.380175 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGAAG CCTTCCTGAC CAACACGTCC ATCGAGGGGT TGCGCCCGCT CGGGACGGCG CCGCAGCGCT CCTACGAGCT GATCACCGGC ACGGTGCGGG CCGAGATCGG CGCGGCGCCG GCGGCCCTTT TTGCCGAGCC GGTGGCGACG CAGTTCGGCG ACCGGTTCGA CTGGTACGCG GCGGTGGAAG GCAAGGCGCA GCCCTTGGCC GACCTGCCCG AGCAGGACCG CGCACGGGCC GAGGCGACGC TCGCGGAGCA GATCGCGGAG GTCCGCGCAC TTGCGGCGCG CTACAGCAGT GCCGAGAGCG CCGAGGAGCA GCGGCTGGGC GAGGCGCTGG AGAATGCGCT GTCCTATCCC GAGGACGGGG TGTTCGTGGT GTGGGGGCCG GATGGCGCGT TGCAGCCGGT TCTGGTGAAC TGGGCCTGGG TGTCGGACAA GCAGGTGGTG GTGGAGGGCT CCTTGCGCGC GCCCGACGCG CGACCCGCGC CGAAGCCCGC CCCACCGCCC GTGGGCACGG GCCCGGCAGG CGCCGCGGGG GGCGCCGCGG CCACGGCGAC ATCCGAGCGG ATCTGGGCCG CGCCCCTCTG GCTGCTGTGG CTCCTGGGTC TGTTGCTGGC CCTGCTGCTG GCGGCGATCG TCTGGCTAAT GGTGCCGGCC TGCGGGATCA GGACGCCCTT CACCCTGTCC TATTGCGAAG CGGCGGCCCA GAGCGATGCG GCCACCCGGC GCGGGCAGGT CCTGAAGGAC CGGATCGCGA TTCTGGAGCG GCAGATCGGC ATCGCCGACC GGGCCTGCCA GCCCGACCCG AGGGACGGGC TGCCGATCCC GGACCTGCCG GCCCTGGCCC AACGCCCGCC CGAGGCGCTG CCCGATATCG ACGCGCGCCG GTCCGAGGCC GGTGCAGAAT TGGGCGATCT GACCTTCACC CTGGCCTGGG ACGGGCCGGA CGACCTCGAT CTGTCGGTGA CCTGCCCGGC GGGGGTGACC GTGTCCTACC TGCGGCGCGA TGCCTGCAAC GGGCAACTGG ACGTCGACAG CAATGTGGGC GCGCCGGTGG ACAAGCCGGT GGAGAACATC TTCTTCACCG GGCCCCAGGG CGGGGTCTAC GAGATCCGCG TGCGGATGTA TTCCTCGCGC TCCGGGGGGG GCGATACGCC CTTTCAGGTC CAGATCCGCG CGGCCGACAG GGTTGAAAAC CTGACCGGGA TCGTTTCAGG TCAGAACAGG GACTGGCAGC AGTCCTACAA TTACGGGGGG CAGTGA
|
Protein sequence | MFEAFLTNTS IEGLRPLGTA PQRSYELITG TVRAEIGAAP AALFAEPVAT QFGDRFDWYA AVEGKAQPLA DLPEQDRARA EATLAEQIAE VRALAARYSS AESAEEQRLG EALENALSYP EDGVFVVWGP DGALQPVLVN WAWVSDKQVV VEGSLRAPDA RPAPKPAPPP VGTGPAGAAG GAAATATSER IWAAPLWLLW LLGLLLALLL AAIVWLMVPA CGIRTPFTLS YCEAAAQSDA ATRRGQVLKD RIAILERQIG IADRACQPDP RDGLPIPDLP ALAQRPPEAL PDIDARRSEA GAELGDLTFT LAWDGPDDLD LSVTCPAGVT VSYLRRDACN GQLDVDSNVG APVDKPVENI FFTGPQGGVY EIRVRMYSSR SGGGDTPFQV QIRAADRVEN LTGIVSGQNR DWQQSYNYGG Q
|
| |