Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3109 |
Symbol | |
ID | 5710961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3274527 |
End bp | 3276476 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641269036 |
Product | hypothetical protein |
Protein accession | YP_001534443 |
Protein GI | 159045649 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01905] doubled CXXCH domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTTT CCGTGATGAC AGTTTTCCTG GCGCTTGGCG TGGCCCTCAC GGCCTCGATC GCGCGGGCGG AAGACCCGCG CGTGCCGGAC TATATCGGCA CGGCGGTGTG CGCGGATTGC CACCAGGAGG CGTATGAGGC CTGGCAAGGC TCCCATCACG ACTTGGCCTG GACGCCGCCC GATGCGCTGC ATGTGCTGGG CGATTTCGAC GATGCGGAAT TCGTCCATAA CGGCGTGCGC ACGACCTTCA CCACCGAGGA CGGCGTGTTC TACATCACCT CCGACGGGCC GGATGGCAGG TTGCAGAAAT ACCCGGTGCA CAGCGTTGCG GGGATCGCAC CGTTGCAGCA ATACCTGATC GAGACGGAGC CGGGGAAGAT CCAGTCCTTC GACGTGGTCT GGGATATCGA GCGGGGCGCG TGGTATCACC TCTATCCGGA TCAGGACTTG CCGGCCTCGG ACGGGTTGCA CTGGACCGGG CCCTACAAGA ACTGGGGCGC GCGCTGCGCC GAGTGTCATG CCACGGGCTA CAAGAAGGGG TACGACCCGG CGACGCGGAC CTACACCAGC ACCCAGGCGG AGATCGGCGT GGGATGCGAG GCCTGCCACG GACCGGGGGA GGCCCATGTG AGTTGGGCCC TGCCGGGGGG CGATTACGAT CCGACCCGCT GGGACCGGGT GGGCGAGACC GGGCTGACCA TGGATTTCGC GGCCGGTGCG GAGGCGGAGA TCCAGCAATG CGCAGCCTGC CATTCCCGGC GCGAGGCGTT CGAGGAGGGC AATCCGCTGC CCGGCACGCC TTATCACGAT GCCTATCGGC TCAGCCTCTT GCGGGACGGG ATGTATCACC CGGACGGGCA GATCCTGGAA GAGGTCTATG TCTATGGCTC GTTCATCCAG TCCAAGATGT ACGCCGCCGG GGTGGCCTGC ACCGATTGCC ACGATGCGCA TTCGGCGCAG CGGCTGACCG AGACCAATGA CCTGTGCACC CAGTGCCATT CGACGGCCGG GAACCCGGAG TTCCCGACCC TGCGGCTGGC GGAGTATGAC GACCCGTCCC ACCACTTCCA CGAGGTCGGC AGCGAAGGGG CGCAGTGCAA GAGCTGCCAC ATGATCGAGC GCGACTACAT GGGGATCGAC GGGCGGCGCG ACCATTCCTT CCGGGTGCCG CGCCCGGACC TGACTGTCGA GACCGGCGCG CCCAATGCCT GCAACGATTG TCACACGGAT CGCAGCGCCG AGAAAATGGC GGCCGAGCTG GAGGCGCGCT TTCCCGACAG CATCCATCGC GGTCCGCATT TCGCCCAGGT CTTCGCCCGC GCCCGGATCG CGCCGCAGAG TACGGCGGAG AACATCGTGG CGCTGACCGA ATACCTCGAT CTGCCGGGGA TCGTGCGGGC GTCGGCGCTG GAGCTGCTGC ATCCGATGGC GGGCCCGGAG CTGGCGGATC GCATGGCGAT GGCTCTGCGC GATCCTGATC CGCTGGTGCG GGCCGCGGCG GTGCCGATCC AACGTGCGGC ACCCGCGCCG GTTCGGGTGG AGCGCCTTGC GCCCCTGCTG TCAGATCCGG TCCGAAGCGT GCGGATGGCG GCAGCCCGTG AATTCCTCGA TCTGAACATG CGGGTTGTTC CGCCGCAGAC CACGCAAGCG CTCGGAGCCG CGATGTCGGA ATGGCAGTCG TCGTTGCGGG CAAAGGCGGA TTTCCCGGAG ATCCAGATCA TCCTCGGCGG TGTCGGCTTG ACGATGCGGA ACATGCCTGC CGCCCTGTCG GCTTTCTCCG AAGCGGTCGA CCTCGATCCG CAGCGCGAAG AAGCCTGGTC TATCATTGTA CGCATCTATG CCGCGCTCGG GGATATGGCG GCGGCGAGAG ATGCCGTGGA CGCAGCACTG GTTGCAAACC CCGAGAGTGT GGCGCTGCGG GTTCTGCGGG GTGAAATCCT CCAGCAATAG
|
Protein sequence | MGFSVMTVFL ALGVALTASI ARAEDPRVPD YIGTAVCADC HQEAYEAWQG SHHDLAWTPP DALHVLGDFD DAEFVHNGVR TTFTTEDGVF YITSDGPDGR LQKYPVHSVA GIAPLQQYLI ETEPGKIQSF DVVWDIERGA WYHLYPDQDL PASDGLHWTG PYKNWGARCA ECHATGYKKG YDPATRTYTS TQAEIGVGCE ACHGPGEAHV SWALPGGDYD PTRWDRVGET GLTMDFAAGA EAEIQQCAAC HSRREAFEEG NPLPGTPYHD AYRLSLLRDG MYHPDGQILE EVYVYGSFIQ SKMYAAGVAC TDCHDAHSAQ RLTETNDLCT QCHSTAGNPE FPTLRLAEYD DPSHHFHEVG SEGAQCKSCH MIERDYMGID GRRDHSFRVP RPDLTVETGA PNACNDCHTD RSAEKMAAEL EARFPDSIHR GPHFAQVFAR ARIAPQSTAE NIVALTEYLD LPGIVRASAL ELLHPMAGPE LADRMAMALR DPDPLVRAAA VPIQRAAPAP VRVERLAPLL SDPVRSVRMA AAREFLDLNM RVVPPQTTQA LGAAMSEWQS SLRAKADFPE IQIILGGVGL TMRNMPAALS AFSEAVDLDP QREEAWSIIV RIYAALGDMA AARDAVDAAL VANPESVALR VLRGEILQQ
|
| |