Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2552 |
Symbol | |
ID | 5713449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2715478 |
End bp | 2718297 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641268475 |
Product | hypothetical protein |
Protein accession | YP_001533886 |
Protein GI | 159045092 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02243] conserved hypothetical protein, phage tail-like region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00108717 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGGCG ATTTCACACG TTGGAACCGG GCCGGGCTGC GCCGCTTTCG CTATGTCGGT GGCAACGCGG TCACCTTTCT GGAAACGCTG CGTGCGCATC TGGCGGAGGG ATACCGCGCA CCCGGGGAGG CCTTGCCGCG CTGGAGCGAT CTGGTCACCC GCAACCCGGT CGCAGCGGAC GAGGACCCGG CCGAGCGCGA AGCGCGGCTT CTGGCGCAAT ACGCGGACAT CCGGCGGGAT CATGGGTGGG AAATCCTGCG GGCCTTTGCC CGGTCCACCC ATGTTCTGAC CGAGCATATC GATGCCTACG CCAATGAGAC CTTTATCCGT ACGGCGACCC AGTGGGACAG CGTGCGGCAT CTGATCAAGC TGCTGGATGC GCGGCCCGCG CCGCCTGCAT CAGCGCGCAC GGTCCTGGCG TTTGATGCCA GCGAGGCGGG CAAGCTGGCC AAGGGGTTCC AGGTCAAGGC CGGGCCGGGT GGCGACGGGC CGCCCCCGGT CTTCGAGACG ATTGCCGATC TGGATCTGGA CCCTGCGCTC AATGTGCTGC GCCCGACCGA CTGGAATCGC GGTCAGGAGG ATTTCGCCTA CAGCGCTGCA GCGAGTGGTC ATGTCGCTAG TTTTCCCCTG CCCGATTCCG GGATGGCGCC GGTGGTGGGC GAGATCGCGG TGCTGGAGAT CGCCGCGGCA GACGGGGTGT CCGCGCCGAG TGCTTTCGGG GTGCAGGTCA ACCCCGGCAT AGGGGGCATG GCCATGTTGC AGGGCCAGAG CCCCGCAGGC GGGGGCGACC TGACGGTCAA GCGCTGGCGC GTGGCGCTCC ATGCGGGCGC GCGTCTGGTG CAGACGCCGC GGCTTGCAGG TCCGGACGTG GTGATCCTGA CACCGGAGCA CAATCTCGCC GCCGGGCGCA GCACAGTGAC CTGGCTTGCG GGCGGGACCT GGCGCACGGC GCGGGTGATG GAGATCGACG GGGACCGCGT GCGGTTGGAC GGCGGTCTCA TGCCAGGGGT CGGCGCGGAT CTCTACGCGA TGGTCGAGGC GCGGTCTCAG GTGCTGCCCG GGTCAGAAAC GCTTGTTCTG CCGTTCGAGC GGGGCGCGTC CGGGCGGGTC TGGTCCGTGG CGCTTGTCGA CAAGACGGCG GATGTCCGGA CCAAGAGCTC CAATGACGCG CTGGGCAACT CGTTCCCGGA ATATGACTTC GTCACGGGCC TCGCCACGGC GCTCTATGTG CCCGGAGGAA GCGATCCGGT GGCCCGGGTC GAGGTCGCGG CCCCTTCCGG TCTTGTGCTC GGCGGCGCGG TGGAGGGGCT GCGTCAGGGC GACTGGATCG TCTCGGCGGG TTCGCAGTTG CAGGCGGTTC GGGTGACGGC GGTGACTGAG CGGGAGGGCG ATACAGCCAT CGATACGGTC CCGGAGATTA CCGATCTGCA AGCGCCCGTG CACTTGCACT TCGCCGATGT CTTGCGTCCG CTGGATCATG ATCGCAACCG TAGTTCGGTT CATGATGCCA CGGCACTCTC AGACCGGGTG ACGCGCGTGC TGGTCGATGT CGGGGGCTCG GCGCGGCTGG CCCGGGGGCG ACAGGTTATA CTTGAGGATC CGGAGCGCTC CCATGCCGGG GAGATCACCG CGACCGGTCC GGGCTGGATC GAAATTGCGC CGGTGATCCC GGGAACGGAA TTGACCGACC CCCTCGATGT CACGCCGTTC GAGCGGTGGT CCACCACCCT CTCTGCAAAT GTGGTGACCG CCGATCACGG GGAAACACAG GACCTGAAGA TCATCGGCTC GGGGGATGCG ACGGCCTCCA ACCAGGTGTT CGAGATCAAG GCGCAGGAGC TCTCGTTCAC CACCGACCCG CTGATGCCAG CAGGGGTGCG GGCGGGGGTC GAGATCTTTG TGGGAGCCCG GCGCTGGCGC CAGGTGGGCA ACCTGCGCGA CAGTGGGCCG ACGGATGCGG ATTACGAGGT CCGGGTGGCC GAGGATGGCA CGGTCACTGC CCATTTCGGC GATGGGCGCC ATGGGCGACG TTTGCCCACG GGCACAGACA ATGTCCGCGC GACATGGCGC AAGGGCGTAG GTTCGGTGGG CAATTTGCCG CGCAGCACGC TGCGCAAGAT CGTGCAGCCG GACCCACGGC TCTCCGCTGT CCGGCAGCCT GTGGCAGCGG CGGGCGGGGC AGAGGCCGAA GGGGTCGAGT CTCTGCGGGA CAATGCGGCG GCGGGCCTTC TGACCCTTGG GCGCGCTGTC TCGGTCAGTG ATTTCGGCAA GCTCGCGGCG CAGAACGCCC AGGTCTTGCA GGCCATGTCC TACAGGGTGG CGCCGGGTAC GGCACGTGGC GAACTCGTTG AGGTGGTCGT GGTCCCGGCA GGTGGGCGCA TGGGCACACT GGGCGAGGAT CTGACGCAAT TTCTCGCAGG AAACGGGCTC CCCGGTGTCA ATCTGCGAGT GGTGCCGTAT GTGGCGTTGC CACTGAGCCT GTCGGTTCGG ATCGAAGTCA GGAGCGCCGC GTACGATCCC GATGAAGTGG CCGAGGTCGT CCGCGTGAAG ATCAGTGAGG CCTACGCGCT GGAGCGTGCA CAACTCGGTG CGCCGCTCTT CCGCAGTCAG ATCCTGCACC TCGTGGAAGG GGTAGAGGGG GTCGAGAACG CCCGGGCCGA TATCCTGACC GCGGGGTGGG CGGGGATCAC GCCCGCGCCC GGCATCGGCC TGGGCACAGG GAGCGGGGTG CGCAGCGTGC GCCCGCGGCC GAACCAGATG ATCCACCACG ACCCGGACCT GTCCCAGCTC ACCATCAGCA CACAGGAGTT CACGTTGTAG
|
Protein sequence | MSGDFTRWNR AGLRRFRYVG GNAVTFLETL RAHLAEGYRA PGEALPRWSD LVTRNPVAAD EDPAEREARL LAQYADIRRD HGWEILRAFA RSTHVLTEHI DAYANETFIR TATQWDSVRH LIKLLDARPA PPASARTVLA FDASEAGKLA KGFQVKAGPG GDGPPPVFET IADLDLDPAL NVLRPTDWNR GQEDFAYSAA ASGHVASFPL PDSGMAPVVG EIAVLEIAAA DGVSAPSAFG VQVNPGIGGM AMLQGQSPAG GGDLTVKRWR VALHAGARLV QTPRLAGPDV VILTPEHNLA AGRSTVTWLA GGTWRTARVM EIDGDRVRLD GGLMPGVGAD LYAMVEARSQ VLPGSETLVL PFERGASGRV WSVALVDKTA DVRTKSSNDA LGNSFPEYDF VTGLATALYV PGGSDPVARV EVAAPSGLVL GGAVEGLRQG DWIVSAGSQL QAVRVTAVTE REGDTAIDTV PEITDLQAPV HLHFADVLRP LDHDRNRSSV HDATALSDRV TRVLVDVGGS ARLARGRQVI LEDPERSHAG EITATGPGWI EIAPVIPGTE LTDPLDVTPF ERWSTTLSAN VVTADHGETQ DLKIIGSGDA TASNQVFEIK AQELSFTTDP LMPAGVRAGV EIFVGARRWR QVGNLRDSGP TDADYEVRVA EDGTVTAHFG DGRHGRRLPT GTDNVRATWR KGVGSVGNLP RSTLRKIVQP DPRLSAVRQP VAAAGGAEAE GVESLRDNAA AGLLTLGRAV SVSDFGKLAA QNAQVLQAMS YRVAPGTARG ELVEVVVVPA GGRMGTLGED LTQFLAGNGL PGVNLRVVPY VALPLSLSVR IEVRSAAYDP DEVAEVVRVK ISEAYALERA QLGAPLFRSQ ILHLVEGVEG VENARADILT AGWAGITPAP GIGLGTGSGV RSVRPRPNQM IHHDPDLSQL TISTQEFTL
|
| |