Gene Information |
Locus tag | Dshi_1904 |
Symbol | |
ID | 5712897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1987353 |
End bp | 1988663 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641267829 |
Product | hypothetical protein |
Protein accession | YP_001533247 |
Protein GI | 159044453 |
COG category | |
COG ID | |
TIGRFAM ID | |
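The coordinate and length fields in this record can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the database schema.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1987353
end_bp = 1988663
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no amino acid.
coding_codons = codons - 1

print("span (bp):", span)
print("leftover bases:", remainder)
print("coding codons:", coding_codons)
print("matches listed gene length:", span == gene_length_bp)
print("matches listed protein length:", coding_codons == protein_length_aa)
```

Under these assumptions, the listed gene length and protein length both agree with the coordinates.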
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00000430551 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACCCC TTCCCTTCCT GAGCATCGCC CTGTTGCTCG CAACCGGCGC GCAGGCCGAG CCATATTGCG CCGATCTGCT GGAGGCAGAG CGCCTGCCTG CCAAATACGC CCGTAACGCC CCGATCTACA GCGATGCGGC AAGCGGCTGG GTCTTCACCC AGGACCAGCT GAAGGAGCGC TACGCGATGA AACCCGCCGC CCAGGCGCTG GTCGCCGAGA TCGTGCGGGA ATTCGACAAG CGCGACGTCG CACTTGCCAT CGTGGTCCCG CCCCCTCGCC CGATCATCGC AGGCCAGGCG CACTTTGACG CGGCGATGGG CGAGGCGCAC CATGACCTCG ACGCGGCACA AGCCTCCTTC GGCGACCTCA TCACGGGGCT CGCCGCGACC GGGGCCATCG TGCCCAATCT GCAAGAGCTT GCCCTCAGCG ATGCGGACCT GCGCGCCGCG TTCTACTTCA AACGTGACAC CCATTGGACG ACCACGGGCG CCCTGGCCAG CGCCCGCGCC GTGGCACAAG CCACCGCGAC CGCAAGATCG GATCTTTTCC CGGACATGCC GGCCGATGCG CCCGCGATGG CACCGCTCGA AACCACGATC GAGGAAAAAG GCTCCCTCGC CCGGATCCTG CGCGACGTGT GCGACATCGA GCCGGGGCGC GAAACCGCCC CGCTTTACAA CTACGCCCGC GCCTCAGCGG CAGGGCTGCT GGCCGATGTC ACTGATGCGC CCCGGGTGGC CCTTCTCGGC AGCAGCTTTT CCGACCGCTA TCAGACCGAC CACTACCGCT TCGCAGATGC GCTGTCCCAA GCCTTCGATC TGGACGTTGA GAACTTCTCC GTTTCCGGTG GCGGGCCCAT TGGCGCGCTG GAAGGCTACA TTCTGTCCGG TGCCCTCGAC CGGCGCGACC ATCCCATGGT GATCTGGGAA TTGCCCTATA CCGAGAATTT CAACTCGGTC TCCTTCCTGC GGCAACTTCT TGGCGCTTTG AAATATCGAT CCAGCGGGAC CGGCGACACC CATGCGGTGC GCGATGCGGG CGCGCCACTC GAGATCGAGG TTGCACAGTC GGGCCTGTCC GGGATCGGGA TCGAAACCGG AAGCCTGGAG CACCAGAATA TCCGGGTCGA TGTGGAGTTT GTCGATGGCT CCGCCCAGCG TGTCACCCTG CGCCGCCGCC CCGCGGTGCC GGTTGACCTG CGTGGCACAC ACCTATTCGG CGTCTTGGGC CCCTTCGGCG ATCGAACCCC CAAGACGGTC ACAGCAACGC CCCTCAAGGG CACCGAAATG ATCACCATCC AACTGTTCTG A
|
Protein sequence | MKPLPFLSIA LLLATGAQAE PYCADLLEAE RLPAKYARNA PIYSDAASGW VFTQDQLKER YAMKPAAQAL VAEIVREFDK RDVALAIVVP PPRPIIAGQA HFDAAMGEAH HDLDAAQASF GDLITGLAAT GAIVPNLQEL ALSDADLRAA FYFKRDTHWT TTGALASARA VAQATATARS DLFPDMPADA PAMAPLETTI EEKGSLARIL RDVCDIEPGR ETAPLYNYAR ASAAGLLADV TDAPRVALLG SSFSDRYQTD HYRFADALSQ AFDLDVENFS VSGGGPIGAL EGYILSGALD RRDHPMVIWE LPYTENFNSV SFLRQLLGAL KYRSSGTGDT HAVRDAGAPL EIEVAQSGLS GIGIETGSLE HQNIRVDVEF VDGSAQRVTL RRRPAVPVDL RGTHLFGVLG PFGDRTPKTV TATPLKGTEM ITIQLF
|
| |