Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4190 |
Symbol | |
ID | 5714721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009959 |
Strand | + |
Start bp | 35941 |
End bp | 37218 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641277085 |
Product | hypothetical protein |
Protein accession | YP_001542381 |
Protein GI | 159046713 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.663352 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGCC CGAAGGGCAC CCTGATCGTC AGCGGACGAT GGGATGACAA TGTGGCGATC GTCGACATCG CACAGGCGCT TCTGCCCGAA AACGACGGCA CCCCGAATGC CATCCTGTCG CGCCCGCGCG CCACGCCGGA CCTCGATCTC GATGGCGACG GGGTACCCGA TGCGCGTGCC AGCGGCCAAC CCGTGGCGGT GGCCGTGGAT GTCGCCGCGC GCCACGCCTA TGTCGTGTGC CACTCCGGCG ATGCGACGCC CGAAGGGGCC GCCGCCTACC AGCATGGGCA TCCGGGGCTG GTCACGGTGC TCGATGTCGC CGCGGCCACC GATCCCGCCC ATGACGACAC GCTCGGCGCG GTGGTCGAAT TCGTCTCCAC CGGCCGCACC GGCCCCGTGG GCTGCGCGCT CACCCCCGAT GGCCGCGCCC TTCTGGTGAA TTGCGGCGAG GCCGAGGGCA GCGAGGATGG CGGTGACGAA GTCACCGTGA TCGACGTGGC CACCCGCCGG GTCACCGCCC GCGTGCCACT GGCCCTGAAC CCCGACCATC CCGCGCGAAG CCCCAGCCCC CATGACAGCC CCCATGAGAG CTTCGGCCAC TACCCCAACC CCACCGGCAT CGTGATCTCG CCGCGTGCGG GCGGCGTCGT CTTCGTCGGC AATGGCGGGT TCAGCGATGT CTCCGTCCTC GACCTCAAGG CCGCCCTCGC CGGGTCGGCG GAGGCCGAGA TCAACCGGGT CGCCGTCGAA ACCGGCCCCT TCGGCATGGC GCTCAGCCCC GATGGGGCGC TGGTGGCGGT CGCGTCCCGC GAGAGCATGT CGCGCCCGAC CGAGGGCGGC ACGGTCTCGA TCATCGATGT GGGTCGCGCC GCCGCCAGGC GCCCCGATGC CGAGCTTGCC CGCATCCCCG TGGGCGGCAC CACCGACGAG ATGCCCAGCC GCCCCTTCGG CGTCGCGTTC TCACAGAACG GCAGCCGGCT GGTCGTGTCG TGCTTCCGCA CCAACGCGAT CTCGATCCTC GATGTCGCCG CCGCCTGCGC CGGCAAAGCT TGCGAGCTGC ACCGGCTCTA TCCCGAGGCC CCCGGCGGCG CCACGCCCCG GCCCCGCGGC ATCGCCGTCT GCGGCCCCTA TGCCTGCGTC ATCGGCGGGG CGAAGGAAGG CGCGCGCAGC AGTCTCGTCT GGGTCGTGGA TATCGCGGCG GGCACCGTGG TCGGCACCGT GACCGGGGTC GGCAACGAAT CCTATTTTCT CGCCGCGATT CCACCACCGT CGACATGA
|
Protein sequence | MIGPKGTLIV SGRWDDNVAI VDIAQALLPE NDGTPNAILS RPRATPDLDL DGDGVPDARA SGQPVAVAVD VAARHAYVVC HSGDATPEGA AAYQHGHPGL VTVLDVAAAT DPAHDDTLGA VVEFVSTGRT GPVGCALTPD GRALLVNCGE AEGSEDGGDE VTVIDVATRR VTARVPLALN PDHPARSPSP HDSPHESFGH YPNPTGIVIS PRAGGVVFVG NGGFSDVSVL DLKAALAGSA EAEINRVAVE TGPFGMALSP DGALVAVASR ESMSRPTEGG TVSIIDVGRA AARRPDAELA RIPVGGTTDE MPSRPFGVAF SQNGSRLVVS CFRTNAISIL DVAAACAGKA CELHRLYPEA PGGATPRPRG IAVCGPYACV IGGAKEGARS SLVWVVDIAA GTVVGTVTGV GNESYFLAAI PPPST
|
| |