Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4220 |
Symbol | |
ID | 5714749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009959 |
Strand | + |
Start bp | 67749 |
End bp | 69233 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641277115 |
Product | hypothetical protein |
Protein accession | YP_001542411 |
Protein GI | 159046743 |
COG category | [S] Function unknown |
COG ID | [COG3333] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGC TTCTGCCCCA GGCCTTCGCG CTGCTGTTGT CGCCGCAGGG ATTGCTGGTG CTCAGCGTGG GCACGATGCT GGGCATCGTG CTGGGCGCGC TGCCGGGGAT CGGGTCGACC GTGGCGGTGG CGATGATCCT GCCTTTTACC CTGACCATGG ATCAGGCGCC CGCGATCCTG CTCTTGCTGG CGATCTATGC CGGGTCGGTT TATGGCGGCT CGATTTCCGC GATCCTGATC AACACGCCGG GCACGCCGCA ATCGGCGGCG ACCTGTCTCG ACGGGTTCCC GATGGCCCAG CGGGGCGAGG CGGGCAAGGC CCTGGGCTGG GCCACCATCG CGTCGGTGGT GGGGGGTCTG ACCTCGGCCG TGGTGCTGAT CTTCGCGGCC CCGCAACTGG CGGCCTTCGC GCTGAATTTC GGACCGATCG AGACCTTTGC GCTGATCCTT CTGGGCCTGA CCTGCATCGT GTCGGTGTCG GAGGGGAGCC TTGTGAAGGG CCTGATGGCG GGGATGCTGG GGATTTTCCT GTCCACCGTC GGGGGCGATC CTATCACCGG GGAGGCGCGG TTCACCTTCG GGCAGTTCCA GTTGATCGCG GGCTTCAACC TGCTGGCGGT GGTGATCGGG GTCTTTGCCC TGTCGGAGGT GCTGATCCGC GCCAGTTCAG GGCCTGACAG CACCGGGGTC CTGGTGGATT TCAAGGGCAT CGTCCTGCCG CGTTGGGCGG AGTGGAAGGG ACGTTTGCGC GGGCTGGCCA AATCCGTGGC CATTGGCAAC GGGGTGGGCA TTCTGCCCGG GACCGGGGCG GCGACGGCGG CCTTCATCTC CTATGCGGAG GCGCGGCGGT CTGCCCCCAC GCGGGCGAAT TTCGGCAAGG GGGAGCCGGA CGGGCTGATC GCGTCGGAAT CGGCCAATAA CGCGGTCACC GGCGGGGCGC TGGTGCCGAC CATGGCGCTG GGGATCCCCG GGGACGCGAT CACGGCGGTG ATGCTGGCGA CGCTGACCCT GCACGGGGTG ACGCCGGGCA TTCGGCTGAT GCAGGACAAT CCGGTGCTGA TGGCGTCGAT CTTCGCGGGG TTCTTCATCA TCAACCTGAT GCTGCTGCCC CTGGGAATGC TGGTGTCGAA GCTGGCCGCG CCGCTCCTGC GGATGCGGGA GGCCTATATG CTGATCGTGA TCACGCTTTT GTGCACGGTG GGCGTGTTCT TCGTGCGGGG CAATCCGTTC GATTTGCTGG TGATGGCGGG GGCGGGGATC GTCGGCTTCG TGCTGCGGCG GCAGGGCTAT CCGATGGCGC CGCTGGTGAT CGGCATGGTG CTGGGTCCGA CGCTGGAACT GAGCCTGCGG CAGGGGTTGA TCATCACCGA TGGCAATTTC GGGGCGTTCT TCACCGGGCA TCCCATCGCG CTGGGTCTGA CCATCGCGGC GGCGGGGATG CTGAGCCTGC CGCTCATTCG GGCGCTGCGC AGCAAGGGAG CATGA
|
Protein sequence | MIELLPQAFA LLLSPQGLLV LSVGTMLGIV LGALPGIGST VAVAMILPFT LTMDQAPAIL LLLAIYAGSV YGGSISAILI NTPGTPQSAA TCLDGFPMAQ RGEAGKALGW ATIASVVGGL TSAVVLIFAA PQLAAFALNF GPIETFALIL LGLTCIVSVS EGSLVKGLMA GMLGIFLSTV GGDPITGEAR FTFGQFQLIA GFNLLAVVIG VFALSEVLIR ASSGPDSTGV LVDFKGIVLP RWAEWKGRLR GLAKSVAIGN GVGILPGTGA ATAAFISYAE ARRSAPTRAN FGKGEPDGLI ASESANNAVT GGALVPTMAL GIPGDAITAV MLATLTLHGV TPGIRLMQDN PVLMASIFAG FFIINLMLLP LGMLVSKLAA PLLRMREAYM LIVITLLCTV GVFFVRGNPF DLLVMAGAGI VGFVLRRQGY PMAPLVIGMV LGPTLELSLR QGLIITDGNF GAFFTGHPIA LGLTIAAAGM LSLPLIRALR SKGA
|
| |