Gene Information |
Locus tag | Dshi_0997 |
Symbol | |
ID | 5710513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1027607 |
End bp | 1029982 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641266908 |
Product | hypothetical protein |
Protein accession | YP_001532340 |
Protein GI | 159043546 |
COG category | [S] Function unknown |
COG ID | [COG3002] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
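The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1027607
END_BP = 1029982
REPORTED_GENE_LENGTH = 2376     # bp
REPORTED_PROTEIN_LENGTH = 791   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene == REPORTED_GENE_LENGTH)        # True
print(computed_protein == REPORTED_PROTEIN_LENGTH)  # True
```

Under these assumptions the reported 2376 bp and 791 aa agree with the coordinates: 1029982 - 1027607 + 1 = 2376, and 2376 / 3 - 1 = 791.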
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0514673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCCATA CAGCAACCCT GATTTCCGTC GACACCCTGA CCGCCGCCCA GGCGGGTGCG GTCCAGGCAA TCCCCCCGGC CTTCCCGCTT TCGGCCACCG TGGCGGTCAA CCCGTTCCTC GGGCAGGCGG GCCACTCCCT GCCGGACACC GCAGCAGTCC TGGGCAAGAC CGCCGGCTGC GCCACCACCG CCCCGCGGAG CTGGTTCGCC GCCGAGATCG CGGCGGGCCG GATCACCAAA CCGGCCGTGG CCGAAGCGCT CGCCGCCGCG GGGCTCGACT GGCCCGTGGA CAAGGTGATC TCGGCGGCCA GCCGCCAGCG CCCGGCGCCG CAGGCCCTGC CCACGGTCGC CGATCTCGCC GGTGCCTCCG AGGCCCCCGG CTGGCCCGCC CAGGCAGAGG CACGGATCGC GGCCTGGGTG GCCGGGCATT TCGATCAGGG CCAGGCGCTC TGGGCCGCTG GTGCGCCCTC CGGCACCTAT GCCGACTGGC TGGCCTTCGC CACCACCGAC ATGACCCCCG ATCTCGCCGG GCTTCCGGGC TTCCGGGCCT GGCTCAAGGC CCTGCCGTCC GACCCGACCG AGGCGCTTTT GGCCGCGGTC AACACGCTGG GGCTGACCGA GGCGGCGCTG CCGCTCTACT TCCACCGGCT TGCCATGTCG CTGGGCGGCT GGGCGCAGGC CGCGCGCTAC CGGCTCTGGC AGGCGGAGCT GGCGGGCCAG ACCGACACCA CGCTGGCCGA GCTGATCGTA ATCCGCGCGG TCTGGGATGC CGGCACGCTG GCCACAAGGC CCGCCCTGGC CGCACAGTGG GACACCGCGC GCGCCGCCTT CGCCGCCCCG GTCACGCCCA GCGAGGATGA CCTGATCGAC GCCGTCCTGC AGGACGCGGC GGAGCGGAGC ACCCAGGCCG ATCTCGCCCA GGCCTTCGCC CCCGTGGCCA AGGCCGAGGC CCGGCCCGCG CTGCAGGCGG CCTTCTGCAT CGACGTGCGC TCCGAGGTGA TCCGCCGGGC GCTGGAGACC TGCGATCCGG GCATCGAAAC CCTCGGCTTT GCGGGCTTCT TTGGCCTCAC TGCCGCCCAC ACCCCCACCG GGTCCTGCAA TTCCGAGGCG CGGCTGCCGG TCCTTCTGAC CGCCGGGGTG ACGAGCAAGG CGAGCGGCGA CCACGACGCC GCCCGGATCA CCACCCGCGT CACCCGCGCC TGGGGCCGGT TCCGGCAGGC GGCGGTGTCC TCCTTCGCCT TCGTCGAGGC GGCGGGCCCG TTCTATGCGG GCAAGCTGGT GCGCGACACG CTGGGCCTGG GCAAGGCCGA CGCGATCCCG GGCAAGCCGG TCTTCGACCC GCCCCTGCCG GAGGAGGCGC AGATCGACGC GGCGGCCACG ATCCTGAACG CCATGTCGCT GAAATCCAAC TTCGCGCCGC TGGTGGTGAT CGCGGGCCAT GGCAGCCATG TGAACAACAA CGCCCATGCC AGCGCGCTGC AATGCGGGGC CTGTGGCGGC TATGGCGGCG ACGTCAACGC CCGGCTTCTG GCCGACCTGC TGAACCAGCC CCATGTGCGG GCCGGTCTGG CCGCCAGGGG CATCGCGGTG CCCGAGGACA CGATCTTCGT CGCGGCCCTG CACGACACCG CGCAGGACGC GATCACGCTC TATGCCGATG ACCTGTCCGA GGCCCACCGG GCCGCGGCCA CCGCGTCGCT GGCGCAGGCC CGGCAGTGGT GCGCCGAGGC CGGGCGGCTC GCCCGGTCCG AGCGGCAGCC GAGCCTGCCG GGCGCGACCG AACGCGACGG CATCGCCGCC 
CGCGCCCAGA GCTGGGCCGA AACCCGGCCC GAATGGGGGC TGGCGGGCTG CAAGGCCTTC GTCGTCGCCC CGCGCACCCA GACCGCGCCC GCGCAGCTTG ACGGGCGGGT CTTCCTGCAC AGCTACGACT GGGCGCAGGA CGAGGGCTTC GGGGTGCTGG AGCTGATCCT GACCGCGCCT GTGGTGGTCG CGAGCTGGAT CAGCCTCCAG TATTACGGCT CCGTGGTGGC GCCCGAGGTG TTCGGCGGCG GCTCCAAGCA GGTCCATAAC GTGACCGGCG GGATGGGCGT ACTCGACGGC GGCACCGGGG CGCTGCGGAT CGGCCTGCCG ATCCAGTCGG TCCATGACGG CGGCAGCTTC GTGCATGACC CGCTGCGCCT GACCATCGTG GTCAATGCCC CGCAGGAGGC GATCACCGAC ATCCTCGCGC GCCATGACGG GGTGCGGGCG CTGTTCGACA ACGGCTGGCT GAAGCTTCTG CGGCTCGAAG CGGATGGCAC CATCTCGGAG CGCTATACCG GCGACCTGAC ATGGGAGGCC TTCGCGCCGG GCACCGAGGC TGCCCAGGCC GCCTGA
Protein sequence | MTHTATLISV DTLTAAQAGA VQAIPPAFPL SATVAVNPFL GQAGHSLPDT AAVLGKTAGC ATTAPRSWFA AEIAAGRITK PAVAEALAAA GLDWPVDKVI SAASRQRPAP QALPTVADLA GASEAPGWPA QAEARIAAWV AGHFDQGQAL WAAGAPSGTY ADWLAFATTD MTPDLAGLPG FRAWLKALPS DPTEALLAAV NTLGLTEAAL PLYFHRLAMS LGGWAQAARY RLWQAELAGQ TDTTLAELIV IRAVWDAGTL ATRPALAAQW DTARAAFAAP VTPSEDDLID AVLQDAAERS TQADLAQAFA PVAKAEARPA LQAAFCIDVR SEVIRRALET CDPGIETLGF AGFFGLTAAH TPTGSCNSEA RLPVLLTAGV TSKASGDHDA ARITTRVTRA WGRFRQAAVS SFAFVEAAGP FYAGKLVRDT LGLGKADAIP GKPVFDPPLP EEAQIDAAAT ILNAMSLKSN FAPLVVIAGH GSHVNNNAHA SALQCGACGG YGGDVNARLL ADLLNQPHVR AGLAARGIAV PEDTIFVAAL HDTAQDAITL YADDLSEAHR AAATASLAQA RQWCAEAGRL ARSERQPSLP GATERDGIAA RAQSWAETRP EWGLAGCKAF VVAPRTQTAP AQLDGRVFLH SYDWAQDEGF GVLELILTAP VVVASWISLQ YYGSVVAPEV FGGGSKQVHN VTGGMGVLDG GTGALRIGLP IQSVHDGGSF VHDPLRLTIV VNAPQEAITD ILARHDGVRA LFDNGWLKLL RLEADGTISE RYTGDLTWEA FAPGTEAAQA A
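The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they relate, the code below strips that whitespace, computes the GC fraction, and translates codons to amino acids. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs only in its permitted start codons). The helper names are hypothetical, and only the first 30 bases of the record are used as sample input.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
FIRST_30_BASES = "ATGACCCATA CAGCAACCCT GATTTCCGTC"
print(translate(FIRST_30_BASES))   # MTHTATLISV
print(gc_content(FIRST_30_BASES))  # 0.5
```

The translated prefix matches the first block of the protein sequence (MTHTATLISV). Passing the full gene sequence to `gc_content` should give a value comparable to the reported GC content of 73%; that figure is taken from the record and is not recomputed here.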