Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3242 |
Symbol | |
ID | 5712299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3410436 |
End bp | 3411863 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641269170 |
Product | protein of unknown function DUF404 |
Protein accession | YP_001534576 |
Protein GI | 159045782 |
COG category | [S] Function unknown |
COG ID | [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.657698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAA AGCAGCCCTT TGACGAGATG ACGGGCATGG GGGACGGCGC GATCCGGTCC CCGTATGCAG ATTTTGACAC CTGGTTCAGC GGCGAGGACC CCGCCCGCCT GCGCAAGAAG GCCCGCGAAG CCGAAGACGT TTTCCGGCTG ACGGGGATCA CCTTCAACGT CTATGGGCGC GAAGAGGCGG CCGAACGGCT CATTCCCTTC GACATCGTGC CGCGCATCAT CTCGGCCCGG GAATGGACCC GGCTGGCCAA GGGGATCGAG CAGCGGGTGC GCGCGATCAA CGCCTTTCTG CACGACATCT ATCACCGGCA GGAGATCATC CGCGCGGGCC GCATACCCGC CGAGATGATC GCCAATAACG AGGCGTTCCT GCCCCAGATG ATCGGCATGA CGCCGCCGGG CAATGTCTAT ACCCATATCG TCGGCATCGA CCTGGTGCGC ACCGGCGAGG ACGAGTTCTA CGTGCTGGAG GACAATGCGC GTACGCCTTC GGGGGTCTCC TACATGCTGG AGAACCGGGA GACGATGCTG CAGATGTTCC CGGAGCTGTT CGCGCGCAAC CGGGTGCGGT CGGTCAGCGA GTACCCGCAA AACCTGCGCC GGTCGTTGAG CGACTGTTTC CCGCCGGCCT GTACCGGCAA GCCGGTGGTG GGGGTTCTGA CCCCGGGGAT CCACAACTCG GCCTATTACG AGCATGCGTT TCTCGCCGAC AAGATGGGCG CGGCACTGGT GGAGGGGCAT GACCTGAAGG TGGTGGACGG GCGCGTGGCG ATGCGGACCA CCCGCGGCTT CACCCCGATC GACGTGCTCT ACCGGCGGGT GGATGACGAT TTCCTCGACC CGATGAATTT CAGGCCCGAG AGCCTGCTGG GCGTGCCGGG TATCATGGAT GTCTACCGCG CGGGCGGGAT CACCATCGCC AACGCCCCGG GCACCGGGAT CGCGGACGAC AAGGCGATCT ATTCCTACAT GCCGGAGATC GTCGAGTTCT ATACCGGCGA GCAGGCGATC CTGAAGAACG TGCCGACCCA TCGCTGCAAC GACCCCGACA CGCTGGCCTA TGTTCTGGAC AATCTGGCCG ACTTGGTGGT CAAGGAGGTG CATGGCTCGG GCGGCTACGG GATGCTGGTG GGGCCTGCGG CCTCGAAAAA GGAGATCGCC GCCTTCCGCG AGAAGCTGAT CGCCAAGCCC GACAGCTATA TCGCCCAGCC GACGCTGAGC CTGAGCACGG TGCCGATTTT CGCGCGTTCG GGGCTGGCGC CGCGGCATGT GGATTTGCGG CCCTTCGTGC TGGTCTCGCC AAAGAAGATC CATATCACGC CCGGCGGGCT GACGCGGGTG GCGTTGCAGA AGGGGTCGCT GGTGGTCAAT TCGAGCCAGG GAGGCGGCAC CAAGGACACC TGGGTGCTGG AGGAGTAG
|
Protein sequence | MKIKQPFDEM TGMGDGAIRS PYADFDTWFS GEDPARLRKK AREAEDVFRL TGITFNVYGR EEAAERLIPF DIVPRIISAR EWTRLAKGIE QRVRAINAFL HDIYHRQEII RAGRIPAEMI ANNEAFLPQM IGMTPPGNVY THIVGIDLVR TGEDEFYVLE DNARTPSGVS YMLENRETML QMFPELFARN RVRSVSEYPQ NLRRSLSDCF PPACTGKPVV GVLTPGIHNS AYYEHAFLAD KMGAALVEGH DLKVVDGRVA MRTTRGFTPI DVLYRRVDDD FLDPMNFRPE SLLGVPGIMD VYRAGGITIA NAPGTGIADD KAIYSYMPEI VEFYTGEQAI LKNVPTHRCN DPDTLAYVLD NLADLVVKEV HGSGGYGMLV GPAASKKEIA AFREKLIAKP DSYIAQPTLS LSTVPIFARS GLAPRHVDLR PFVLVSPKKI HITPGGLTRV ALQKGSLVVN SSQGGGTKDT WVLEE
|
| |