Gene Information |
Locus tag | Dshi_3862 |
Symbol | |
ID | 5714391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | + |
Start bp | 70763 |
End bp | 72700 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641276775 |
Product | hypothetical protein |
Protein accession | YP_001542071 |
Protein GI | 159046400 |
COG category | |
COG ID | |
TIGRFAM ID | |
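The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information rows of this record.
START_BP = 70763
END_BP = 72700
GENE_LENGTH_BP = 1938
PROTEIN_LENGTH_AA = 645


def span_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS with one trailing stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record: 72700 - 70763 + 1 = 1938 and 1938 / 3 - 1 = 645.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.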
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.242184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCCT ATATCATGAC CGCCGCGCAA CAGGCGCCGG ATGGTGATGC CAAGGACCTG AAACAGGCTC TTCAAAAGGC TTTTGAGAGG CATGCGGGCC GTCTTCCCGC TGATGGATCA GAGGTGATGT CCTCGATCAA CGCCTCGGTG GCGGTCCGGC TCTTTGCCTT GATGTACATG CGCGGTGTGC AGCATCGCCT GATGCCGGAT AACGAGAGTG CGGTCCTGTG CGAGGCTTTG TGCAAGACGG TCTGCGAGGT CGAGATCACC GATAAGTTCG ACGACATGTT CGTGTCGAAC TTCGTGCTGG GGCTGATCGT GTTTCTCGAT CTTGTGCAGG ATGATTTGCA CCCTGACACG CGGGCCGGCC TGGTTGCTAA GATCGCCGAA TGCCGGGACT GGTTGTCCGA GGCGCGTCAT CGCAAGGTAT TCGGCACGCG CGAGACCGAG GGCACCTATG CCTGGAATCA CTCCGCCTGC GCGGCGGCGG GCCTGGCGCT GAGCGTGATC TGGACCCGGG ATAGCCAGGC GGACTGGACC GACACAGACT TTCACGATGT GGATTTCGGG CTGCGCCGGA TCGAAGACTA TTTCCTGCAC GGCATCCGCG AAACAGGGGT CCCCTATGAG GGGTTCTATT ACTGTGGCGC GGTGTTTCGG GTGCTTGGCC CCTTCGACAT TCTGGTGCGC AAGGACGCCG AGGTAGAACG CCGCTATCGG CGCATCCGGG ATCGTCACAA GCGTAAGCTT GGCCAGTTGC TGGACTGGTA CGAGAGTGGC ACCATCGTCA ACCAACGCGC GCTGCTCAGC TACAATCATT CGCTTTATGA CGCTCACCCG GCGGTGAACG GTTTTCTGAC CTTCTTTCGG TCCGAGTTCG AGGTAAAGGC GGGCCGCATG TGGTCTCGGC TGATGGCGAA AGGCACGTCC CTGCAATTCG TCGAGCGCAG CCGGGACTGG GGCGACAACA CTCTGCACGA GGCTTTGTTG TTCCTGCATC CCAAAGGCTA TTCAGCACCG CGTAACAAGG TGCAGACCCT TCTGTCGCGG ACCGAAGGCT ATGGGCTGCT GGTGTCGGAG GACGCCTCCA GCCGACTGTT CGTGAAGGCC AGCAAACTGC TGATCGGGCC GCATAACCAG TCTGATGCGG GCCATGTCAG TTGGGTCTGT AATGGTGATG CGGTGTTGAT CGACGCCGGG CCGGGCCGCA AGGTGCGCGA TGCGTCGAAG AAGTGGGCGG AGTATTCCAA GGGCACCTAC CGGACCGAAG GCAGCGGCGC TTCGTCCTAT GGCCATAACG CCGTGCTGAT CGACGGGCGG GGGCAGTTGC CCTCTGGCGA GGGGGACGGC ATCGAGGGGC GCCTAAGCTA TGTCCGTCAA ACCGAAGATT TCTGGTTGCT CGGGACCGAT GCCAGGGCGG CCTACAATAA GGATGAGTAT AACCCCGTCC AGGTGGCCGA TAGGCATGTG GCGTTTTCCA AGGCGGCGGG CGCCTATCTG ATCCTGATCG ACCGGGTGGT ACCGCAGGCG CCGGGGACCC ATCGGTTCCA GCGGCTTCTG CAGTTTGCAG ACCCCGCGCA GGTGGTTGAG GAGGATGGTC CCGGCCGAAT GGCGGTGACC AGCGGGGGCA CGGTCTATGA CCTGTGGACC CTGAGCCCGA CCGGGCCCCT GAGCACGGTC TACGAGGAAG AGAAATTCCA GATGCCGATC AAGACCCGGG GCGTGCTGGC CCATGGGGTC GAGGCCGAGG ACCTGTGGAT GTATACGGTC CTCGCCGCCC GCGGGAGTGC CGGGGTGCCG 
ACGGATGTGT CCCTGCGCCC GGCCGAGGAC GCGGCCTTCG GGGCCGCGCT GCAGCTGGTG CTGGAGGGGG GCGACACCCG GCTCCTGGCC CTGTCGCGGA CGACGGGGGA GCTTGAGCGG ATTTCCGACG ACGGCTGA
|
Protein sequence | MTSYIMTAAQ QAPDGDAKDL KQALQKAFER HAGRLPADGS EVMSSINASV AVRLFALMYM RGVQHRLMPD NESAVLCEAL CKTVCEVEIT DKFDDMFVSN FVLGLIVFLD LVQDDLHPDT RAGLVAKIAE CRDWLSEARH RKVFGTRETE GTYAWNHSAC AAAGLALSVI WTRDSQADWT DTDFHDVDFG LRRIEDYFLH GIRETGVPYE GFYYCGAVFR VLGPFDILVR KDAEVERRYR RIRDRHKRKL GQLLDWYESG TIVNQRALLS YNHSLYDAHP AVNGFLTFFR SEFEVKAGRM WSRLMAKGTS LQFVERSRDW GDNTLHEALL FLHPKGYSAP RNKVQTLLSR TEGYGLLVSE DASSRLFVKA SKLLIGPHNQ SDAGHVSWVC NGDAVLIDAG PGRKVRDASK KWAEYSKGTY RTEGSGASSY GHNAVLIDGR GQLPSGEGDG IEGRLSYVRQ TEDFWLLGTD ARAAYNKDEY NPVQVADRHV AFSKAAGAYL ILIDRVVPQA PGTHRFQRLL QFADPAQVVE EDGPGRMAVT SGGTVYDLWT LSPTGPLSTV YEEEKFQMPI KTRGVLAHGV EAEDLWMYTV LAARGSAGVP TDVSLRPAED AAFGAALQLV LEGGDTRLLA LSRTTGELER ISDDG
|
| |
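The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a rough sketch of how the record could be checked: it strips the whitespace, computes a GC fraction, and translates the CDS. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as starts). All function names are hypothetical.

```python
from itertools import product

# Standard codon-to-residue assignments in TCAG order; '*' marks a stop codon.
_BASES = "TCAG"
_RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(_BASES, repeat=3), _RESIDUES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied from the record above.
    prefix = clean_sequence("ATGACATCCT ATATCATGAC CGCCGCGCAA")
    print(translate(prefix))  # MTSYIMTAAQ, the first ten residues of the protein
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence followed by a `*` for the terminal TGA, and `gc_fraction` can be compared with the reported GC content.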