Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1821 |
Symbol | |
ID | 5712812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1898926 |
End bp | 1900176 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641267744 |
Product | hypothetical protein |
Protein accession | YP_001533164 |
Protein GI | 159044370 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.117561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.653656 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATGAAG CAGGAGCGAC CGGCGTGGCG GATTTTCATC AGAACGGGAA CATCACGACG CTGCACAATC TGCGGACGCG TTCGGTCGAG GAGCTGACGC AGGAGCTCAA GGCCTATTCC GTCAGCCGCA AGATCAGCCT GATCCTGCCC TCGCTCTACT CGGAGTTGGA AGGGCCCGCC CTGCCCCGGA TCCTGGATGA GCTGAGCGAG GTGCCCTATC TGCACCGGAT CATCATCGGG CTCGACCGGG CCGACGAGGC GCAGTACCGC CATGCGCGGG ATTTCTTCGG GCGGCTGCCG CAGGACCATA TCGTCATCTG GAACGACAGC CCCCGGATGA CGGGGTTGGG CCGGCGGCTG GAGACCATGG GGCTGGCGCC GCAGGAGGCC GGCAAGGGCA AGAACGTCTG GTCCTGCCTG GGCTACCTGA TGGCGTGTCA GGACAGCGCG GTCATGGCGA TCCACGATTG CGACATCCTG ACCTATGACC GGGACATGCT GGCGCGGCTG GTCTATCCGG TGGTGAACCC GAACTTCCCC TACCAGGTCG CCAAGGGCTA CTACCCGCGG ATCGGCGAGA ACAGGATCAA CGGGCGGGTC ACGCGGCTGC TGGTCAGCCC GCTGCTGATC GCGCTGAAAC GGGTGATCGG GGATCGCGAC TATATCGATT ACCTGCGCAG CTTCCGCTAT CCCCTGTCGG GGGAGTTCGC CATGCGCACG GGCATCCTGC CGGACCTGCG CATCCCGTCG GACTGGGGGC TGGAGATCGG GGTTCTGTCG GAGGCCTGGC GCAACCTCGC GCCCAAGGCG GTCTGCCAGG TGGAGATCAG CGACGCCTAC GACCACAAGC ACCAGGATCT GAGCGAGGAT GACGCCTCGG CGGGGCTGAG CCGGATGTCC ACGGATATCT GCAAGTCGAT CTTCCGCAAG CTGGCGATGG ATGGCACGGT GTTCACCACC CATGTGTTCC GCACCCTGAA GGCGACCTAT TACCGCTCGG CGCTCGACCT TTTGGAGGCG TATTACTCGG ACGCGATGAT GAACGGGCTG GCCATCGACC GGCACCGGGA GGAGCAGTCG ATCGAGCTCT TTGCGGAGAA CATCATGCGG GCGGGACAGA TCTTCCTCGA CAACCCGTCG GAGACGCCGT TCATCCCGAC CTGGAACCGG GTGCATGCGG CGGATCCGGA TTTCCTGAGC GATTTCCGCA GGGCCGCCGC CGAGGACGTG GCGGAATACG GGGCGGGCTG A
|
Protein sequence | MHEAGATGVA DFHQNGNITT LHNLRTRSVE ELTQELKAYS VSRKISLILP SLYSELEGPA LPRILDELSE VPYLHRIIIG LDRADEAQYR HARDFFGRLP QDHIVIWNDS PRMTGLGRRL ETMGLAPQEA GKGKNVWSCL GYLMACQDSA VMAIHDCDIL TYDRDMLARL VYPVVNPNFP YQVAKGYYPR IGENRINGRV TRLLVSPLLI ALKRVIGDRD YIDYLRSFRY PLSGEFAMRT GILPDLRIPS DWGLEIGVLS EAWRNLAPKA VCQVEISDAY DHKHQDLSED DASAGLSRMS TDICKSIFRK LAMDGTVFTT HVFRTLKATY YRSALDLLEA YYSDAMMNGL AIDRHREEQS IELFAENIMR AGQIFLDNPS ETPFIPTWNR VHAADPDFLS DFRRAAAEDV AEYGAG
|
| |