Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3838 |
Symbol | |
ID | 5714367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | - |
Start bp | 46686 |
End bp | 48080 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641276753 |
Product | hypothetical protein |
Protein accession | YP_001542049 |
Protein GI | 159046378 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGATC ATGCCCGAAA GGCCAGCCCG GAAGGGGCGG TGGCCCCGCT CATGCCGGTG CGACAGCGGC GCGCGACCTT GGCCGCGACC CCGTTGCAGG CCGTGATGCA GGCGAGCCCG CGGGTGACAC AGCTTCTGGC GATGCAACAG GGGATGGCCG CGCGGGCGGT GCAGCGGATG GCTGTCGAGG ACGAGGCGCC TTTGCAGGGC AAGGGTCTGG AGGATGAGGA GCCGCTTCAG GCCGCCGGGC TGGAAGATGA AGAGCCGCTG CAGGGCAGGG GTCTGGAGGA CGAAGAGCCG TTGCAGGGCA AGGCGAACGG CGCCGCGCCC GTGCAGGCGC AACCGGGTCC GGGGGGGAAT GGGGGCCTGC CGCCGCAGTT GCAGGCGGGC GTCGAGGCGT TGTCGGGCCA CTCCATGGCT GATGTGCGGG TGCATCGCAA TTCCCCGCAG CCCGCGCAGG TCGGCGCGCT CGCCTACGCC CAGGGCCGTG ATATCCACCT GGGCCCGGGA CAGGAGAAAC ATCTGCCCCA CGAAGCCTGG CACGTGGTCC AGCAGGCCCA GGGCCGGGTG CGGCCCACGA TGCAGGCCAA GGGCGTGGCG ATCAACGACG ACGCGGGGCT GGAGGCCGAG GCGGACCGGA TGGGGGCGCA GGCTCTTCAC CAATCGAAAT TCACGCCGAC CGAGGTCATG AAGACCGGGG CGCCGATGCA GGCCCGCGCC GTTCTTCAGC GCAAGAGCAA ATCCGATATT ACCGACGCAA TCATGGATCA ACACAGTCAA ACGGTTGCGA TGTTTCCCGG GAATGCGAAC ACGCCTATTA CGAGGGTTTT CGGCCTTCGC GAGCTGGCAG CCGCGATTGT CCACAGAGGC TATAGCGGGA CCCAAAAAGA CTTCGCGGCT TTGATGAACG AGATGGAGAC AAACGCTTGG TGCGTTGCGG CCTGGGTTCA CAAGGGGGGT CTTGGTGGCG CCGCGGGTGG CGCGGACCCT GAACCCCACA TCACGCTTAT GGTCGGCGGC ACGGGCTACC ATGTCAGGGT AAAGACGGGC AGCAAACTGG TCCTGAAGAA CGCTAAAGGC GGCATGCTGC CCCAAGGCGG CGCCCAGGTT GACAACTCGG TTGATCCTTA TGCGGATTTT GCGGCCACGG ACCTGACAAC CAAGGGTAAA GAGAAAGCGC TCAAATACCG AAACAAAAAC GGAGGGAGTG TGCCCCAGGC GGTCGCATTT GTAGCAAGCG GTTGTCCCGA TGAAGGCCGA TACTCAAAGG GAACGGAAAC TCTCAGGTAT CGGGCCGATA CCGGGTATTT TGCCATATAT GACTCCGCTG GTGGCGGCGG CTGGAAAAAA GACGTCAAGG CCAGCAAGAC TGCACGGAGG CTCTACTTCA AGTAA
|
Protein sequence | MPDHARKASP EGAVAPLMPV RQRRATLAAT PLQAVMQASP RVTQLLAMQQ GMAARAVQRM AVEDEAPLQG KGLEDEEPLQ AAGLEDEEPL QGRGLEDEEP LQGKANGAAP VQAQPGPGGN GGLPPQLQAG VEALSGHSMA DVRVHRNSPQ PAQVGALAYA QGRDIHLGPG QEKHLPHEAW HVVQQAQGRV RPTMQAKGVA INDDAGLEAE ADRMGAQALH QSKFTPTEVM KTGAPMQARA VLQRKSKSDI TDAIMDQHSQ TVAMFPGNAN TPITRVFGLR ELAAAIVHRG YSGTQKDFAA LMNEMETNAW CVAAWVHKGG LGGAAGGADP EPHITLMVGG TGYHVRVKTG SKLVLKNAKG GMLPQGGAQV DNSVDPYADF AATDLTTKGK EKALKYRNKN GGSVPQAVAF VASGCPDEGR YSKGTETLRY RADTGYFAIY DSAGGGGWKK DVKASKTARR LYFK
|
| |