Gene Information |
Locus tag | Dshi_3581 |
Symbol | |
ID | 5713812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3772212 |
End bp | 3773648 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641269510 |
Product | hypothetical protein |
Protein accession | YP_001534915 |
Protein GI | 159046121 |
COG category | |
COG ID | |
TIGRFAM ID | |
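
The coordinate and length fields above are mutually consistent, and a short sketch can check this. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3772212, 3773648)
print(length, protein_length(length))  # prints: 1437 478
```

Both values agree with the Gene Length and Protein Length rows of this record.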
| |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGATC GCTTCGACTA TTTCGTGATC TTCGCCGGGA TGCGCACCGG TTCCAACTAC CTGGAGCGGA ACCTCAACGC CGCGCCGGAC TTGCGTTGCT ACGGCGAGCT CTACAACCCC TATTTCATCG CCCATGAGGG GCAGGAGTCG TTCCTCGGCC TCACCCTCGC GCAGCGCGAG GCCGACCCGG ACGCCCTGAT CGCGCGCATC CGGGAAGAAA CCGTGGGCCT GCCGGGGTTT CGGCTGTTTC ACGATCATGA CCAGCGGGTG CGCGACCGGG CACTTGCCGA CCCGCGCGCG GGCAAGATCA TCCTGACGCG CAATCCACTG GACAGCTATG TCTCCTATTG CCGGGCGCTG GCGTCGAACC AGTGGGTGCT CACGGACGCC AAGGGCCAGA TCGACACGCC CGCCATCGAT TTCGACGGGC CGGGATTTGC GACTTTCCTG GCGGATCAGA CCGATTACTT CACCGCCGTG CGGCAAGGGC TGGCCCGCGC CGGGCAGACC GCGCTGACCC TGCGCTACGA GGATCTGCAG CAGATCGAGG TGATCAATGG CGCGCTGGCC TATCTGGGCT CGCCTCACCG GCTTGCACGC CCCGAACGCA CACTGAAACG CCAGAACCCC GAACCGTTGG AGGGGAAGGT GACGGATATG GCGGTCCTGC GCGCGGCGGT GGCGGGGCTC GACCCGTTCG GGATCGACCA TCTGCCCCCC TCCGCGCCGC CGCGCGGACC CGCGGTGCCC AGCTATGTCG CCTGTCCCGA CACGGGGCTG CTCTATCTGC CCATGGCGGG GCCGGAGCCC GACCCGGTGC TGGCCTGGAT GGCCGCGCTG GACGGGGTGC CCAAGGACGC ACTGATCACG GGTATGCGCC AGAAAGACCT GCGCGCCTGG CGCCAGGCGC ATCCGAACGC ACGCTGTTTC ACGGTGCTGC GCCATCCCGT CGCCCGCGCC CACACGGCCT TTGTCACCCG TATCCTGCCC AGCGGATTGC AGGCCTATGC CGAGTTGCGC CATGGGTTGA TCGCCGCCTA CAAGCTGCCC CTGCCGGAGC ATTTCCCGGC CGACGAGGTG GACCCGGACC GGATCGGCAA GGCCTTCCTG CGATTTCTGA AGTTTCTCAA GCCCAACCTG GCGGCCCAGA CCAGCCTGCG GATCGACCCG TCCTGGGCGG CGCAGATGAC GGTGCTGCAG GGCATGGCCC AGGTCGCGAT GCCGGATGTG ATCCTGCGCG ATGGTCCGAC GCTGCAGACG GATCTCGCGG GTCTCGCCGC GCGCGCCGGG CGCCCGGCAC CGGACGTGCC GGTGGTTTTC GAGACTGGCA ATCTCGGCGC GCTATATACA CCGGACATGG AAAAAGCCGC GCAGGCGGCT TACGGTGTGG ATTACGCCAG TTTCGGGTTC GGCCCCTGGA CGCCGACAAA GGCCTGA
|
Protein sequence | MPDRFDYFVI FAGMRTGSNY LERNLNAAPD LRCYGELYNP YFIAHEGQES FLGLTLAQRE ADPDALIARI REETVGLPGF RLFHDHDQRV RDRALADPRA GKIILTRNPL DSYVSYCRAL ASNQWVLTDA KGQIDTPAID FDGPGFATFL ADQTDYFTAV RQGLARAGQT ALTLRYEDLQ QIEVINGALA YLGSPHRLAR PERTLKRQNP EPLEGKVTDM AVLRAAVAGL DPFGIDHLPP SAPPRGPAVP SYVACPDTGL LYLPMAGPEP DPVLAWMAAL DGVPKDALIT GMRQKDLRAW RQAHPNARCF TVLRHPVARA HTAFVTRILP SGLQAYAELR HGLIAAYKLP LPEHFPADEV DPDRIGKAFL RFLKFLKPNL AAQTSLRIDP SWAAQMTVLQ GMAQVAMPDV ILRDGPTLQT DLAGLAARAG RPAPDVPVVF ETGNLGALYT PDMEKAAQAA YGVDYASFGF GPWTPTKA
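
The relation between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the amino-acid string used by NCBI translation table 11, whose codon assignments match the standard code. Only the first 30 bases and the last 12 bases of the gene sequence are used here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the spaces in the record's sequence are assumed to be display grouping only.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "ATGCCCGATC GCTTCGACTA TTTCGTGATC"  # start of the gene sequence
last_12 = "ACAAAGGCCTGA"                      # end of the gene sequence

print(translate(first_30))  # prints: MPDRFDYFVI
print(translate(last_12))   # prints: TKA
```

The two fragments reproduce the first ten and last three residues of the listed protein. Applying `gc_fraction` to the full gene sequence would be one way to check the GC content field; the 30-base fragment alone is too short to be representative.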