Gene Information |
Locus tag | Dshi_2542 |
Symbol | |
ID | 5713439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2698518 |
End bp | 2699444 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641268465 |
Product | hypothetical protein |
Protein accession | YP_001533876 |
Protein GI | 159045082 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
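The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2698518, 2699444)
print(length_bp)                  # 927
print(protein_length(length_bp))  # 308
```

Both values agree with the Gene Length and Protein Length fields of this record.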
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.548938 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTG CGACGCTGGC CGTGGGGGGT AACCCCATGG CGGTAGTTTT ACTATCGAAT GGTGCGGCGC TGAACTATGC CGCCTGCGCC TGGGATGTGC CCGGGGAGGC TGGCGATGCC CTACGCACCG GCACGTTGCA GGCGCTCATC GACGCGGGCC CGGACGCACT GGACGCCTTG CGCGAAATCG TCGAGGCCGC CGAGAACGGG TCTTATCGCG ATGCGCGGAT CTCCCTCGAC AATCCGTTCC TGGCGCCACT GCACACGCCG CGCAAGAATA TCTTCTGCGT GGGCCGGAAC TATGCCGAAC ACATTGCTGA GGGCGAGCGC GCCCAGAACG CAAAGATCGG CATCACCGAG CATCCGGTCT ATTTCACAAA GCCTGCCACG GCAGTCGTGG GTCATGGTGG AGATGTTCTG ATTTTCCCGT CCGTGTCGGA AAAGATCGAC TACGAGGTCG AACTGGCCGT GGTGATCGGA ACCACTGGTC GCGATATCCC AAAGGACCGC GCCTTCGCGC ATGTGTTCGG CTACACCATC CTCAACGACA TTACCGCACG CGATGTTCAG CGCCGCCATG GCGGGCAGTA TTTCAAGGGC AAGTCCCTGG ACGGCTCCTG CCCGATCGGT CCCTGGATCG TTACGGCGGA CGAAATCACG GACCCGCAGG ACCTGTCGAT CAGCTTGTCG GTCAATGGCG AGCTGCGCCA GAACGGCTGG ACCCATGACA TGATCTTCGA CATTCCGACC CTGATCGCAT CGCTGTCCGA GGGCCTGACC CTGGAGCCGG GGGATATCAT CGCAACCGGC ACACCTTCGG GCGTGGGCTA CGCGATGGAC CCGCCGCAAT ACCTCAAGCC GGGCGACACG GTGATCTGCG ATATCGCAAA TATCGGACAA CTCAAGAACA CGGTGCGCGT TGCATGA
|
Protein sequence | MKLATLAVGG NPMAVVLLSN GAALNYAACA WDVPGEAGDA LRTGTLQALI DAGPDALDAL REIVEAAENG SYRDARISLD NPFLAPLHTP RKNIFCVGRN YAEHIAEGER AQNAKIGITE HPVYFTKPAT AVVGHGGDVL IFPSVSEKID YEVELAVVIG TTGRDIPKDR AFAHVFGYTI LNDITARDVQ RRHGGQYFKG KSLDGSCPIG PWIVTADEIT DPQDLSISLS VNGELRQNGW THDMIFDIPT LIASLSEGLT LEPGDIIATG TPSGVGYAMD PPQYLKPGDT VICDIANIGQ LKNTVRVA
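The Gene sequence and Protein sequence fields can be related by translating the nucleotide string codon by codon. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). Whitespace is stripped first because the sequences are displayed in blocks of ten characters.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 20 bases of the gene sequence in this record.
print(translate("ATGAAACTTG CGACGCTGGC"))  # MKLATL
```

Passing the full Gene sequence value to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields of the record.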