Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2027 |
Symbol | |
ID | 5713022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2147043 |
End bp | 2148059 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641267951 |
Product | putative allophanate hydrolase subunit 2 |
Protein accession | YP_001533367 |
Protein GI | 159044573 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.124744 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGC GCCTGGAGAT TTGCGCCGCC GGTCCCGGCC TGACCGTGCA GGATGCCGGG TTCCTGGGCT ATATCGGGCA GGGTCTGTCT CGGGGCGGGG CGGCGGACAC CCGGGCCCTG GCCGAAGGCG CCGCATTGCT GCGCCAGTCC CCCGACCTCG CCGCGATCGA GATGGCCGGC AGTGGTGGGA CCTTCCGGGT CACCCGCGAC GCCCGGCTCG CGCTGACCGG CGCGCCGATG CAGGCGACGC TCGATGGCGC GCCGCTCGCC TGGCACGCCT GTCACGCGTG GCCCGCCGGG GCCGAACTGC GCATCGGCGC CGTGCAGGGG GGCAGCTATG GCTACCTGCA TGTGGGCGGC GGGATCGACA CGCCTGTGGA GCTCGGCTCA CGCTCCACCC ACCTCACCGC CGGGCTGGGC CGGGCGCTGG CGGCGGGCGA CAGCCTGCCC CTGGGCCGCG ATCCCGGTGG CCCGGTGGGG CAGGGGCTGC AGGTCGAGGA TCGCTTCTGT GGCGGCGAGA TCCGCATCAT CCGCAGCTTC CAGAGCGACA GCTTCGCGCC CGAGGATGTC ACCCGCCTCT CCGAGACGCC CTTCACCCGC GACCCGCGCG GCAACCGGAT GGGTGTGCGT CTCGCCCACG CGGGCGACGG GTTCTTTGCC CGGGGCGGGC TCACGGTGCT GTCCGAGATC GTCGTGCCCG GCGACATCCA GGTCACCGGC GAGGGCGCGC CTTATATCCT CGGGGCCGAA AGCCAGACCA CCGGCGGCTA TCCCCGCATC GCCACCGTCA TCCCCTGCGA TCTGCCGCGC GCGATGCAGG CCGGGCCGGG CGCGCCGATC CGCCTCGCCC TCGTGGATCG CGCCACCGCC CTTGCGGCCG AGCGCGCCGA GGCCAAGCTG CTGCAGGCCT TGCCGAAACA GGTCCGCCCC CTCTTGCGCG ACCCGCGCGA GATGTCGGAC CTTCTGTCCT ACCAACTGAT CAGCGGCGTC ACCGCCGGGG AGGAGCCCTC GCCATGA
|
Protein sequence | MTARLEICAA GPGLTVQDAG FLGYIGQGLS RGGAADTRAL AEGAALLRQS PDLAAIEMAG SGGTFRVTRD ARLALTGAPM QATLDGAPLA WHACHAWPAG AELRIGAVQG GSYGYLHVGG GIDTPVELGS RSTHLTAGLG RALAAGDSLP LGRDPGGPVG QGLQVEDRFC GGEIRIIRSF QSDSFAPEDV TRLSETPFTR DPRGNRMGVR LAHAGDGFFA RGGLTVLSEI VVPGDIQVTG EGAPYILGAE SQTTGGYPRI ATVIPCDLPR AMQAGPGAPI RLALVDRATA LAAERAEAKL LQALPKQVRP LLRDPREMSD LLSYQLISGV TAGEEPSP
|
| |