Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1166 |
Symbol | |
ID | 4285565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1278795 |
End bp | 1279802 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638140646 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_756397 |
Protein GI | 114569717 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.413543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.128796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCG CAACGCTCAA ATCTGACAGC CGGGATGGCC GCCTGGTCGT CGTCTCAAAA GATCTCGCCT GGTGTGCGGA TGCCCGGCAG GTCGCGCCGA CCCTGCAGGC GGCCCTTGAT GACTGGGACC GGTGCGAGCC CGAGCTTCGC GCCCTGTCAG AAGAACTCGA GCGTGAAACC ATTCCGCGCG AGCGCTTCCA TGAACGCGAG GCCCTGTCGC CGTTGCCGCG TGCCTTCCAG TGGGCCGACG GGTCAGCCTA CGTCAACCAT GTCGAACTGG TCCGCAAGGC GCGCAATGCC GAGATGCCGG CCACCTTCTG GACCGACCCG CTGATGTATC AGGGCGGCTC CGACGCCTTC CTCGCCCCGC GCGCCGATAT CCCGCTCGGC GACACGGCCT GGGGCTGTGA CATGGAGGGC GAGGTCGCTG TCATCACCGG CGACGCGCCG GCCGGTTGTT CGGTCGAGGA CGCCGCCAAG ACCATCCGCC TGGTGATGCT GGTCAATGAC GTCTCCCTGC GCGGCCTGAT CCCCGGTGAG CTGGCCAAGG GCTTCGGCTT CTTCCAGGCC AAACCGCCGA GCGCCTTCTC GCCGGTCGCT GTCACGCCGG ACGAGCTGGG TGAGGCCTGG GATGGCAAGA AGCTGCACCT GCCGCTCTTG GTGAAATATA ATGGCGAGCT GTTCGGCAAG GCCGAGTGCG GCGTCGACAT GACCTTTGAT TTCGGCCAGC TGATCGCCCA CCTCGGCAAG ACGCGCCCGG TCACGGCCGG CACGATTGTC GGCTCCGGCA CAGTGTCCAA CAAGCTCGAT GATGGTCCCG GCAAGCCGAT CAGTGAAGGC GGTGTCGGCT ATTCCTGCAT CGCCGAGATC CGCATGATCG AAACGATCAA TGACGGCAAG CCAAGCACGC CTTTCATGCA ATATGGCGAC ACGGTCCGCA TCGAGATGAA GGACAAGGAC GGCAAGTCCA TCTTCGGCGC CATCGAGCAG GAAGTGGTCA AGGCCTGA
|
Protein sequence | MKLATLKSDS RDGRLVVVSK DLAWCADARQ VAPTLQAALD DWDRCEPELR ALSEELERET IPRERFHERE ALSPLPRAFQ WADGSAYVNH VELVRKARNA EMPATFWTDP LMYQGGSDAF LAPRADIPLG DTAWGCDMEG EVAVITGDAP AGCSVEDAAK TIRLVMLVND VSLRGLIPGE LAKGFGFFQA KPPSAFSPVA VTPDELGEAW DGKKLHLPLL VKYNGELFGK AECGVDMTFD FGQLIAHLGK TRPVTAGTIV GSGTVSNKLD DGPGKPISEG GVGYSCIAEI RMIETINDGK PSTPFMQYGD TVRIEMKDKD GKSIFGAIEQ EVVKA
|
| |