Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3877 |
Symbol | |
ID | 5077488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 47125 |
End bp | 48585 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640480986 |
Product | UbiD family decarboxylase |
Protein accession | YP_001165648 |
Protein GI | 146275487 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.358634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATGA ACGATCTCCC TAACCGCGCC CGCTCGATCT CGTCGCTGCG CGACTTCCTC GAACTGCTCG AGGATGCCGG CCAGGCGATC ACCTGGAGCG ATGCGGTGAT GCCCGAACCC GGCGTGCGCA ACATAGCCGT CGCCGCATCG CGCGATGCCA ACGGCGCGCC GGCGATCGTA TTCGACAATA TCACCGGTTA CCCCGGCAAG CGCTTGGCGG TGGGCGTCCA TGGTTCGTGG GACAACATCG CCCTGCTGCT GGGCCGACCT AAAGGCACGA CCATCCGCGA GCTTTTCTTC GAGATCGCCG GCCGCTGGGG CGATCAGGAA GCGCAAATCA GCTTTGTCCC AGAAGCCCAG GCCCCGGTGC ACGAATGCCG GATCGAACAG GACATCAACC TTTACGATGT CCTGCCGGTC TATCGGATCA ACGAATACGA TGGCGGGTTC TACATCGGCA AGGCCTCGGT CGCCTCGCGC GATCCGCTCG ATCCAGACAA TTTCGGCAAG CAGAATGTCG GCATCTATCG CCTGCAGATC CAGGGGCCGG ACACCTTCAC CCTGATGACG ATCCCCTCCC ACGACATGGG ACGTCAGATC ATGGCGGCCG AACGGGAAGG CGTTCCGCTA AAGATTGCGG TCATGCTGGG TAATCATCCC GGCCTTGCGG TGTTTGCTGC CACCCCGATC GGCTACGAGG AATCGGAATA TTCCTATGCC TCGGCGATGA TGGGCGCGCC AATCCGGCTG ACCAAATCGG GCAACGGGAT CGACATCCTG GCCGACAGCG AAATCGTGAT AGAGGCCGAA CTGCAACCGG GTGGACGCGA GCTGGAAGGG CCGTTCGGCG AATTCCCCGG TTCCTACAGC GGCGTGCGCA AGGCGCCGAT CTTCAAGGTC ACGGCGGTGT CGCACCGGCG CGATCCGATC TTCGAGAACA TTTACATCGG GCGCGGCTGG ACCGAGCACG ATACGCTGAT CGGCCTGCAC ACCTCCGCCC CGATCTATGC CCAGCTGCGC CAGAGCTTCC CCGAAGTCAC CGCGGTCAAC GCGCTTTACC AGCACGGACT GACCGGGATC ATCTCGGTCA AAAACCGCAT GGCCGGCTTT GCCAAGACGG TCGCGCTGCG CGCGCTGAGC ACGCCGCACG GCGTGATGTA CCTCAAGAAC CTGATTATGG TCGATGCCGA TGTCGATCCG TTCGATCTCA ACCAAGTGAT GTGGGCGCTT TCGACCCGCA CCCGTGCGGA CGATATCATC GTGCTGCCCA ACATGCCTGC CGTGCCGATC GATCCTTCGG CAGTGGTCCC GGGCAAGGGG CACCGCCTGA TCATCGACGC GACCAGCTAT CTCCCGCCCG ATCCGGTGGG TGAAGCGCAC CTTGTCACCC CGCCGACCGG GGACGAGATC GACGCCCTGA GCAAGCGGAT CCGCGAAATG CAGCTGGGAG CCCTGTCATG A
|
Protein sequence | MTMNDLPNRA RSISSLRDFL ELLEDAGQAI TWSDAVMPEP GVRNIAVAAS RDANGAPAIV FDNITGYPGK RLAVGVHGSW DNIALLLGRP KGTTIRELFF EIAGRWGDQE AQISFVPEAQ APVHECRIEQ DINLYDVLPV YRINEYDGGF YIGKASVASR DPLDPDNFGK QNVGIYRLQI QGPDTFTLMT IPSHDMGRQI MAAEREGVPL KIAVMLGNHP GLAVFAATPI GYEESEYSYA SAMMGAPIRL TKSGNGIDIL ADSEIVIEAE LQPGGRELEG PFGEFPGSYS GVRKAPIFKV TAVSHRRDPI FENIYIGRGW TEHDTLIGLH TSAPIYAQLR QSFPEVTAVN ALYQHGLTGI ISVKNRMAGF AKTVALRALS TPHGVMYLKN LIMVDADVDP FDLNQVMWAL STRTRADDII VLPNMPAVPI DPSAVVPGKG HRLIIDATSY LPPDPVGEAH LVTPPTGDEI DALSKRIREM QLGALS
|
| |