Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1330 |
Symbol | |
ID | 3917780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1372662 |
End bp | 1373597 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444068 |
Product | 2OG-Fe(II) oxygenase |
Protein accession | YP_496608 |
Protein GI | 87199351 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000246979 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCTCG AAACCCTGCC GGTGATCTCG CTCGCCGGCG AACCTGATGC GCTTTCCCGC GAACTGGGAG AATCCTTCAG GACCTTCGGC TTCGCCATGG TCCGCGACCA CGGCATCGAC CCCGACCTCA TCGCCCGCGC CTGGGACCTG ACCGCCCAAT TCTTCGCCCT GCCCGAAGCC GAGAAGCGCA GCTATTACCT TGATGGCCTT GCCGGCGCGC GGGGCTACAC GCCGTTCGGC ACGGAGATCG CCAAGGGAGC AAGGCTCCAC GATCTCAAGG AGTTCTGGCA CGTAGGGCGC GACCTGCCGG CCGGACATCC GCTGTCTGCC TCGATGCCGC CCAACGTGTG GCCCGCGCGC CCCGAAGGCT TCCGCCAGAC GTTCGAGACG CTCTACGGCG AATTCGACAA GGTCGGCGCA CGCATCCTCT CGCGCATCGC GGTCTGGCTC GGGCTGGACG AGAACTGGTT CGACCCGGCG ATCGAGGACG GCAACTCGGT CATGCGCCTG CTCCACTACC CTCCCGTCCC CGATGCCGAA TCCGGCGCCA TCCGCGCCGG CGCGCACGAG GACATCAACC TCATCACACT TCTCCTCGGC GCCGAGGAAG CGGGTCTGGA ACTGCTCAGC AGGCAGGGCG AATGGATTGG CGTTTCCCCG CCCGAGGGCG CGCTGGTGGT CAACATCGGC GACATGCTCC AGCGGCTCAC CAACCACGTC CTTCCATCCA CCACGCACCG CGTGCGCAAT CCGGAAGGGG AGCGGGCGCG GTTCAGCCGC TACTCGATGC CGTTCTTCCT GCACCTGAGA AGCGATTTCC CGTTCGTGAC GCTGCCGCAG TGCATCTCGG ACGAGAATCC CGACCGCTAC CCGGTCTCGA TCACCGCTGA CGATTACCTG CAGGAACGCC TGCGCGAGAT CGGCCTGCGC AAGTAA
|
Protein sequence | MPLETLPVIS LAGEPDALSR ELGESFRTFG FAMVRDHGID PDLIARAWDL TAQFFALPEA EKRSYYLDGL AGARGYTPFG TEIAKGARLH DLKEFWHVGR DLPAGHPLSA SMPPNVWPAR PEGFRQTFET LYGEFDKVGA RILSRIAVWL GLDENWFDPA IEDGNSVMRL LHYPPVPDAE SGAIRAGAHE DINLITLLLG AEEAGLELLS RQGEWIGVSP PEGALVVNIG DMLQRLTNHV LPSTTHRVRN PEGERARFSR YSMPFFLHLR SDFPFVTLPQ CISDENPDRY PVSITADDYL QERLREIGLR K
|
| |