Gene Information |
Locus tag | Saro_2021 |
Symbol | |
ID | 3917342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2154555 |
End bp | 2156090 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640444773 |
Product | anthranilate synthase, component I |
Protein accession | YP_497294 |
Protein GI | 87200037 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0405957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGA TCGACGCATC CGCCGTCTCG CTGCCGGAAA ACCACCGCGC CGCGCTTTCC CAGCTTTCCG CCGGAAAGCC GGCGCTGGTC TGGCGCAAGC TGATCGTCGA TACCGAGACT CCCGTCGGCG CCGCGCTCAA GCTGATGGAA AGCGGTCGCG GCGATTTCCT GCTCGAATCC GTGCAGGGCG GCGAAGTACG CGGGCGCTAC AGCCTGCTCG GGCTCGATCC CGATCTCGTC TTCCGCGCCA CCGGCTCGTC GGCAGAGATC AACCGCATCT GGCGGCACGA CAAGGCAGCC TTCGCACCGC TACCCGGCGA TGCCCTCGCC GAACTGCGCG CGCTCGTCGC CTCCTGCCGC ATCGACGTCC CGGCCGAGCT GCCCTCGGCG CTCGCCTGCC TCGTCGGCTA CTTCGGCTAC GAGACCATCG GCCTGGTCGA GAAGCTGCCC CGCGCACCGC AGAGCGAGCT TGTCCTGCCC GACATGCTGT TCACCCGCCC GACCGTGGTG CTGGTGTTCG ACCGCCTGTC CGACGAACTC TTCGCCATCG CCCCGGTCTG GGCCGAAGGC GGCGATCCGG CGCGCCTGCT CGAAGCCGCG GCGGAGCGCA TCGACAATGC CCTGCGCCGG CTGTCCGATC CGGTCCCCGC CGATGCGCGC CTTGCCGAAG CGGTCGACGT CACGCCGCAG CCAGTCATGG CCGCACCCGA CTATGCGCGT ATGGTGACTG CCGCCAAGGA CTACATCGAG GCGGGCGACA TCTTCCAGGT CGTCCTCGCC CAGCGCTTCA CCGCGCCCTT CCCGCTGCCG CCCATCGCGC TCTACCGTTC GCTGCGCCGC ATCAATCCCT CGCCGTTCCT CTACTTCCTC GACATGCCGG GCTTTGCGCT CACCGGCTCC TCGCCGGAAA TCCTGGTCCG CATCCGCGAC GGCGAAGTCA CGATCCGCCC GATTGCCGGC ACCCGCCCGC GCGGGCGCAC CGCCGAGGAA GACCGGGCCA ACGAAGAGAG CCTGCTGGCC GATCCCAAGG AACGCGCCGA ACACCTCATG CTGCTCGACC TCGGCCGCAA CGACGTCGGC CGCGTGGCCA GGGCCGGCAC CGTGAAAGTC ACCGAAAGCT ACACGGTCGA ACGCTACAGC CACGTGATGC ACATCGTCTC GAACGTGGTC GGCCAGCTCG ACACGAACCG CGCCGACAGC GTCGACGCCC TCTTCGCCGG GTTCCCCGCC GGCACAGTCT CGGGCGCACC CAAAGTCCGC GCCTGCGAGA TCATCGCCGA ACTCGAACCC GAGACGCGCG GCGCCTACGC TGGCGGTGTC GGCTATTTCG CGCCCGACGG CTCTGTCGAT AGCTGCATCG TCCTCAGGAC CGGCATCCTC AAGGACGGCG TCCTCCATGT CCAGGCTGGC GCCGGCATCG TCGCCGACAG CGACCCCGCC TACGAACAGC GCGAATGCGA AGCCAAGAGC GGCGCCCTCT TCGCCGCCGC GCGCGAAGCC GTCCGTGTCG CCACAGAACC GAAGTTTGGC CAATGA
|
Protein sequence | MTTIDASAVS LPENHRAALS QLSAGKPALV WRKLIVDTET PVGAALKLME SGRGDFLLES VQGGEVRGRY SLLGLDPDLV FRATGSSAEI NRIWRHDKAA FAPLPGDALA ELRALVASCR IDVPAELPSA LACLVGYFGY ETIGLVEKLP RAPQSELVLP DMLFTRPTVV LVFDRLSDEL FAIAPVWAEG GDPARLLEAA AERIDNALRR LSDPVPADAR LAEAVDVTPQ PVMAAPDYAR MVTAAKDYIE AGDIFQVVLA QRFTAPFPLP PIALYRSLRR INPSPFLYFL DMPGFALTGS SPEILVRIRD GEVTIRPIAG TRPRGRTAEE DRANEESLLA DPKERAEHLM LLDLGRNDVG RVARAGTVKV TESYTVERYS HVMHIVSNVV GQLDTNRADS VDALFAGFPA GTVSGAPKVR ACEIIAELEP ETRGAYAGGV GYFAPDGSVD SCIVLRTGIL KDGVLHVQAG AGIVADSDPA YEQRECEAKS GALFAAAREA VRVATEPKFG Q