Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0445 |
Symbol | |
ID | 3918313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 487393 |
End bp | 489180 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640443174 |
Product | para-aminobenzoate synthase, component I |
Protein accession | YP_495727 |
Protein GI | 87198470 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.59777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGTG CTCCCTTCAT CCTGCTCGAC GACGCCCGCA GCGAGGGCGC GAGTGATGCG CGCCTCTACG AGGACGCCGT CGAGATCGTC GTCGCGCGCC GCGCCGAGGA GGTCGAGGCC GCGCTTGAAC GGATCGCGGC TGTCCCCGGC AACTGGGCAG GCTACCTCGC CTACGAGGCG GGCCTCGCGC TCGAACCGCG TCTCTTGCCG CTGGCGGCGG TGCGAACCGG TGCTGACGGC CCGTTGGTGT GGTTCGCGCG CTTCGACCGC GTCACCCGGA TGCCGAGCGC GGAGGTCGAG CAGTGGCTGG TGCGCAATGC GCGCGGCCCC GGGCGCCTCG GTCCGCTCGG CCCTGCGCTG TCGTCGGGCG GCTATGCCCG CGCCTTCGAC CTGGTGCAGG AGGCGATCCG CGCGGGCGAC ATCTACCAGG CGAACCTGAC CTTCCCGCTG ACCGGCACCT GGGATGGCGA TCCGCTGTCG ATCTACGGCG AAGTGCGCCG CGATGCGCGC GCGGGCTATG GCGGCGTGAT ATGGGACGGC GCGCATTGGC ACCTCTCGTT CTCGCCCGAA CTCTTCTTCA AGCTTTCCGG CGGAGAAGCG CGGGTGCGCC CGATGAAGGG CACTGCCCCG CGCGGGGCAA CCCCCGAAGA GGACGCGCGC TTCAAGGCCG ACCTCTCCGC CAGTGTAAAA GACCGCGCCG AAAACCTGAT GATCGTCGAT CTCATGCGAA ACGATCTCGC GCGCGTTTCA CAGGCCGGGT CGGTCAAGGT CGAGGCGCCC TTCGCCATCG AATCCTATCC TACCGTCCAC CAGATGGTCA CCACCGTGCG CGCCAGGCTT GCCCCCGGCG AGGACGTGCG CAGCCTCCTG CGCGCGATCT TCCCCTGCGG TTCGATCACC GGAGCGCCCA AGATCCGCGC GATGGAGATC ATCCATGCCG CAGAACGGGA CGCGCGCGGG CTCTATTGCG GCTCGATCGG ACGGATCGAC CCCACTGGCG AGGCGGCGTT CAATGTGGCA ATCCGCACGC TGCGTCTTTC GAGCGATCAC GCCGGGGCCT CCGGGGGCCG CGCGGTCATG GGCGTCGGCT CTGCGGTCGT TGCCGATTCG GCCATGATCG GCGAATGGCG GGAATGCGTG GTGAAGGGGA ATTTCTTGCG TCTGTCCGCC GGCCATGCCG ACCTTATCGA GACCATGGCC TTCGATCCGG CCAAGGGTAT CGAACTGCTC GAACTGCACC TCGAACGCAT GAAGGCCAGT GCGCTTGCGC TAGGCTACAG CTTCGACCGG CACGCGGTGC GCAATGCGAT CCATGCGCTT TGCTTCGATC TCGACGCGCC CTCGCGGGTC CGGCTCGTCG TGTCGAAGGG GGGCGCCCAT GCGCTCGATG CCTCGGCCAT GCCGGCGCCG ATCGAAAGCC TGACCTGCGC GGTGCTCTCG CTTCCGGTGT CGGACGGCGA CTGGCGCCTG CGCCACAAGA CCACGGACCG CGCCTTCTAC GAGGCGGGCC TGTCCGCCGC GAAACGGGCG GGCGCCGGTG AGGCCCTGTT CCTGCGCGAC GATGGCTTCC TTACCGAAGG CACGTTCACC AACATCTTCG TCGAGCGCGA TGGCATTCTG CTCACCCCCC GCGCCGAACT CGGCCTGCTT CCCGGCGTAC TGCGCCGAAG CCTGATCGAG GCCGGACGTG CGGTCGAGGC GGACCTGACG CTCGATGACC TTGCCGGCGG CTTCCTCGTC GGCAACGCGC TGCGCGGCCT CATCCCGGCT CGCCTGCTTG GCGCCTGA
|
Protein sequence | MTRAPFILLD DARSEGASDA RLYEDAVEIV VARRAEEVEA ALERIAAVPG NWAGYLAYEA GLALEPRLLP LAAVRTGADG PLVWFARFDR VTRMPSAEVE QWLVRNARGP GRLGPLGPAL SSGGYARAFD LVQEAIRAGD IYQANLTFPL TGTWDGDPLS IYGEVRRDAR AGYGGVIWDG AHWHLSFSPE LFFKLSGGEA RVRPMKGTAP RGATPEEDAR FKADLSASVK DRAENLMIVD LMRNDLARVS QAGSVKVEAP FAIESYPTVH QMVTTVRARL APGEDVRSLL RAIFPCGSIT GAPKIRAMEI IHAAERDARG LYCGSIGRID PTGEAAFNVA IRTLRLSSDH AGASGGRAVM GVGSAVVADS AMIGEWRECV VKGNFLRLSA GHADLIETMA FDPAKGIELL ELHLERMKAS ALALGYSFDR HAVRNAIHAL CFDLDAPSRV RLVVSKGGAH ALDASAMPAP IESLTCAVLS LPVSDGDWRL RHKTTDRAFY EAGLSAAKRA GAGEALFLRD DGFLTEGTFT NIFVERDGIL LTPRAELGLL PGVLRRSLIE AGRAVEADLT LDDLAGGFLV GNALRGLIPA RLLGA
|
| |