Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1341 |
Symbol | |
ID | 3917791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1382689 |
End bp | 1383666 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444079 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_496619 |
Protein GI | 87199362 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGCC GCCGCCGATC CAGATTGCCG CTGCTGGCGG CGGCCCTTGT CGTGCTGGTC GTGGCGGCGT GGCTCGTCGG CGGCTGGTTC TCGTCTGGCC CGCTGGAAAA GCAGCTCGAA TTCGACGTGG GCGAAGGCGA GGGGCTGAGC GCGCTTTCGG ACGATCTGGA GGCGCAGGGC GCCATCGGTT CGGCCACGCT GTTCAAGCTG CGCGCACGGC TGCTGGGCGG CGGCACCGAA ATCAAGACCG GTTCGTTCCT GATCCCCAAG CGCGCGAGCG AAGCTACGAT CCTTGAAATC CTCAAGGGCG ACAAGGTCAT CCGCCGCCTG ATCACCATCC CCGAAGGCAT GCCGTCGATC ATGGTGGCCG AGCGTCTGCG CGCCGTGGAT GGCCTGACCG GCGATGTCGC GGTGCCCGAG GAAGGTTCGG TGCTGCCCGA CAGCTACGAC TGGCAGAAGG GTGAAAGCCG CGCCGCCGTG GTCAAGCGGA TGCAGGCGGC AATGGACAAG ACCCTGGCCG AACTCTGGGC AAAGCGATCG CCGCGCACGG TCGCCAAGAC GCCGCAGGAG GCGCTGGTGC TGGCATCGAT CGTCGAGAAG GAAACGGGCA AGCCCGAGGA GCGGCGCATG GTTGCCGGCC TCTACTCCAA TCGCCTGCGC CAGCGCATGC TGCTTCAGGC CGACCCGACG ATCATCTATC CGATCACCGG GGGCAAGCCG CTCGGCCGCC GCATCCGCCA GTCCGAGATC CAGGCGGTGA ACGGCTACAA CACCTATACG ATGATCGGCC TGCCCAAGGG CCCGATCACC AATCCGGGGC GCGATTCCAT CGCGGCGGTG CTCGACCCGG CGGAGACCGA TGCGCTGTTC ATGGTGGCCG ACGGTACCGG CGGGCACGTT TTCGCGAGCA CGCTGCAGGA ACACAATGCC AATGTTGCCA AGTGGTTCGC CATCCGCAAG GCTCGCGGCG AGATCTGA
|
Protein sequence | MARRRRSRLP LLAAALVVLV VAAWLVGGWF SSGPLEKQLE FDVGEGEGLS ALSDDLEAQG AIGSATLFKL RARLLGGGTE IKTGSFLIPK RASEATILEI LKGDKVIRRL ITIPEGMPSI MVAERLRAVD GLTGDVAVPE EGSVLPDSYD WQKGESRAAV VKRMQAAMDK TLAELWAKRS PRTVAKTPQE ALVLASIVEK ETGKPEERRM VAGLYSNRLR QRMLLQADPT IIYPITGGKP LGRRIRQSEI QAVNGYNTYT MIGLPKGPIT NPGRDSIAAV LDPAETDALF MVADGTGGHV FASTLQEHNA NVAKWFAIRK ARGEI
|
| |