Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1841 |
Symbol | |
ID | 3918401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1940747 |
End bp | 1941853 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640444583 |
Product | dihydropteroate synthase |
Protein accession | YP_497115 |
Protein GI | 87199858 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.285015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAAG TCTACATTCG TCCGATCGCG CTGGCCGACA GCCCCCAGTC GGAGGAAGGC GAGGCCATCC GTCTAGGGGG CGGGCTGGTC TATGCAAGCC GCTTCGCGCT GATCGTCCGC GACGGCGCGC GGGTCGTGTC GCGCCGCCGA TTTTCGGTGC CCGAAGTTTC GAGAGCTTTT GGCGCCCTGC CGGATGGCCT TGCCGCAGAA GCTGCGGCCC AGTGGGACAA CCTGCGCCGC GTCCATCAGC CGATTGGCTG TGGCCAGCGC ATGCTGCGGC TCGACCAGCC GCAGGTCATG GGCATTCTCA ACGTAACGCC CGACAGCTTC TCCGACGGGG GCAAGTTCCT CGACAGGCCC GAGACTGCGC TCGAACACGC TCACGCGATG CTTGCGGCGG GCGCGGCGCT GATCGACGTC GGCGGGGAAT CGACCCGCCC CGGCGCCGCC GCCGTCTGGG AAGGCGACGA GGCGAAGCGC GTGGTCCCGG TCATCGAGGC CCTGGCCGCT TCCGGCGCCG CGATCTCGAT AGACAGTCGC CGATCTACGG TGATCGAGGC GGCGCTTGCC GCAGGCGCAC ACATCGTCAA CGACGTTTCG GCAATGCGCC ACGATCCGCG CACGGCGGAA ATCGTGGCCG CCAGCGGTGC GCCGGTCGTG CTGATGCATG CGCCAGGAAG CGACGGCGAC CTCCATGCCG ATGGCGAATA TGCGGATGTG GTGCTCGACG TCTTCGATGC CTTGCGCGAA CGGCGCGACG CGGCGCTAGC CGGCGGGATC GCTCCCGAGA AGATCCTTCT CGATCCGGGC ATCGGCTTCG GCAAGTCGCT CGCGGAGAAT CTGGCGCTGG TGAACGCCCT GCCGATGTTC CACGCGCTGG GCCACCCGAT CCTTTTCGGG GCGAGCCGCA AGCGCATGAT CGGCGCGCTA TCCAACGAGG CGCCGGCGCA CCAGCGCATG GCTGGTTCGG TTATGCTGGC GCTGAAGGCG ATGGACGCCG GATGCCAGAT GGTCCGCGTT CACGATGTGG CCGAGACGGT TCAGGCCCTG CATGTCTGGC GCGGGCTGCG CGACGCGGCG TTGACGGATT TCAGCCAACT GGCCTGA
|
Protein sequence | MLKVYIRPIA LADSPQSEEG EAIRLGGGLV YASRFALIVR DGARVVSRRR FSVPEVSRAF GALPDGLAAE AAAQWDNLRR VHQPIGCGQR MLRLDQPQVM GILNVTPDSF SDGGKFLDRP ETALEHAHAM LAAGAALIDV GGESTRPGAA AVWEGDEAKR VVPVIEALAA SGAAISIDSR RSTVIEAALA AGAHIVNDVS AMRHDPRTAE IVAASGAPVV LMHAPGSDGD LHADGEYADV VLDVFDALRE RRDAALAGGI APEKILLDPG IGFGKSLAEN LALVNALPMF HALGHPILFG ASRKRMIGAL SNEAPAHQRM AGSVMLALKA MDAGCQMVRV HDVAETVQAL HVWRGLRDAA LTDFSQLA
|
| |