Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3767 |
Symbol | |
ID | 5077915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 404342 |
End bp | 405880 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640481490 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001166152 |
Protein GI | 146275992 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0143855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGATC CGCTGCCCGC CACCATACCC CATGCCGCGC AGGCGGCGGC GGCCACCTGG CCCGATGCAC CTTCCGTGCT CGAAAACGGC GAAGTCTGGA GCTTTGCCGA GCTGTGGCAG CGCAGCCGCG CGGCGGCTTC GGCGCTGATC GCGCGGGGCA TAAAGGCCGG AGATCGCGTG GCGATCTGGG CGCCGAACTC GCGCGAGTGG ATCGTTGCGG CGATCGCGAC CATGGCGTGC GGGGCGGCGG TGGTGACGCT CAACACGCGG CTCAAGGGCC GCGAGGCGGG CGACATCTTG CGGCGCACCA ATGCGCGGCT TCTGTTCACG GTCGAGGGCT TTCTCGGCAT CGACTATCGC GCGCTGATCG TGGACGAGGA TCTGCCGGCG CTTGAAGGAA CAGTCCTGCT TGACCGGGAA TTCGACGCTT TCATGCGTGA CGGCAGGGGG GCGGGCGATC CGGCGGTCGA TGCGGCGATG GCGCAGATCG ATGCCGATAC CGTGTCGGAC ATCCTGTTCA CCAGCGGCAC GACAGGCAGC CCCAAGGGCG TGTTGATGAC TTATGGCCGC GTCCTGCCTC AGGCGGCGGT GTGGTGCGCG AACACCCGCC TGACCGAGGG CGACCGTTAC CTGATTGCCA ACCCTTTCTT CCATTCCTTC GGGATGAAGG TGGGCTGGGT CGCGTGCATC CTCGCCGGTG CCGTGGCAGT GCCCATGCTG CAATTCGACG TGGGTCAGGC GATTGATCTG ATCGAGCGCG AGCGCATCAC GTTCATGCCC GGTCCGCCGA CGATCTTCCA GATGCTGCTG GCCGAACTCG ACAAGCGCAA GTGGGACTGT TCCTCGCTGC GGGGAGGAAC GACCGGCGCG GCGACAGTGC CGCCCGCACT GGTGGAGCGC ATCCGCAACG ACCTGGGCAT GGTGGACCTC ATCACCGCCT ATGGCATGAC CGAATGCGTC AACATCACGA CGTGCGTGCC CGGTGACGAT GCCGAGACCA TTGCACGGAC CTGTGGCAAG GCGTTTCCGG GCAACGAGGT GCGCATCGCC GACGAGAACG GGAACGAACT GCCAAGGGGC GAGGCGGGAG AGGTTCTGGT GCGGGGGCAG GGCGTCATGC TCGGTTATCT CGACAACCCG GAAGCCACTG CCGAGGCGAT CGACGCAGCA GGCTGGTTGC ACACGGGCGA CGTCGGCACG ATGGACGAGC GCGGTTATGT GCGCATCACC GACCGGATGA AGGATCTCTA CATCTCGGGC GGGTTCAATG TGTACCCGGC GGAAGTCGAA AAGCTGCTGG CCGAGCATCC GGCCATCGGA ATGGCTGCGG TCGTCGGCGT TCCGGACGAG CGACTGGGCG AGGTGGGGCG CGCCTTCGTG GTCCTGCGGC CCGGCGCGAG TGCGACCGAG GCCGAACTTG TCGCCTGGTC GCGCGAGAAC ATGGCGAACT ACAAGGTGCC GCGCAGCTTT GTGCTGGTCG ATGACCTGCC CCGCAACGCG TCGGGCAAGG TGCTGAAGAC CGAACTGCGG GCCGGATAA
|
Protein sequence | MLDPLPATIP HAAQAAAATW PDAPSVLENG EVWSFAELWQ RSRAAASALI ARGIKAGDRV AIWAPNSREW IVAAIATMAC GAAVVTLNTR LKGREAGDIL RRTNARLLFT VEGFLGIDYR ALIVDEDLPA LEGTVLLDRE FDAFMRDGRG AGDPAVDAAM AQIDADTVSD ILFTSGTTGS PKGVLMTYGR VLPQAAVWCA NTRLTEGDRY LIANPFFHSF GMKVGWVACI LAGAVAVPML QFDVGQAIDL IERERITFMP GPPTIFQMLL AELDKRKWDC SSLRGGTTGA ATVPPALVER IRNDLGMVDL ITAYGMTECV NITTCVPGDD AETIARTCGK AFPGNEVRIA DENGNELPRG EAGEVLVRGQ GVMLGYLDNP EATAEAIDAA GWLHTGDVGT MDERGYVRIT DRMKDLYISG GFNVYPAEVE KLLAEHPAIG MAAVVGVPDE RLGEVGRAFV VLRPGASATE AELVAWSREN MANYKVPRSF VLVDDLPRNA SGKVLKTELR AG
|
| |