Gene Information |
Locus tag | Saro_1312 |
Symbol | |
ID | 3917944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1356877 |
End bp | 1360107 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640444049 |
Product | hypothetical protein |
Protein accession | YP_496590 |
Protein GI | 87199333 |
COG category | |
COG ID | |
TIGRFAM ID | |
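The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes 1-based, inclusive coordinates (the convention under which End bp minus Start bp plus one gives the listed gene length) and a CDS that ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1356877, 1360107)
print(length)                  # 3231
print(protein_length(length))  # 1076
```

Both values match the Gene Length and Protein Length fields of this record.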
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGATC GTTTCGAAAT GGCGGAATCG GAAGAGCCGC CTGTCTCCGA CGCGCCCCGG CGACGGGTCC GCTGGAGGCG GGTCGGCGTG CCCGCGCTCG CGCTGGTCAT GGCTGCCGGA GCGGGTCTGT GGCTTGGCCG CGAGCAGCTT GCCGATCGCG TCATCGCGGG GCAGCTCCGT GGCTACGGCC TGCCTGCGAC CTATGAAATC GAGAGCATCG GGCCCGGCAC GCAGATCCTG CGCAACGTCG TCGTCGGCGA TCCGCAGCGC CCGGACCTGA CTGTCGAGCG CGTGCTGGTG GGGATCGAGT ATCACCTTGG TACGCCCACC ATCGGCTCGG TCCGGCTCGT CCGGCCAAGG GTTTACGGGC AATACCTGAA CGGCAAGCTC AGCTTCGGCT CGCTTGACAA AGTGCTGTTC GCACCGCGCG ATCCGAACAA GCCCTTCGCG CTGCCCAGGC TGGACCTGGC CGTCGAGGAC GGGCGCGGAC TGATCCTGTC GGACTACGGG AAGGTCGGGC TGAAGCTCGA CGGACACGGG CGGCTGGACG ATGGCTTCGC CGGAACGCTG GCCGCCGTCG CGCCGCGCAT TTCCGGGGGC AACTGCACGG GCGAGGACGC AAGCCTGTTC GGCCGTGTCT CGATCCGCGA GGAGCGTCCG GGCTTTACTG GTCCCCTGCG GCTCAAAAGC CTGTCCTGCG AAGGCGGCGC GCGCAGCGGG GCCATCGTGG CGCAGGTCGA CGCGCGTGCC GACAAGGGGC TTGGCGGCGT GGCCGGCGAT GCCACCGTTC GCGGGCAGGG CCTGGCTCTT ACTGGCGCAA CGATCGAGAC CCTGGCGCTC GATACCCGGC TCGCGTGGCG CGAGAACGTG CTGACGGGGC GGGTCGAGGC CAACGCCGGC GGCGTGAAGA CCGGGGGCGT GAGCATCGGC CTGCTCGGTG TCGAGGGCGC GGTGCGGGCG CGCGAGGGCT TCCGCAAGGC GGAGTTTCGC GGCGCGCTCG AGGGCGAGGG GCTGCGGCGC GGCAAGGTGC TGGACGCAGG TCTTGCCAGT GCAGAGCGGA GCGCGAGCGG CACGCTGCTT GCGCCGATGC TCGCGCAATT GCGCCAGAGC CTGCTGCGCG AGGAGCGCGG CAGCAGGCTA TCGGGCGAGA TCGCGCTGCG CCGCAACGGC GATGGCTTGT CGCTGGTGGC GCCGCAGGCG CAACTCGTCG GCGGCAGCGG GGCATCCCTG CTGACGCTTT CGCGCTTCCA GGTGGCGACC GGCGGCGACG ATGGCCCGCC GCGCCTCGCC GGGAACTTCG CGACCGGCGG CGCAGGCATC CCGCGCATCG TCGGCCGGAT GGAGCGCGGG CGTGCCGGAC AGGCGCTTTT CCGCCTGACG ATGGCGCCGT GGCGCGCCGG CGGCGGCTCG CTCGCGATAC CGGAGATGAT GATCGCGCAG GTCGGCGACG GATCGCTCGG CTTTTCCGGA ACCGCGCAGC TTTCCGGAGC GATACCGGGC GGTTCGGTGC AGAACCTCGT CCTGCCGCTC AACGGTGCCT ATGGCGCCAG CGGCGAACTG GCGCTGTGGA AGCGCTGCGT GACCGCGCGG TTCGACCGGC TGGTGCTTGG CCAGATGCAG GTCGACGGCA ACCGCCTCGG TCTTTGTCCG CCCGGCGGCT CGGCGATCGT CCGCAACGGC GCTGCGGGCC TGCGGGTCGC GGCGGGGACA CCGGGGCTGG AACTCACTGG ACGGCTTGGC GAGACGTCGC TTGCCGTGAA GACGGGTGCG GTGGGCTTTG CCTGGCCCGG CGTCCTGACC 
GCCAGGGCGG TGGAAGTGGC GCTTGGCCCG GCCGACACGG CGACCCACCT CAAGCTTGCC GATCTCGATG CTCGGCTGGG CAAGGACTTT ACCGGAAGCT TCGGCGGGGT CGAGGCGAAG CTGGCGGCGG TTCCGCTCGA CGTGAGCAAT GCCGCCGGGC AGTGGCGCTA TGCCGATGGC GCGCTTGTCC TTTCCGGCGT GGGCTTCGAG CTTACCGATC GGCTCGATCC GGCACGCTTC GAACGATTGC GGAGCGAGGG CGCGACGCTT GCGCTTGCCG ACAATCGCAT CGTCGCCAAC GCCTTGCTGC GCGAAGCGAA GAGCGGGCGC GAAGTGGCGA CGACCGTGAT CCGCCACGAT CTCGGAACCG GCGCCGGCCA TGCCGACCTC AAGGTCGACG GCCTTGTGTT CGACAAGGGC TTCCAGCCGG ATGACCTTAC CCGGCTGGCG CTGGGCGTCG TGGCCAACGT CAAGGGCACC GTGCGCGGCG AAGGCGACAT CGACTGGTCG GCGCGCGGCG TGACCAGCAA GGGGCGCTTC GGGACTGAAA GCATGGACCT TGCGGCAGCC TTCGGCCCGG TAAAGGGTCT TTCGGGCACG CTCGAGTTCA CCGACCTTCT GGGCATGGTG ACCGCCCCCC ACCAGAAGCT GCGCGTCGCC TCTGTCAATC CGGGGATCGA AGTGGCGGAC GGCGTCGTCG ATCTCACGCT GCTGCCCGAC CAGGTGCTGC GCCTGCATGA AGCGCGATGG CCGTTCCTGG GCGGCACGCT GACCCTCGAG CCGACCGACC TGCGACTCGG CGTGGCCGAA GCGCGGCGCT ACACGCTCAC CATCGTCGGT CTCGATGCGG CGAAGTTCGT CGAGAGGATG GAGCTGGGCA ACCTTTCCGC CACCGGAACT TTCGACGGGC AGCTTCCGCT GGTGTTCGAT GCGAACGGCG GGCGGCTGGA GAAGGGTACG CTGGTTTCCC GTCCGCCGGG AGGCAATGTC TCCTACGTCG GCGCGCTGAC CTACAGGGAC CTGTCTCCGA TGGCGAACTT CGCGTTCGAT GCGCTCAAGT CGCTCGACTA CCGCACAATG ACCATCGCCA TCGAGGGCGA TCTCGAGGGC GAGATCGTGA CCAACGTGAA GTTCGGCGGG GTCAAGCAGG GCGCGGGCAC GAAGCGCAAC TTCATCACCA AGCAGGTCGC CAACCTGCCG ATCCAGTTCA ACGTCAACAT CCGTGCGCCG TTCTACCAGC TCATCACTTC GGTGAAGGCG ATGTACGACC CCGCCTTCAT CAAGGACCCG CGCACGCTGG GCCTTGTCGA TGCGCAGGGC CGACCCATCC AGCGGTTCGG GAACGGCGTG CGGCCCGGCG GGGCGCCGGT CGTGGTCCTG CCCGGAGAAC AGCGCAGCAT TCAGCCTGCA GAAAGCGGAA ACCTGCCATG A
|
Protein sequence | MADRFEMAES EEPPVSDAPR RRVRWRRVGV PALALVMAAG AGLWLGREQL ADRVIAGQLR GYGLPATYEI ESIGPGTQIL RNVVVGDPQR PDLTVERVLV GIEYHLGTPT IGSVRLVRPR VYGQYLNGKL SFGSLDKVLF APRDPNKPFA LPRLDLAVED GRGLILSDYG KVGLKLDGHG RLDDGFAGTL AAVAPRISGG NCTGEDASLF GRVSIREERP GFTGPLRLKS LSCEGGARSG AIVAQVDARA DKGLGGVAGD ATVRGQGLAL TGATIETLAL DTRLAWRENV LTGRVEANAG GVKTGGVSIG LLGVEGAVRA REGFRKAEFR GALEGEGLRR GKVLDAGLAS AERSASGTLL APMLAQLRQS LLREERGSRL SGEIALRRNG DGLSLVAPQA QLVGGSGASL LTLSRFQVAT GGDDGPPRLA GNFATGGAGI PRIVGRMERG RAGQALFRLT MAPWRAGGGS LAIPEMMIAQ VGDGSLGFSG TAQLSGAIPG GSVQNLVLPL NGAYGASGEL ALWKRCVTAR FDRLVLGQMQ VDGNRLGLCP PGGSAIVRNG AAGLRVAAGT PGLELTGRLG ETSLAVKTGA VGFAWPGVLT ARAVEVALGP ADTATHLKLA DLDARLGKDF TGSFGGVEAK LAAVPLDVSN AAGQWRYADG ALVLSGVGFE LTDRLDPARF ERLRSEGATL ALADNRIVAN ALLREAKSGR EVATTVIRHD LGTGAGHADL KVDGLVFDKG FQPDDLTRLA LGVVANVKGT VRGEGDIDWS ARGVTSKGRF GTESMDLAAA FGPVKGLSGT LEFTDLLGMV TAPHQKLRVA SVNPGIEVAD GVVDLTLLPD QVLRLHEARW PFLGGTLTLE PTDLRLGVAE ARRYTLTIVG LDAAKFVERM ELGNLSATGT FDGQLPLVFD ANGGRLEKGT LVSRPPGGNV SYVGALTYRD LSPMANFAFD ALKSLDYRTM TIAIEGDLEG EIVTNVKFGG VKQGAGTKRN FITKQVANLP IQFNVNIRAP FYQLITSVKA MYDPAFIKDP RTLGLVDAQG RPIQRFGNGV RPGGAPVVVL PGEQRSIQPA ESGNLP
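The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The following is a minimal sketch of stripping that layout and computing GC content. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not expected to equal the 69% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence shown above (30 of 3231 bases).
prefix = clean_sequence("ATGGCAGATC GTTTCGAAAT GGCGGAATCG")
print(len(prefix))         # 30
print(gc_percent(prefix))  # 50.0
```

Running `gc_percent` on the complete, cleaned gene sequence is the way to compare against the GC content field.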