Gene Saro_1312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1312 
Symbol 
ID3917944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1356877 
End bp1360107 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content69% 
IMG OID640444049 
Producthypothetical protein 
Protein accessionYP_496590 
Protein GI87199333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATC GTTTCGAAAT GGCGGAATCG GAAGAGCCGC CTGTCTCCGA CGCGCCCCGG 
CGACGGGTCC GCTGGAGGCG GGTCGGCGTG CCCGCGCTCG CGCTGGTCAT GGCTGCCGGA
GCGGGTCTGT GGCTTGGCCG CGAGCAGCTT GCCGATCGCG TCATCGCGGG GCAGCTCCGT
GGCTACGGCC TGCCTGCGAC CTATGAAATC GAGAGCATCG GGCCCGGCAC GCAGATCCTG
CGCAACGTCG TCGTCGGCGA TCCGCAGCGC CCGGACCTGA CTGTCGAGCG CGTGCTGGTG
GGGATCGAGT ATCACCTTGG TACGCCCACC ATCGGCTCGG TCCGGCTCGT CCGGCCAAGG
GTTTACGGGC AATACCTGAA CGGCAAGCTC AGCTTCGGCT CGCTTGACAA AGTGCTGTTC
GCACCGCGCG ATCCGAACAA GCCCTTCGCG CTGCCCAGGC TGGACCTGGC CGTCGAGGAC
GGGCGCGGAC TGATCCTGTC GGACTACGGG AAGGTCGGGC TGAAGCTCGA CGGACACGGG
CGGCTGGACG ATGGCTTCGC CGGAACGCTG GCCGCCGTCG CGCCGCGCAT TTCCGGGGGC
AACTGCACGG GCGAGGACGC AAGCCTGTTC GGCCGTGTCT CGATCCGCGA GGAGCGTCCG
GGCTTTACTG GTCCCCTGCG GCTCAAAAGC CTGTCCTGCG AAGGCGGCGC GCGCAGCGGG
GCCATCGTGG CGCAGGTCGA CGCGCGTGCC GACAAGGGGC TTGGCGGCGT GGCCGGCGAT
GCCACCGTTC GCGGGCAGGG CCTGGCTCTT ACTGGCGCAA CGATCGAGAC CCTGGCGCTC
GATACCCGGC TCGCGTGGCG CGAGAACGTG CTGACGGGGC GGGTCGAGGC CAACGCCGGC
GGCGTGAAGA CCGGGGGCGT GAGCATCGGC CTGCTCGGTG TCGAGGGCGC GGTGCGGGCG
CGCGAGGGCT TCCGCAAGGC GGAGTTTCGC GGCGCGCTCG AGGGCGAGGG GCTGCGGCGC
GGCAAGGTGC TGGACGCAGG TCTTGCCAGT GCAGAGCGGA GCGCGAGCGG CACGCTGCTT
GCGCCGATGC TCGCGCAATT GCGCCAGAGC CTGCTGCGCG AGGAGCGCGG CAGCAGGCTA
TCGGGCGAGA TCGCGCTGCG CCGCAACGGC GATGGCTTGT CGCTGGTGGC GCCGCAGGCG
CAACTCGTCG GCGGCAGCGG GGCATCCCTG CTGACGCTTT CGCGCTTCCA GGTGGCGACC
GGCGGCGACG ATGGCCCGCC GCGCCTCGCC GGGAACTTCG CGACCGGCGG CGCAGGCATC
CCGCGCATCG TCGGCCGGAT GGAGCGCGGG CGTGCCGGAC AGGCGCTTTT CCGCCTGACG
ATGGCGCCGT GGCGCGCCGG CGGCGGCTCG CTCGCGATAC CGGAGATGAT GATCGCGCAG
GTCGGCGACG GATCGCTCGG CTTTTCCGGA ACCGCGCAGC TTTCCGGAGC GATACCGGGC
GGTTCGGTGC AGAACCTCGT CCTGCCGCTC AACGGTGCCT ATGGCGCCAG CGGCGAACTG
GCGCTGTGGA AGCGCTGCGT GACCGCGCGG TTCGACCGGC TGGTGCTTGG CCAGATGCAG
GTCGACGGCA ACCGCCTCGG TCTTTGTCCG CCCGGCGGCT CGGCGATCGT CCGCAACGGC
GCTGCGGGCC TGCGGGTCGC GGCGGGGACA CCGGGGCTGG AACTCACTGG ACGGCTTGGC
GAGACGTCGC TTGCCGTGAA GACGGGTGCG GTGGGCTTTG CCTGGCCCGG CGTCCTGACC
GCCAGGGCGG TGGAAGTGGC GCTTGGCCCG GCCGACACGG CGACCCACCT CAAGCTTGCC
GATCTCGATG CTCGGCTGGG CAAGGACTTT ACCGGAAGCT TCGGCGGGGT CGAGGCGAAG
CTGGCGGCGG TTCCGCTCGA CGTGAGCAAT GCCGCCGGGC AGTGGCGCTA TGCCGATGGC
GCGCTTGTCC TTTCCGGCGT GGGCTTCGAG CTTACCGATC GGCTCGATCC GGCACGCTTC
GAACGATTGC GGAGCGAGGG CGCGACGCTT GCGCTTGCCG ACAATCGCAT CGTCGCCAAC
GCCTTGCTGC GCGAAGCGAA GAGCGGGCGC GAAGTGGCGA CGACCGTGAT CCGCCACGAT
CTCGGAACCG GCGCCGGCCA TGCCGACCTC AAGGTCGACG GCCTTGTGTT CGACAAGGGC
TTCCAGCCGG ATGACCTTAC CCGGCTGGCG CTGGGCGTCG TGGCCAACGT CAAGGGCACC
GTGCGCGGCG AAGGCGACAT CGACTGGTCG GCGCGCGGCG TGACCAGCAA GGGGCGCTTC
GGGACTGAAA GCATGGACCT TGCGGCAGCC TTCGGCCCGG TAAAGGGTCT TTCGGGCACG
CTCGAGTTCA CCGACCTTCT GGGCATGGTG ACCGCCCCCC ACCAGAAGCT GCGCGTCGCC
TCTGTCAATC CGGGGATCGA AGTGGCGGAC GGCGTCGTCG ATCTCACGCT GCTGCCCGAC
CAGGTGCTGC GCCTGCATGA AGCGCGATGG CCGTTCCTGG GCGGCACGCT GACCCTCGAG
CCGACCGACC TGCGACTCGG CGTGGCCGAA GCGCGGCGCT ACACGCTCAC CATCGTCGGT
CTCGATGCGG CGAAGTTCGT CGAGAGGATG GAGCTGGGCA ACCTTTCCGC CACCGGAACT
TTCGACGGGC AGCTTCCGCT GGTGTTCGAT GCGAACGGCG GGCGGCTGGA GAAGGGTACG
CTGGTTTCCC GTCCGCCGGG AGGCAATGTC TCCTACGTCG GCGCGCTGAC CTACAGGGAC
CTGTCTCCGA TGGCGAACTT CGCGTTCGAT GCGCTCAAGT CGCTCGACTA CCGCACAATG
ACCATCGCCA TCGAGGGCGA TCTCGAGGGC GAGATCGTGA CCAACGTGAA GTTCGGCGGG
GTCAAGCAGG GCGCGGGCAC GAAGCGCAAC TTCATCACCA AGCAGGTCGC CAACCTGCCG
ATCCAGTTCA ACGTCAACAT CCGTGCGCCG TTCTACCAGC TCATCACTTC GGTGAAGGCG
ATGTACGACC CCGCCTTCAT CAAGGACCCG CGCACGCTGG GCCTTGTCGA TGCGCAGGGC
CGACCCATCC AGCGGTTCGG GAACGGCGTG CGGCCCGGCG GGGCGCCGGT CGTGGTCCTG
CCCGGAGAAC AGCGCAGCAT TCAGCCTGCA GAAAGCGGAA ACCTGCCATG A
 
Protein sequence
MADRFEMAES EEPPVSDAPR RRVRWRRVGV PALALVMAAG AGLWLGREQL ADRVIAGQLR 
GYGLPATYEI ESIGPGTQIL RNVVVGDPQR PDLTVERVLV GIEYHLGTPT IGSVRLVRPR
VYGQYLNGKL SFGSLDKVLF APRDPNKPFA LPRLDLAVED GRGLILSDYG KVGLKLDGHG
RLDDGFAGTL AAVAPRISGG NCTGEDASLF GRVSIREERP GFTGPLRLKS LSCEGGARSG
AIVAQVDARA DKGLGGVAGD ATVRGQGLAL TGATIETLAL DTRLAWRENV LTGRVEANAG
GVKTGGVSIG LLGVEGAVRA REGFRKAEFR GALEGEGLRR GKVLDAGLAS AERSASGTLL
APMLAQLRQS LLREERGSRL SGEIALRRNG DGLSLVAPQA QLVGGSGASL LTLSRFQVAT
GGDDGPPRLA GNFATGGAGI PRIVGRMERG RAGQALFRLT MAPWRAGGGS LAIPEMMIAQ
VGDGSLGFSG TAQLSGAIPG GSVQNLVLPL NGAYGASGEL ALWKRCVTAR FDRLVLGQMQ
VDGNRLGLCP PGGSAIVRNG AAGLRVAAGT PGLELTGRLG ETSLAVKTGA VGFAWPGVLT
ARAVEVALGP ADTATHLKLA DLDARLGKDF TGSFGGVEAK LAAVPLDVSN AAGQWRYADG
ALVLSGVGFE LTDRLDPARF ERLRSEGATL ALADNRIVAN ALLREAKSGR EVATTVIRHD
LGTGAGHADL KVDGLVFDKG FQPDDLTRLA LGVVANVKGT VRGEGDIDWS ARGVTSKGRF
GTESMDLAAA FGPVKGLSGT LEFTDLLGMV TAPHQKLRVA SVNPGIEVAD GVVDLTLLPD
QVLRLHEARW PFLGGTLTLE PTDLRLGVAE ARRYTLTIVG LDAAKFVERM ELGNLSATGT
FDGQLPLVFD ANGGRLEKGT LVSRPPGGNV SYVGALTYRD LSPMANFAFD ALKSLDYRTM
TIAIEGDLEG EIVTNVKFGG VKQGAGTKRN FITKQVANLP IQFNVNIRAP FYQLITSVKA
MYDPAFIKDP RTLGLVDAQG RPIQRFGNGV RPGGAPVVVL PGEQRSIQPA ESGNLP