Gene Saro_1061 details

Gene Information

Locus tag: Saro_1061
Symbol:
ID: 3916357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1104677
End bp: 1105735
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 66%
IMG OID: 640443796
Product: pyridoxal-5'-phosphate-dependent enzyme, beta subunit
Protein accession: YP_496340
Protein GI: 87199083
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID:
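
The coordinate and length fields in this record agree with one another. The following is a minimal Python sketch of that arithmetic, not part of the original record. It assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the reported values are consistent with both assumptions. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1104677
END_BP = 1105735
GENE_LENGTH_BP = 1059
PROTEIN_LENGTH_AA = 352


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```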


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.173696
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGAAG AATCCAGGGC CTGGGCCGAC GAGGCGGTGC GGCGGATCGA GGCGGATTTC 
AACCGCTCAG CCGACACGCA TCTCATCCGC GTAGAACTTC CACGCTTTCC GGGCATCACG
CTCTATCTCA AGGACGAGAG CGTCCATCCG ACCGGCAGTC TCAAGCACCG GCTCGCCCGC
TCGCTGATCC TCTATGGCCT GTGCAACAAT CGCATCGGGC CCGACACGCT GTTGGTCGAT
GCGACGAGTG GCTCCACGGC GGTGTCGGAA GCCTATTTCG CCCGGCTGAT CGGCCTGCGC
TTCGTGGCGG TTATTCCGCG CAGCACGTCA CCCGCGAAGA TCGATGCGAT CCGCTTTCAC
GGCGGGGAGG TGCACATGGT CGATACCGCG GCGCAGATGT ACGCGGAGGC GGCGCGCCTT
GCGGACGACG CGGGCGGTTT GTTCCTTGAC CAGTTCACGT ATGCCGAGCG CGCGACCGAC
TGGCGCGGCA ACAACAACAT CGCGCAATCC ATCTTCCAGC AGATGTCGCG CGAGGATCAT
CCGGTGCCGT CGTGGATCGT GTGCGGGGCT GGGACCGGTG GTACTTCCGC CACGCTCGGG
CGCTTCATCC GTTACGGCCG CCATGCGACA CGGCTTTGCG TTGCAGATCC TGAAGGCTCG
GTGTTCCACC TCCACCATGC CGACCGAAGC GTCACAGAGC CGTCCCGGGG AGTGCGCTGC
ATGATCGAGG GAATCGGCCG GCCGCGCGTG GAGCCTTCTT TCCTGCCCGA CGTGATCGAC
CGCATGATCG CGGTGCCCGA TGCGGCGTCG ATCGGTGCGA TGCGGGCCAT CGCAGCGCGG
ATCGGACGTC CGGTCGGGGG TTCGACAGGA ACCAATGTCC ACGCTTGCCT CGAGATCGCG
CAGGAAATGG CAGCTTCCGG CGAGACCGGG TCGATCGTCA CGATCCTGTG CGATTCCGGG
CTGCGCTATG CGGGGACCTA CTACGACGAT GCCTGGCTCG ACGGGCAGGG CATCGACTGG
CGGGCCGACG AGGTGCGCGT CGCGGCGTTG CTTTCCTGA
 
Protein sequence
MREESRAWAD EAVRRIEADF NRSADTHLIR VELPRFPGIT LYLKDESVHP TGSLKHRLAR 
SLILYGLCNN RIGPDTLLVD ATSGSTAVSE AYFARLIGLR FVAVIPRSTS PAKIDAIRFH
GGEVHMVDTA AQMYAEAARL ADDAGGLFLD QFTYAERATD WRGNNNIAQS IFQQMSREDH
PVPSWIVCGA GTGGTSATLG RFIRYGRHAT RLCVADPEGS VFHLHHADRS VTEPSRGVRC
MIEGIGRPRV EPSFLPDVID RMIAVPDAAS IGAMRAIAAR IGRPVGGSTG TNVHACLEIA
QEMAASGETG SIVTILCDSG LRYAGTYYDD AWLDGQGIDW RADEVRVAAL LS
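
The gene and protein sequences above can be related with a short script. The following is a minimal Python sketch, not part of the original record, that strips the 10-base grouping, computes the GC fraction, and translates codons up to the first stop. The helper names are hypothetical. The codon table is built from the standard genetic code, whose amino acid assignments translation table 11 shares; table 11 differs in the start codons it allows, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same assignments and differs only in permitted start codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCGGGAAG AATCCAGGGC CTGGGCCGAC"))  # MREESRAWAD
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.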