Gene Saro_0501 details

Gene Information

Locus tag: Saro_0501
Symbol:
ID: 3918630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 543116
End bp: 544357
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 64%
IMG OID: 640443231
Product: aminotransferase
Protein accession: YP_495783
Protein GI: 87198526
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
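
The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(543116, 544357)
print(length, protein_length_aa(length))  # 1242 413
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.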


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGGCGAT CCTGCCGCGT TCATTCGACG AGTAACCAAA CCGTGGACGA AGAATTCTAC 
CGCATGAAGC GCCTGCCGCC CTACGTCATT GCCGAAGTCA ACGGCATGCG GGCCGCCGCG
CGCGCCGCCG GTAGGGACAT CATCGACCTC GGCATGGGCA ACCCGGACCT GCCGCCGCCC
CAGCACGTGA TCGACAAGCT GTGCGAAGTG GCGCAGAAGC CCGATGCGCA CGGCTATTCG
GCATCCTCGG GCATCCCGGG CGTGCGCAAG GCCCAGGCCA ACTACTATGG CCGCCGCTTC
AACGTAGATC TCGATCCCGA GACCGAAGTG GTGATGACGA TGGGTTCGAA GGAGGGCCTG
GCCAGCCTCG CAACCGCGAT CACCGCGCCC GGGGACGTCG TGCTGGCCCC CAATCCCAGC
TATCCGATCC ACACCTTCGG CTTCATCATA GCCGGGGCGA CGATCCGTTC GGTGCCGACC
ACGCCTGACG AGCGCTATTG GGATTCGCTC GACCGCGCGA TGAAGTATTC GGTGCCGCGC
CCCTCGATCC TTATCGTCAA CTATCCGTCC AACCCGACTG CGGAGACGGT AGACCTCGCC
TTCTACGAAC GCCTTGTTGC CTGGGCGAAG GAGAACAAGG TCTGGATCCT GTCCGACCTC
GCCTATTCCG AACTGTACTA TGACGGCAAT CCGACCCGCT CCATCCTCGA GGTTCCGGGT
GCGAAGGATG TCGCGGTCGA GTTCACCTCG ATGTCCAAGA CCTTCTCGAT GGCAGGCTGG
CGCGTTGGCT TTGCGGTTGG CAACCAGCGC CTTATCGCGG CGCTTAAGCG CGTGAAATCC
TACCTCGATT ACGGCGCGTT CACGCCGATC CAGGCTGCCG CCTGCGCAGC GCTGAACGGC
CCGCAGGACA TCGTCGTCAA GAACCGCGAA CTCTACCAGA AGCGCCGCGA CGTGATGGTG
GAGGCATTCG GCCGCGCCGG ATGGGAAATC CCCAGCCCCA GCGCATCGAT GTTCGCGTGG
GCGCCGTTGC CGCCGGCACT GACCCATCTG GGCAGCCTCG AGTTCTCGAA GCAGCTTCTT
ACCCATGCCG AAGTCGCGGT TGCCCCGGGC GTTGGCTATG GCGAGGACGG CGAAGGTTTC
GTGCGCATCG CGATGGTCGA GAACGAGCAG CGCCTTCGCC AGGCGGCGCG CAACATCAAG
CGTTACCTGC AAAGCATGGG CGTCAACGCC TCGGCGGCCT GA
 
Protein sequence
MRRSCRVHST SNQTVDEEFY RMKRLPPYVI AEVNGMRAAA RAAGRDIIDL GMGNPDLPPP 
QHVIDKLCEV AQKPDAHGYS ASSGIPGVRK AQANYYGRRF NVDLDPETEV VMTMGSKEGL
ASLATAITAP GDVVLAPNPS YPIHTFGFII AGATIRSVPT TPDERYWDSL DRAMKYSVPR
PSILIVNYPS NPTAETVDLA FYERLVAWAK ENKVWILSDL AYSELYYDGN PTRSILEVPG
AKDVAVEFTS MSKTFSMAGW RVGFAVGNQR LIAALKRVKS YLDYGAFTPI QAAACAALNG
PQDIVVKNRE LYQKRRDVMV EAFGRAGWEI PSPSASMFAW APLPPALTHL GSLEFSKQLL
THAEVAVAPG VGYGEDGEGF VRIAMVENEQ RLRQAARNIK RYLQSMGVNA SAA
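
Both sequences are printed in space-separated blocks of ten characters. The sketch below joins such blocks into one string and computes a GC percentage. It is illustrative only: the helper names are hypothetical, and it makes no claim about how the 64% GC content reported for this gene was derived. To check that figure, pass the full gene sequence rather than the single line used here.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used only as a short illustration.
first_line = "TTGCGGCGAT CCTGCCGCGT TCATTCGACG AGTAACCAAA CCGTGGACGA AGAATTCTAC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```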