Gene Saro_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2100 
Symbol 
ID3917748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2236438 
End bp2237667 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content65% 
IMG OID640444853 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_497373 
Protein GI87200116 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0674945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAT ACGCTTCTTC CCTGCTGCGC CTGCTCTCGG AGCGGGGCTA CATCCATCAG 
ATGACCGACG CCGACGCACT CGATGCGCTG GCGGCGAAGC AGGTCATACC CGGCTATATC
GGCTTCGATC CGACCGCGCC ATCGCTGCAC GTCGGGTCGA TGGTGCAGAT CATGCTCCTG
CGCCGCCTCC AGCAGGCCGG GCACAAGCCC ATCGTGCTGA TGGGCGGCGG CACCGGCAAG
ATCGGCGACC CGAGCTTCAA GGACGAGGCA CGCAAGCTGA TGACCAACGA CGTCATCGCG
GCCAACGTCG CCTCGATCAA GACCGTGTTC GAACGCTTCC TGACCTTCGG CGACGGCCCG
ACCGACGCGG TCATGGTCGA CAATGCCGAC TGGCTCGACC GGCTTGAATA CATCCCGTTC
CTGCGCGAGG TGGGCCAGCA CTTCTCGGTC AACCGCATGC TCAGCTTCGA TTCGGTGAAG
CAGCGCCTTG ACCGCGAGCA ATCGCTCTCG TTCCTCGAAT TCAACTACAT GATCCTCCAG
GCCTACGACT TCCGCGAGCT GTCGCAGCGC CACGCTTGCC GCCTGCAGAT GGGCGGGTCG
GATCAGTGGG GGAACATCGT CAACGGCATC GAACTGACCC GCCGCATGGA CGGCGTGGAA
GTGTTCGGCG TGACCACGCC GCTGCTCACC ACCGCCGACG GCTCCAAGAT GGGGAAGACC
GCCGCTGGTG CTGTCTGGCT CAACGAGGAT GCGCTCCCGG CCTGGGACTT CTGGCAATAC
TGGCGCAACA CCGATGACCG CGACGTGGGC AAGTTCCTGC GCCTGTTCAC CGACCTGCCG
CTGGACGAGA TCGCCCGCCT CGAAGCGCTC GAGGGCAGCG AGATCAACGC CGCCAAGGTC
GTTCTGGCCA ACGAGGTCAC CAGACTGGTG CGCGGCGAGG AAGCAGCAAA GGCTGCCGAA
GCGACCGCGG CGGCGACCTT TGCGGGCGGC GGCCTCGGGC AGGATCTGCC GACCCTTTCC
GTCGGCGAAT CCGAGATCGG CATCGTCGAT GCGCTCGTCG GTCTGGGCTT TGCCGCCAGC
CGTGGCGAGG CCAAGCGGCT CGTCGCGGGC GGCGGCGCGC GCGTGGATGG CGAGCCAGTG
ACCGACGAGG GTTTCCGCAT TCTTGTGAAT GACAAGGAAA TTCGCGTTTC TTCCGGCAAG
AAGAAGCACG GCATCCTGCG CAAGGCCTGA
 
Protein sequence
MTEYASSLLR LLSERGYIHQ MTDADALDAL AAKQVIPGYI GFDPTAPSLH VGSMVQIMLL 
RRLQQAGHKP IVLMGGGTGK IGDPSFKDEA RKLMTNDVIA ANVASIKTVF ERFLTFGDGP
TDAVMVDNAD WLDRLEYIPF LREVGQHFSV NRMLSFDSVK QRLDREQSLS FLEFNYMILQ
AYDFRELSQR HACRLQMGGS DQWGNIVNGI ELTRRMDGVE VFGVTTPLLT TADGSKMGKT
AAGAVWLNED ALPAWDFWQY WRNTDDRDVG KFLRLFTDLP LDEIARLEAL EGSEINAAKV
VLANEVTRLV RGEEAAKAAE ATAAATFAGG GLGQDLPTLS VGESEIGIVD ALVGLGFAAS
RGEAKRLVAG GGARVDGEPV TDEGFRILVN DKEIRVSSGK KKHGILRKA