Gene Saro_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2031 
Symbol 
ID3917678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2165934 
End bp2167397 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content65% 
IMG OID640444783 
Productglutamyl-tRNA synthetase 
Protein accessionYP_497304 
Protein GI87200047 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0008] Glutamyl- and glutaminyl-tRNA synthetases 
TIGRFAM ID[TIGR00464] glutamyl-tRNA synthetase, bacterial family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGCG CCACTGACAC CGTTTCTGCA GGATCGACGC CCCGGGACAA GGTCGTCACC 
CGTTTCGCCC CCTCGCCCAC GGGTTTCCTG CACATTGGCG GCGCGCGCAC GGCGCTGTTC
AACTGGCTCT ATGCGCGCCA CCATGGCGGC ACCTACCTCC TGCGCATCGA GGATACCGAC
CGTGCGCGTT CGACGGACGC CGCGATCGAC GCGATCTTCG ACGGACTCGA ATGGCTGGGC
CTTGGCGGCG ACGAGCCGGC GGTGTTCCAG TTCGCGCGGT CGGACCGGCA CGCCGAAGTG
GCGCACAAGC TGCTCGAGGC CGGCCATGCC TATCGCTGCT ATCTGACCCA GGAAGAACTG
GCGGCACGGC GCGAACTGGC GCAGGCGGAA CGGCGCCCGT TCCGCATCGA CAGCGAATGG
CGCGATGCCA CGCCCGACCA GTGGCCCGCC GATCAGTCCT ATGTCGTGCG GATGAAGGCC
CCGCGCGAAG GCGAGACGAC GATCGTCGAC AAGGTGCAGG GTTCGATCAC CGTCCAGAAC
AGCGAACTGG ACGACTTCAT CATCCTGCGT TCCGACGGCA CGCCGACCTA CATGCTGGCG
GTGGTGGTGG ACGACCACGA CATGGGCGTG ACGCACGTCA TCCGCGGCGA CGACCACATC
AACAACGCGT TCCGCCAGCT CGTCATCATC CGCGGCATGC ATGCCATCGA AGGCGGTTGG
CCGGACCCCG TCTATGCTCA CATTCCGCTG ATCCACGGCG CGGACGGGGC GAAGCTTTCA
AAGCGCCACG GAGCGCTGGG CGTCGACGCC TATCGCGACG AGATGGGGCT CCTGCCCGAG
GCGGTGTTCA ACTATCTGCT GCGTCTTGGC TGGGGCCACG GCGACGAGGA AATCATCAGC
CGCGAACAGG CAGTGGCGTG GTTCGATATT GGCGACGTCA ACAAGGGCGC ATCACGGTTC
GACCTGAAGA AGCTGCTCAA CCTCAACGGG CACTATATCC GCGAGGCGGA CGATGCACGC
CTGGCAGCCC TGGTCGCGCC CCGCCTGGCG ACGCTGGCTC CCGGGTTCGC GCCCGACAAG
GGGCTGGACC TGCTGACCCG GGCGATGCCG GTGCTGAAGG TGCGCGCGGC GGATATCAAC
GAACTGGCAG CCGGTTCGGT GTTCCTTTTC GCCCAGCGTC CGCTGGCCAT GGCCGAGAAG
GCAGCAAGCC TGTTGACCGA CGACGCACGT GCGATCCTGA CCAAGGTTGC AGGGGTCCTG
GAGGCTGAAA ACGTCTGGAC AACCGGAGTG CTCGAGGCCA CGGTAAAGCA AATGGCCGAG
GAGCTTGGGA TAGGCTTGGG CAAGATTGCC CAACCGTTGC GCGCAAGTTT GACGGGACAG
ACTACCTCGC CGGGAATTTT CGACGTATTG GCGCTGCTCG GCAAGGACGA GTCGCTGTCG
CGCATTCGCG ATCAGGCGGC CTGA
 
Protein sequence
MASATDTVSA GSTPRDKVVT RFAPSPTGFL HIGGARTALF NWLYARHHGG TYLLRIEDTD 
RARSTDAAID AIFDGLEWLG LGGDEPAVFQ FARSDRHAEV AHKLLEAGHA YRCYLTQEEL
AARRELAQAE RRPFRIDSEW RDATPDQWPA DQSYVVRMKA PREGETTIVD KVQGSITVQN
SELDDFIILR SDGTPTYMLA VVVDDHDMGV THVIRGDDHI NNAFRQLVII RGMHAIEGGW
PDPVYAHIPL IHGADGAKLS KRHGALGVDA YRDEMGLLPE AVFNYLLRLG WGHGDEEIIS
REQAVAWFDI GDVNKGASRF DLKKLLNLNG HYIREADDAR LAALVAPRLA TLAPGFAPDK
GLDLLTRAMP VLKVRAADIN ELAAGSVFLF AQRPLAMAEK AASLLTDDAR AILTKVAGVL
EAENVWTTGV LEATVKQMAE ELGIGLGKIA QPLRASLTGQ TTSPGIFDVL ALLGKDESLS
RIRDQAA