Gene Saro_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1950 
Symbol 
ID3917265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2068026 
End bp2069573 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content64% 
IMG OID640444697 
Productprolyl-tRNA synthetase 
Protein accessionYP_497224 
Protein GI87199967 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00408] prolyl-tRNA synthetase, family I 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCAGC CCGCAATAAA GCACGCGCTC AACGTCAAGC GCGCCGACGA CTTCGCCCAG 
TGGTACCAGG CCGTGATCGC CGAGGCCGAG CTTGCCGAGG AATCCGGTGT GCGCGGCTGC
ATGGTCATCA AGCCTTGGGG TTATGGCATC TGGGAACGTA TCCAGAAGCT TATGGATGCC
GAGATCAAGG AGGCAGGCGT CGAGAACTGC TACTTCCCGC TGTTCATCCC GTTGTCCTAC
TTCACCAAGG AAGCGGAACA CGTCGAAGGC TTTGCCAAGG AAATGGCGGT CGTCACGCAC
CACCGCCTTA TTTCGGACGG AAAGGGCGGC CTGACGCCCG ACCCGGAAGC CAAGCTGGAA
GAGCCGCTTG TGGTCCGTCC CACTTCCGAG ACCGTTATCG GCGCGGCGAT GTCTCGCTGG
ATTCAGTCCT GGCGGGATCT TCCGCTGCTT ACCAATCAGT GGGCCAACGT CGTGCGCTGG
GAAATGCGCA CGCGCATGTT CCTCCGCACC AGCGAGTTCC TCTGGCAGGA AGGCCACACC
GCGCACGTCG ACGAGGCGGA CGCGATGAAG GAAACCCTGC GCGCGCTCGA AATGTATCGT
GCCTTTGCCG AAGGCCCGCT CGCCATGCCG GTGATCGCGG GACCCAAGCC CGAGAACGAG
CGTTTCCCCG GTGCGGTCGA GACCTTCTCG ATCGAGGCCA TGATGCAGGA TGGCAAGGCC
CTCCAGGCCG GCACTTCGCA CTACCTAGGC ACGACCTTCG CCAAGGCGGC GGGCATCCAG
TACCAGAACA AGGAAGGCCA GCAGGCCCTC GCGCACACCA CGTCGTGGGG CGTCTCCACT
CGTCTGATCG GCGGCGTCAT CATGACCCAC GGAGACGACG ATGGTCTGCG AGTGCCCCCT
CAGGTCGCAC CGCAGCAGAT CGTCATCCTG CCGATGCTCC GCGATAACGA AGGCGATGAC
GCGCTGCTGG CCTATTGCGA GGAAATCCGC GCGTCTCTGG TAAAGCTGTC GGTGTTCGGG
GAACGCATCC GCGTTCTCCT CGACAAGCGC CCCGGCAAGG CCACGCAGAA GCGCTGGGCG
TGGGTCAAGA AAGGCATGCC GCTCATTCTC GAGATCGGCG GACGCGATGC CGAAGGCGGC
CTGGTTTCCG TGCTGCGGCG GGATCGCCTG TGGCGCCAGG ATGCCAAGCC TAACTTCGTT
GGCCAGGCCA AGGACGACTT CCTCGCCAGC GCCGCGACCG AACTCGAATC GATCCAGGCT
GCACTCTATG ATGAAGCCCG TGCGCGCCGC GATGCGCAGA TCGTGCGCGA CGTGACCGAC
CTTGAAGGCC TCAAGGGATA CTTTGCGGAG GGCAACAAGT ACCCGGGATG GGTCGAGATG
GGCTGGGCCA AGCCCACCGG CGAAGCGCTC GACAAGGTTG TCGAGCAACT CAAGGCGCTG
AAGCTGACCA TCCGCAACAC GCCGATGGAC GCGGAAAAGC CTGTGGGCGC TTGCCCCTTC
ACCGGCGAGC CCGCGGTCGA GAAGATCCTG ATCGCGCGGT CCTACTGA
 
Protein sequence
MNQPAIKHAL NVKRADDFAQ WYQAVIAEAE LAEESGVRGC MVIKPWGYGI WERIQKLMDA 
EIKEAGVENC YFPLFIPLSY FTKEAEHVEG FAKEMAVVTH HRLISDGKGG LTPDPEAKLE
EPLVVRPTSE TVIGAAMSRW IQSWRDLPLL TNQWANVVRW EMRTRMFLRT SEFLWQEGHT
AHVDEADAMK ETLRALEMYR AFAEGPLAMP VIAGPKPENE RFPGAVETFS IEAMMQDGKA
LQAGTSHYLG TTFAKAAGIQ YQNKEGQQAL AHTTSWGVST RLIGGVIMTH GDDDGLRVPP
QVAPQQIVIL PMLRDNEGDD ALLAYCEEIR ASLVKLSVFG ERIRVLLDKR PGKATQKRWA
WVKKGMPLIL EIGGRDAEGG LVSVLRRDRL WRQDAKPNFV GQAKDDFLAS AATELESIQA
ALYDEARARR DAQIVRDVTD LEGLKGYFAE GNKYPGWVEM GWAKPTGEAL DKVVEQLKAL
KLTIRNTPMD AEKPVGACPF TGEPAVEKIL IARSY