Gene Information |
Locus tag | Saro_1950 |
Symbol | |
ID | 3917265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2068026 |
End bp | 2069573 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444697 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_497224 |
Protein GI | 87199967 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00408] prolyl-tRNA synthetase, family I |
|
|
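The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (this is consistent with the reported gene length) and that the CDS ends in a single stop codon. The variable and function names are illustrative and are not part of the record.

```python
# Values copied from the record above; the names are illustrative only.
start_bp = 2068026
end_bp = 2069573
reported_gene_length_bp = 1548
reported_protein_length_aa = 515


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length_bp)
print(computed_protein_length == reported_protein_length_aa)
```

Both comparisons hold for this record: 2069573 - 2068026 + 1 = 1548 bp, and 1548 / 3 - 1 = 515 aa.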
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCAGC CCGCAATAAA GCACGCGCTC AACGTCAAGC GCGCCGACGA CTTCGCCCAG TGGTACCAGG CCGTGATCGC CGAGGCCGAG CTTGCCGAGG AATCCGGTGT GCGCGGCTGC ATGGTCATCA AGCCTTGGGG TTATGGCATC TGGGAACGTA TCCAGAAGCT TATGGATGCC GAGATCAAGG AGGCAGGCGT CGAGAACTGC TACTTCCCGC TGTTCATCCC GTTGTCCTAC TTCACCAAGG AAGCGGAACA CGTCGAAGGC TTTGCCAAGG AAATGGCGGT CGTCACGCAC CACCGCCTTA TTTCGGACGG AAAGGGCGGC CTGACGCCCG ACCCGGAAGC CAAGCTGGAA GAGCCGCTTG TGGTCCGTCC CACTTCCGAG ACCGTTATCG GCGCGGCGAT GTCTCGCTGG ATTCAGTCCT GGCGGGATCT TCCGCTGCTT ACCAATCAGT GGGCCAACGT CGTGCGCTGG GAAATGCGCA CGCGCATGTT CCTCCGCACC AGCGAGTTCC TCTGGCAGGA AGGCCACACC GCGCACGTCG ACGAGGCGGA CGCGATGAAG GAAACCCTGC GCGCGCTCGA AATGTATCGT GCCTTTGCCG AAGGCCCGCT CGCCATGCCG GTGATCGCGG GACCCAAGCC CGAGAACGAG CGTTTCCCCG GTGCGGTCGA GACCTTCTCG ATCGAGGCCA TGATGCAGGA TGGCAAGGCC CTCCAGGCCG GCACTTCGCA CTACCTAGGC ACGACCTTCG CCAAGGCGGC GGGCATCCAG TACCAGAACA AGGAAGGCCA GCAGGCCCTC GCGCACACCA CGTCGTGGGG CGTCTCCACT CGTCTGATCG GCGGCGTCAT CATGACCCAC GGAGACGACG ATGGTCTGCG AGTGCCCCCT CAGGTCGCAC CGCAGCAGAT CGTCATCCTG CCGATGCTCC GCGATAACGA AGGCGATGAC GCGCTGCTGG CCTATTGCGA GGAAATCCGC GCGTCTCTGG TAAAGCTGTC GGTGTTCGGG GAACGCATCC GCGTTCTCCT CGACAAGCGC CCCGGCAAGG CCACGCAGAA GCGCTGGGCG TGGGTCAAGA AAGGCATGCC GCTCATTCTC GAGATCGGCG GACGCGATGC CGAAGGCGGC CTGGTTTCCG TGCTGCGGCG GGATCGCCTG TGGCGCCAGG ATGCCAAGCC TAACTTCGTT GGCCAGGCCA AGGACGACTT CCTCGCCAGC GCCGCGACCG AACTCGAATC GATCCAGGCT GCACTCTATG ATGAAGCCCG TGCGCGCCGC GATGCGCAGA TCGTGCGCGA CGTGACCGAC CTTGAAGGCC TCAAGGGATA CTTTGCGGAG GGCAACAAGT ACCCGGGATG GGTCGAGATG GGCTGGGCCA AGCCCACCGG CGAAGCGCTC GACAAGGTTG TCGAGCAACT CAAGGCGCTG AAGCTGACCA TCCGCAACAC GCCGATGGAC GCGGAAAAGC CTGTGGGCGC TTGCCCCTTC ACCGGCGAGC CCGCGGTCGA GAAGATCCTG ATCGCGCGGT CCTACTGA
|
Protein sequence | MNQPAIKHAL NVKRADDFAQ WYQAVIAEAE LAEESGVRGC MVIKPWGYGI WERIQKLMDA EIKEAGVENC YFPLFIPLSY FTKEAEHVEG FAKEMAVVTH HRLISDGKGG LTPDPEAKLE EPLVVRPTSE TVIGAAMSRW IQSWRDLPLL TNQWANVVRW EMRTRMFLRT SEFLWQEGHT AHVDEADAMK ETLRALEMYR AFAEGPLAMP VIAGPKPENE RFPGAVETFS IEAMMQDGKA LQAGTSHYLG TTFAKAAGIQ YQNKEGQQAL AHTTSWGVST RLIGGVIMTH GDDDGLRVPP QVAPQQIVIL PMLRDNEGDD ALLAYCEEIR ASLVKLSVFG ERIRVLLDKR PGKATQKRWA WVKKGMPLIL EIGGRDAEGG LVSVLRRDRL WRQDAKPNFV GQAKDDFLAS AATELESIQA ALYDEARARR DAQIVRDVTD LEGLKGYFAE GNKYPGWVEM GWAKPTGEAL DKVVEQLKAL KLTIRNTPMD AEKPVGACPF TGEPAVEKIL IARSY
|
| |
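The gene and protein sequences above can be cross-checked by translating the gene sequence and by recomputing its GC content. The following is a minimal sketch and not the database's own method. It assumes the gene sequence is given in coding orientation (it starts with ATG and ends with TGA, even though the gene is on the minus strand) and that the space-separated blocks of ten are display formatting only. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in the start codons it allows, which does not matter here because this gene starts with ATG. All function names are hypothetical.

```python
# Standard codon assignments (shared by translation table 11), in the
# conventional T, C, A, G ordering of first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence, copied from the record above.
print(translate("ATGAACCAGC CCGCAATAAA G"))  # MNQPAIK
```

Passing the full gene sequence string to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content field listed in the record.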