Gene Rpal_3886 details

Gene Information

Locus tag: Rpal_3886
Symbol: thrS
ID: 6411566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4169409
End bp: 4171472
Gene Length: 2064 bp
Protein Length: 687 aa
Translation table: 11
GC content: 65%
IMG OID: 642713768
Product: threonyl-tRNA synthetase
Protein accession: YP_001992859
Protein GI: 192292254
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
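
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon (neither is stated in the record); the function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length, and protein length agree.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not from the record).
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # whole codons, stop codon included
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa  # the stop codon encodes no residue
    )


# Values copied from the Gene Information fields of this record.
print(check_cds_lengths(4169409, 4171472, 2064, 687))  # prints True
```

Under these assumptions the span is 4171472 - 4169409 + 1 = 2064 bp, which is 688 codons, or 687 residues plus one stop codon.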


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAACGATA AAACTTCGGG CGACAAGCCT TCCGCGTCCA CCTCCGGCTT CCAATACACG 
CTCTCCAACC TCAAGCCGGT AGCCCCCATG GACCAGATCA CCATCACCTT CCCCGACGGC
AAGACCCGCG AATATCCCCG CGGCACCACC GGGCTCGACA TCGCCAAGGG CATTTCGCCC
TCACTCGCCA AGCGCACCGT CGTGATGGCG CTGAACGGCA CGCTGACCGA CCTCGCCGAT
CCGATCGAAG ACAATGCCCA GATCGACTTC GTCGCCCGTG ACGATGCGCG CGCGCTGGAA
TTGATCCGGC ACGACTGCGC CCACGTGCTC GCCGAAGCGG TGCAAAGCCT GTGGCCGGGC
ACCCAGGTGA CCATCGGCCC GACCATCGAA AACGGCTTCT ACTACGACTT CTTCCGCAAC
GAGCCGTTCA CCCCGGAAGA CTTCGCGGCG ATCGAGAAGA AGATGCGCGA GATCATCGCG
CGCGACAAAC CGTTCACCAA GGAAGTCTGG ACCCGCGACG AAGCCAAGAA GGTGTTCGCC
GACAACGGCG AGGCGTTCAA GGTCGAGCTG GTCGACGCCA TCCCCGAAGA CCAGACCATC
AAGATCTACA AGCAGGGCGA ATGGTTCGAT CTGTGCCGCG GCCCGCACAT GACCTCGACC
GGCAAGATCG GCACCGCCTT CAAGCTGATG AAGGTGGCGG GCGCGTATTG GCGCGGCGAC
AGCAACAATC CGATGCTGAC CCGCATCTAC GGCACCGCCT TCGCCAAGCA GGACGACCTC
GACGCCTACC TTCATCAGAT CGAGGAAGCC GAGAAGCGCG ACCACCGCAA GCTCGGCCGT
GAACTCGACC TGTTCCACTT CCAGGAAGAA GGTCCGGGCG TGGTGTTCTG GCACGCCAAG
GGCTGGAGCC TGTTCCAGTC GCTGGTCGGC TATATGCGCC GCCGCCTCGC CGGCGACTAC
GACGAGGTCA ACGCGCCGCA GATCCTCGAC AAGGTGCTGT GGGAGACCTC GGGCCATTGG
GAATGGTACC GCGAGAACAT GTTCGCGGCG CAGTCCGCCG GCGACGACGC CGAGGACAAG
CGCTGGTTCG CGCTGAAGCC GATGAACTGC CCGGGCCACG TGCAGATCTT CAAACATGGC
TTGAAGAGCT ATCGCGACCT GCCGCTGCGG CTCGCCGAGT TCGGCGTGGT GCATCGCTAC
GAGCCGTCGG GCGCCATGCA CGGCCTGATG CGGGTGCGCG GCTTCACCCA GGACGACGCG
CACGTGTTCT GCACCGAGGC GCAGCTCGCC GAGGAGTGCA TCAAGATCAA CGACCTGATC
CTGTCGACCT ACTCCGACTT CGGCTTCGAG GGCGAGTTGA CGGTGAAGCT GTCGACCCGG
CCGGAGAAGC GCGTCGGCAC CGACGAGATG TGGGATCACG CCGAGCGCGT GATGGCCACG
GTGCTCTCCG AGATCAAGGC CAAGGGCGGC AACCGCATCA AGACCGAGAT CAACCCGGGC
GAAGGCGCGT TCTACGGGCC GAAGTTCGAA TACGTGCTGC GCGACGCGAT CGGCCGCGAT
TGGCAGTGCG GCACCACGCA GGTCGACTTC AACCTGCCGG AGCGGTTCGG CGCGTTCTAC
ATCGACGCCG ACGGCGCCAA GAAGGCACCG GTGATGGTGC ATCGCGCGAT CTGCGGCTCG
ATGGAGCGTT TCACCGGCAT CCTGATCGAG CACTACGCCG GCAACTTCCC GCTGTGGCTG
GCGCCGGTGC AGGTCGTCGT CACCACGATT ACGTCGGAAG GCGACGACTA CGCCAAGAAG
GTGCTGGCGG CGCTGCGCAA GGCCGGCTTG CGCGCCGACA TCGATCTGCG CAACGAGAAG
ATCAACTTCA AGGTGCGCGA GCATTCGCTC GCCAAGGTCC CCGCCCTGCT GGTGGTCGGC
AAGAAGGAGG CCGAAAGCCA CTCGGTCTCC GTCCGCCGCC TCGGCAGCGA GGGCCAGAAG
GTGATGCCCA CCGACGAAGC GATCGCCGCG CTGGTGGACG AAGCGACCCC GCCGGACGTG
AAGCGGATGC GCGGAGCGGC GTAA
 
Protein sequence
MNDKTSGDKP SASTSGFQYT LSNLKPVAPM DQITITFPDG KTREYPRGTT GLDIAKGISP 
SLAKRTVVMA LNGTLTDLAD PIEDNAQIDF VARDDARALE LIRHDCAHVL AEAVQSLWPG
TQVTIGPTIE NGFYYDFFRN EPFTPEDFAA IEKKMREIIA RDKPFTKEVW TRDEAKKVFA
DNGEAFKVEL VDAIPEDQTI KIYKQGEWFD LCRGPHMTST GKIGTAFKLM KVAGAYWRGD
SNNPMLTRIY GTAFAKQDDL DAYLHQIEEA EKRDHRKLGR ELDLFHFQEE GPGVVFWHAK
GWSLFQSLVG YMRRRLAGDY DEVNAPQILD KVLWETSGHW EWYRENMFAA QSAGDDAEDK
RWFALKPMNC PGHVQIFKHG LKSYRDLPLR LAEFGVVHRY EPSGAMHGLM RVRGFTQDDA
HVFCTEAQLA EECIKINDLI LSTYSDFGFE GELTVKLSTR PEKRVGTDEM WDHAERVMAT
VLSEIKAKGG NRIKTEINPG EGAFYGPKFE YVLRDAIGRD WQCGTTQVDF NLPERFGAFY
IDADGAKKAP VMVHRAICGS MERFTGILIE HYAGNFPLWL APVQVVVTTI TSEGDDYAKK
VLAALRKAGL RADIDLRNEK INFKVREHSL AKVPALLVVG KKEAESHSVS VRRLGSEGQK
VMPTDEAIAA LVDEATPPDV KRMRGAA
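
The gene sequence and protein sequence in this record are related by translation table 11. The following is a minimal sketch in Python of how the two can be cross-checked, and how a GC fraction can be computed from sequence text grouped in blocks of ten. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is written out by hand in the standard NCBI codon order (table 11 assigns the same amino acids as the standard code), and only the first 30 bases of the gene are used for brevity.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove spaces and newlines from grouped sequence text; upper-case it."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence in this record.
first_30 = clean("ATGAACGATA AAACTTCGGG CGACAAGCCT")
print(translate(first_30))  # MNDKTSGDKP
```

The output matches the first ten residues of the protein sequence. This sketch does not handle the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG.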