Gene TM1040_1852 details

Gene Information

Locus tag: TM1040_1852
Symbol:
ID: 4077877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1952289
End bp: 1953644
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 62%
IMG OID: 638007168
Product: prolyl-tRNA synthetase
Protein accession: YP_613847
Protein GI: 99081693
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
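
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1952289, 1953644)
print(length_bp)                  # 1356
print(protein_length(length_bp))  # 451
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.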


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.116732
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCTGT CTCGCTATTT TCTGCCCGTC CTCAAGGAGA CCCCCTCCGA GGCGCAGGTC 
GTCAGCCACC GTTACATGCT GCGCGCCGGC ATGATCAAAC AGTCCGCCGC CGGGATCTAT
TCCTGGCTGC CTCTGGGCTA CCGCGTGCTG AAAAAGATCG AAGGCATCGT TCACGAAGAG
CAGATGCGCG CCGGTCACAT CCCGATGCTG ATGCCCACCA TCCAGTCGGC TGACCTGTGG
CGCGAGTCCG GGCGCTATGA CGCCTACGGC GAGGAGATGC TGCGCATCCG CGACCGCCAT
GACCGCGACA TGCTCTTTAC GCCAACCGCC GAGGAACTCA TCACCGACAT CTTCCGCGCC
AATGTCTCAA GCTACAAAGA CCTGCCGCTG ACGATGTATC AGATCCAGTG GAAGTTCCGC
GACGAGATCC GCCCGCGCTT TGGCGTGATG CGGGGGCGCG AATTCTACAT GAAGGACGGT
TACAACTTCG ACCTCACCAA AGAGGACGCG CTGCACGCCT ACAACCGCCA TCTGGTCACC
TACCTGCGCA CCTATGAGCG CATGGGGCTA CAGGCGATCC CGATGCGCGC CGATGGTGGC
CCGATTGGTG GCGACTACAC CCACGAATTC CTCGTGCTGG CTGAAACCGG CGAATCGGAG
GTCTTTTATG ACAGCGAGAT CACCGATCTG ACGTTCGGCG CCCGCGAGAT CGACTATGAC
AATGTTGAAC AGTGTCAGGC GGTGCTCGAA GAGTTCACCT CACGTTACGC CCGCACCGAC
GAGACCCACG ACGAGGCGCT GTTCAACGCG GTCCCAGAAG AGCGCCGCCG CGTCGCGCGC
GGGATCGAGG TCGGCCAGAT CTTCTACTTC GGCACCAAAT ACTCCGAGGC GCTGGGCGCC
ACCGTGCAGA CCGCCGATGG CCAGAGCGTG CCGGTCCACA TGGGCTCGCA CGGGATTGGT
GTGTCGCGTC TTCTCGGCGC CATCATCGAG GCCAGCCACG ACGACAAGGG CATCATCTGG
CCCGAAGGTG TGACCCCCTT CCACTGCGGC ATCGTGAACC TCAAGCAGGG CGACGATGAA
GCGGATGCCG CCTGCGAGCA GCTCTATGCG GCGCTCACCG CCATCGGTCT GGAGCCGCTT
TATGATGATC GCAAGGAACG TGCGGGCGGC AAATTCGCCA GCATGGATCT CATTGGCCTG
CCATGGCGCA TCACCGTCGG CCCGCGCGGT CTGAAGAACG GCGTCGTCGA AGTGACCAGC
CGCCGTACCG GCGAAAGCGA AGAAATGAGC CCGGAAGACG CGGTAAAGAA AATCGCCGCG
ATCTACGCCA ACCACCCCAC GCCGCGCGGC TTCTGA
 
Protein sequence
MRLSRYFLPV LKETPSEAQV VSHRYMLRAG MIKQSAAGIY SWLPLGYRVL KKIEGIVHEE 
QMRAGHIPML MPTIQSADLW RESGRYDAYG EEMLRIRDRH DRDMLFTPTA EELITDIFRA
NVSSYKDLPL TMYQIQWKFR DEIRPRFGVM RGREFYMKDG YNFDLTKEDA LHAYNRHLVT
YLRTYERMGL QAIPMRADGG PIGGDYTHEF LVLAETGESE VFYDSEITDL TFGAREIDYD
NVEQCQAVLE EFTSRYARTD ETHDEALFNA VPEERRRVAR GIEVGQIFYF GTKYSEALGA
TVQTADGQSV PVHMGSHGIG VSRLLGAIIE ASHDDKGIIW PEGVTPFHCG IVNLKQGDDE
ADAACEQLYA ALTAIGLEPL YDDRKERAGG KFASMDLIGL PWRITVGPRG LKNGVVEVTS
RRTGESEEMS PEDAVKKIAA IYANHPTPRG F
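
The relationship between the gene sequence and the protein sequence can be reproduced with a short, dependency-free sketch. It assumes the gene sequence is given 5' to 3' on the coding strand, as printed above, and it builds the codon table from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG read as M) are not handled.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGCGCCTGT CTCGCTATTT TCTGCCCGTC"
print(translate(first_block))  # MRLSRYFLPV
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.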