Gene Dgeo_0570 details


Gene Information

Locus tag: Dgeo_0570
Symbol:
ID: 4058581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 609253
End bp: 610746
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 63%
IMG OID: 641229584
Product: prolyl-tRNA synthetase
Protein accession: YP_604041
Protein GI: 94984677
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00408] prolyl-tRNA synthetase, family I
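
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the coding sequence is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 609253
END_BP = 610746
GENE_LENGTH_BP = 1494
PROTEIN_LENGTH_AA = 497


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions, 610746 - 609253 + 1 gives 1494 bp, and 1494 / 3 - 1 gives 497 aa, which agrees with the listed values.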


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.267387
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCTA TGACGAAGGC CGAGGGCAAG CAGGACAAGA AGGCGCAGCA GTACGGGGTG 
ACGCCTCAGA GCGTGGATTT CAACGACTGG TACAACGAGG TGGTCAAGAA GGCCGATCTG
GCCGATAACA GCCCCGTCGC GGGCGCGATG GTGGTCAAGC CCTACGGCAC CGCCCTCTGG
GAGAACATCC AGCGCTGGCT GGACGACCGC TTCAAGGCGA CCGGCCACGA GAGCCTGATC
TTCCCCACCC TGATCCCGAT GAACTTCATC ACCAAAGAAG CCGATCACGT GGAGGGCTTT
GCCCCCGAAC TGTTCACGGT GGACCGCATC GGCACCGAGC AACTGACCGA GCCGTACGTG
CTGCGGCCCA CCTCCGAAAC AATCATCGGC TACATGTGGA GCGGGTGGCT CAACTCCTAC
CGTGACCTTC CCTTCCTGCA CTATCAGTGG GGCAGCGTAT TCCGCGCGGA ACTGCGGACG
AAGGCCTTCT TGCGCACCTC TGAGTTCTAT TGGCACGAGG GCCACACCGC CCACGCCTCC
GAAGAAGAAG CCCGGCGCGA GGTCCGCCAG ATCCTCGACC TCTACCACGA GTTCTGCCGC
GACATTCTCG CGCTGCCCGT CGTGCGCGGT GAGAAGACGG CCAGCGAGCG ATTCGCCGGG
GCGGTCGCTA CCTACTCTAT CGAGGGCATG ATGCGCGACG GCAAGGCGCT GCAATCAGGC
ACCTCGCATT ACCTGGGGCA GAATTTCTCC AAGGCCTTTG ACGTGAAGTT CCAGACGCGC
GAGCAGCGTG AGGAGTACGC CTACACCACG AGCTGGGCGA TCTCCAGCCG CATCATCGGC
GCAATCATCA TGACGCACGG GGACGACTTC GGCCTGATCA TGCCGCCCCG CATCGCGCCC
ATCCAGGTGG TCGTGATTCC GGTGAGCCGC AAGGAGAACT TCGATCAGAT GGTGGCGGAG
GGCGAGAAGC TGGCCCATGA ACTGCGCGCG CAGGGCCTCC GCGTGAAGGT GGACCGGCGC
GAAGGCGTCA CCAACGGTTT CAAGTACAAC GACTGGGAAC TCAAGGGGGT GCCTGTCCGC
ATCGAGCTTG GCCCGCGCGA TCTGGAGCAG GGCGTCGTGG TGGTTAAAAA CCGCAACGCC
GAGGAGAAGG AGACGCTGCC GCGCGAAGAG GCAATCCGCG GCATGGCCAA CCGCCTGGAC
AGCATCCACA ATTGGCTGCT GCAGCGCGCA ACGGACTTCC TGCTGACACA TACCGTCCCC
GCTGACAGCT ACGAGGAGCT GAAGAACGCC ATCGAGCACG GCAACTGGGT GCGGGCCTTC
CACTGCGGAA ACGCCGAATG CGAGGCCCAA ATCAAAGAGG ACACCAAGGC CACCACCCGC
AATATTCCTC TCGACGACGC CGAGTTCTTC AATGAGCGGG AGGAGGGCGT GTGCGTGAAG
TGCGGTCAGC CGAGCGCGTA CGGCAAGCGG GTGATTTTCG GGCGGCAGTA CTGA
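
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the 10-base blocks form one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGCCCCCTA TGACGAAGGC CGAGGGCAAG CAGGACAAGA AGGCGCAGCA GTACGGGGTG"

seq = join_blocks(first_line)
print(len(seq))                      # 60
print(f"{gc_fraction(seq):.0%}")     # GC share of this one line only
```

Passing the full sequence text to `join_blocks` should give a string whose length can be compared with the Gene Length field, and whose GC fraction can be compared with the GC content field.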
 
Protein sequence
MPPMTKAEGK QDKKAQQYGV TPQSVDFNDW YNEVVKKADL ADNSPVAGAM VVKPYGTALW 
ENIQRWLDDR FKATGHESLI FPTLIPMNFI TKEADHVEGF APELFTVDRI GTEQLTEPYV
LRPTSETIIG YMWSGWLNSY RDLPFLHYQW GSVFRAELRT KAFLRTSEFY WHEGHTAHAS
EEEARREVRQ ILDLYHEFCR DILALPVVRG EKTASERFAG AVATYSIEGM MRDGKALQSG
TSHYLGQNFS KAFDVKFQTR EQREEYAYTT SWAISSRIIG AIIMTHGDDF GLIMPPRIAP
IQVVVIPVSR KENFDQMVAE GEKLAHELRA QGLRVKVDRR EGVTNGFKYN DWELKGVPVR
IELGPRDLEQ GVVVVKNRNA EEKETLPREE AIRGMANRLD SIHNWLLQRA TDFLLTHTVP
ADSYEELKNA IEHGNWVRAF HCGNAECEAQ IKEDTKATTR NIPLDDAEFF NEREEGVCVK
CGQPSAYGKR VIFGRQY