Gene Dtox_3201 details


Gene Information

Locus tag: Dtox_3201
Symbol:
ID: 8430195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3401663
End bp: 3403381
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 50%
IMG OID: 645035447
Product: prolyl-tRNA synthetase
Protein accession: YP_003192566
Protein GI: 258516344
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
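
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic in Python, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 3401663
end_bp = 3403381

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and is therefore excluded from the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1719 bp
print(protein_length, "aa")  # 572 aa
```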


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.154366
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000045968
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGAGCAT CTCAACTGCT TATCCCGACT TTGCGGGAGA CGCCGGCGGA AGCGGAAGTA 
ATCAGCCATA AGCTGCTTTT GCGGGCCGGT TATATCAGGA GATCGGCGGC AGGTGTCTAT
ACTTACCTGC CGCTGGCGCA AAGAGTATTG AGTAAAATAA AAAGGATTGT GCGAGAGGAA
ATGGACAGGC AGGGCGGGCA GGAGATACTG ATGCCTATTA TGCAGCCGGC CGAGTTATGG
CAGGAATCGG GCCGCTGGGA TGTCTACGGG CCGGAATTGT TTCGCTTGAA GGATAGGCAT
GGCCGGGACT TTGCCCTGGG ACCTACCCAT GAGGAAATTA TCACTGCTCT GGTGAAAAGT
GAGCTGTCCT CTTACAAGCA ACTGCCTTTA TTGCTTTACC AAATACAGAA CAAATACCGG
GATGAGCGCC GTCCCCGTTT TGGATTGCTT AGGGGCCGGG AATTTATTAT GAAGGATCTG
TATTCTTTTG ATGCTGATGA GGCGGGCCTG GATATAACTT ACCGCAAGAT GTATGATGCC
TATGTACGTA TTTTTCAGCG CTGTGGTCTC AGATTCCGGC CGGTGGAAGC CGATTCCGGG
GCTATCGGCG GCAGCGATAC GCATGAGTTC ATGGTTTTGG CCGAGTCGGG AGAGGCTCTG
GTAGTTTTTT GCCAGAACGA AAACTGCAGT TACGCAGCCA ATGTGGAAAA GGCCCAGTCC
TCAGAACTGC CAAAACCCGT TGCTGAGCTG CTGCCCCTGG AAGAAGCGCA TACACCCGGG
CAAAAAACTA TTGCCCAGAT TTCGAAATAT CTTGGCCTGC CGGAATCCGG CCTGATTAAA
ACTCTCTTTT ACGAAACTGA AAGTGAGGTA GTTGCGGCGC TGGTGCGCGG TGACCATGAT
GTAAATGAGA TTAAATTGCA GCGGGTAATA GATTGCGCCA GGCTGGAACT GGCCTCGGAA
GCTATTGTGA CTAAACTAAC CGGGGCGCCG TTGGGTTTTG CCGGTCCTAT TGGTCTGAAA
AATGTGCTGG TGATTGCCGA TTATGAAGTA TCGGTTATGG TTAATGCCGT AACAGGTGCC
AATAAGGCTG ACTACCATTA TAAGAATGTA TGTCCGGGCA GGGATTTTAA AGCCGGCAAA
ATCGCTGATA TTCGTTTTGT CAAAGCCGGG GAGCCTTGTC CCGTCTGCGG CGGTGTGCTG
GCTGAGGCGA AAGGTATTGA AGTGGGACAA GTATTTAAAT TGGGCGATAA ATACAGCAAG
TCCCTGGAGG CTACTTTCCA GGACGAAAAC GGCAAAACGC GCCACTATGT TATGGGTTGT
TACGGTATCG GTGTGAGCCG CACAATGGCT GCCGCTATCG AGCAGCATAA CGACCTAAAC
GGTATTACCT GGCCGGCAGC CATTGCCCCC TTCCATCTGG TCATAGTGCC GGTAAACGTA
AAGGATCAGG CACTGATGGA TATATCGGAA GTATTGTACC GCAGCTTCCT TGAGTCCGGT
GTGGAGGTCG TTCTGGATGA CCGCCCCGAG CGTCCTGGTG TAAAGTTCAA AGACGCCGAT
TTGGTAGGCT ACCCGCTGCG CCTGACTGTA GGCAAAAAGT TTTTAGAAGA AGGTTTGTTG
GAGCTTCGGG AGAGAAGGTC AGGCAAAACT CATTTTCTGA AAGAAGAAGA GATTCTGTCT
TTCATCAAGG ATTTCATTAA AAGTGGAATG GCTTTATAG
 
Protein sequence
MRASQLLIPT LRETPAEAEV ISHKLLLRAG YIRRSAAGVY TYLPLAQRVL SKIKRIVREE 
MDRQGGQEIL MPIMQPAELW QESGRWDVYG PELFRLKDRH GRDFALGPTH EEIITALVKS
ELSSYKQLPL LLYQIQNKYR DERRPRFGLL RGREFIMKDL YSFDADEAGL DITYRKMYDA
YVRIFQRCGL RFRPVEADSG AIGGSDTHEF MVLAESGEAL VVFCQNENCS YAANVEKAQS
SELPKPVAEL LPLEEAHTPG QKTIAQISKY LGLPESGLIK TLFYETESEV VAALVRGDHD
VNEIKLQRVI DCARLELASE AIVTKLTGAP LGFAGPIGLK NVLVIADYEV SVMVNAVTGA
NKADYHYKNV CPGRDFKAGK IADIRFVKAG EPCPVCGGVL AEAKGIEVGQ VFKLGDKYSK
SLEATFQDEN GKTRHYVMGC YGIGVSRTMA AAIEQHNDLN GITWPAAIAP FHLVIVPVNV
KDQALMDISE VLYRSFLESG VEVVLDDRPE RPGVKFKDAD LVGYPLRLTV GKKFLEEGLL
ELRERRSGKT HFLKEEEILS FIKDFIKSGM AL
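
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in Python, assuming the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in their permitted start codons). The names `BASES`, `AMINO_ACIDS`, and `translate` are illustrative and not part of the record.

```python
# Codon table in the NCBI layout: bases ordered T, C, A, G, with the
# first codon position varying slowest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is
    ignored. Characters other than A, C, G, T raise ValueError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (
            16 * BASES.index(codon[0])
            + 4 * BASES.index(codon[1])
            + BASES.index(codon[2])
        )
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break  # stop codon: end of the coding sequence
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
prefix = "ATGAGAGCAT CTCAACTGCT TATCCCGACT"
print(translate(prefix))  # MRASQLLIPT

# Last 9 bases of the gene sequence above, ending in the stop codon TAG.
suffix = "GCTTTATAG"
print(translate(suffix))  # AL
```

Both results match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the same comparison over all 572 residues.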