Gene Pars_1315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1315 
Symbol 
ID5055924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1186556 
End bp1187806 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content59% 
IMG OID640468861 
ProductRNA modification protein 
Protein accessionYP_001153530 
Protein GI145591528 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0621] 2-methylthioadenine synthetase 
TIGRFAM ID[TIGR00089] RNA modification enzyme, MiaB family
[TIGR01578] MiaB-like tRNA modifying enzyme, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.107305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAAGGG CCTATATAGA GACTTATGGT TGTTGGCTGG CTAAGGCCGA TGCGGAGATC 
ATCAGACAGA GACTGGGGCT TGTGGCTGTG GAAAGGCCCG AGGATGCCGA CGTGGTGATG
ATCTATACAT GCGCGGTGAG GGAAGACGGC GAGGTTCGCC AGCTTGCAAG AATAAGGGAG
CTTGCGGGTC TACGCAAGGA GGTTGTGGTG GCGGGTTGTC TCGCCAAGCT GAGGCCGTAT
ACGATTAAAT CCGCAGCGCC TAACGCGCGT CTGTTATACC CCTCTGAGGT GGAGGGTGGG
CAGAAGAGGG AGATGAAGGT ACTGCCCAGG TACGAGGGCG GCGTCATTTA CACGGTGCCT
CTACAGGTCG GTTGCCTCGG CAACTGCACC TTCTGCGCTA CTAAGTACAC GCGGGGTGGC
GCCGGCTACG TCAAGAGTGC CAATCCCGAC GACGTGGTGC GCCACGTGAA GGAGGCCGTG
GCGAGGGGGG CGAAGGAGAT CTACCTGACA GGTCAAGACG TCATTACCTA CGGTTTCGAC
ATGAGGTGGA GGCCCGGCTG GAGCCTGCCG GATCTCCTGG AGCGGATATT GAGGGAGGTG
GAGGGGGAGT ACAGGGTTAG GATAGGCATG TCGGAGCCTT GGGTATTTGC GAGGTTTGCA
GATCGTCTCC TCGACATTGT TAAGGGCGAC CGCCGGGTGT ACCGATACTT CCACCTCCCG
GTGCAGTCCG GAAGCGATAG GGTGCTTAGA GCGATGGGGC GGAGATACAC CGTTGATGAG
TATAGGGAGC TTGTGAGGAA GATTAGGAAG ACGTTGGGAG AGTTCGCCTT TGTCGCCACC
GATATCATAG TCGGCTTCCC CGGCGAGGCT GAGGACGACT TCTGGGAGTC GGTGAAGCTG
GTGGAGGAGC TCCAGCTGGA CAAGGTGCAC GTGGCTAGGT TTAGCCCGAG GCCCTTCACA
GAGGCCGCTG TCATGCCTAG ACAAGTCCCC GACGCGGAGA AGAAGAGGAG GAGCAAAATC
CTGAGTGAGG TCTCTCTTAG AGTGGCCCGT CTGAGAAACG GCCTGCGTGT GGGGAGCCGT
GACGTCGTCT TAATCGACGA GGTTGACCAC GGGTTGGTTG TCGGCCGCGC AAGCGACTAC
AGACAGGTGG TGGTGAAAAG GGGCCACGGC GACGGCCTCA TTGGCCAGTA CAGAGAAGTC
CAGATAGTCG CCGCTGGCGC AGTCTACCTC TACGGCGACA TTGTAGAGTA G
 
Protein sequence
MARAYIETYG CWLAKADAEI IRQRLGLVAV ERPEDADVVM IYTCAVREDG EVRQLARIRE 
LAGLRKEVVV AGCLAKLRPY TIKSAAPNAR LLYPSEVEGG QKREMKVLPR YEGGVIYTVP
LQVGCLGNCT FCATKYTRGG AGYVKSANPD DVVRHVKEAV ARGAKEIYLT GQDVITYGFD
MRWRPGWSLP DLLERILREV EGEYRVRIGM SEPWVFARFA DRLLDIVKGD RRVYRYFHLP
VQSGSDRVLR AMGRRYTVDE YRELVRKIRK TLGEFAFVAT DIIVGFPGEA EDDFWESVKL
VEELQLDKVH VARFSPRPFT EAAVMPRQVP DAEKKRRSKI LSEVSLRVAR LRNGLRVGSR
DVVLIDEVDH GLVVGRASDY RQVVVKRGHG DGLIGQYREV QIVAAGAVYL YGDIVE