Gene Pars_1240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1240 
Symbolsat 
ID5054418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1122075 
End bp1123445 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content59% 
IMG OID640468785 
Productsulfate adenylyltransferase 
Protein accessionYP_001153458 
Protein GI145591456 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGCCC CCCTTGAACC GCATGGAGGT CGGCTGGTGT ACAACGTAGT GGAGGATAGG 
GACAAGGCGG CTGGCATGGC TGCTGGCTTG ACGAAGCTGG AGATTGAGCC GACGCTGGGC
CCAGACGGGG CTCCGATTAG GAACCCCTAC AGGGAGGTTA TGTCCATCGC CTACGGCTTC
TTCAGCCCTG TGGAGGGCTT TATGACTAGG AACGAGGTGG AGTCGGTCTT GAGGGAGAGG
CGCCTCTTGT CGGGCTGGCT ATTCCCCTTC CCCCTAATAT TCGACGTAGA TGAGGAGAAG
CTAAAGACCG CAGGCGTCAA GGAGGGGGAT TCCGTATTGT TAACCCTCAA GGGCAGGCCC
TTCGCCGTGT TGAACGTCGA GGAGGTGTGG AAGTTGCCGG ACCGCAAAGA GTTGGCGGAC
GCCGTCTTCG GCACGCCGGA GAAGAACGGC GAGGTGGTGA AGAGGCGTTT CGACGAGAAG
CACCCGGGCT GGCTCATATA CCGAATGATG AGGCCTGTGG CTCTCGCCGG GAAGGTGGCG
GTGGTCAACC CGCCTAGGTT CAAGGAGCCG TACTCGCGGT TCTGGATGCC GCCCCGCGTC
TCTAGGGAGT ACGTTAGGCA GAGGGGGTGG AAAATCGTCG TGGCTCATCA AACCAGGAAC
GTGCCGCATA TCGGCCACGA AATGCTTATG AAGAGGGCGA TGTTCGTGGC TGGGGGCGAT
AGGCCTGGCG ACGCCGTTCT CGTCAACGCC ATTATCGGGG CCAAGCGGCT GGGGGACTAC
GTAGACGAGG CCATCTTGGA GGGACACGAG GCGCTTAACA AGGCGGGCTA CTTCCACCCC
AACCGCCACG TGGTGACCAT GACGCTTTGG GATATGCGAT ACGGAAACCC GCTGGAGTCT
CTCCTACACG GCATCATACG GCAGAACATG GGCGCCACAC ACCACATGTT CGGCCGAGAC
CACGCCGCCA CTGGCGACTA CTACGACCCA TACTCCACCC AGTACCTCTG GACAAGGGGC
CTCCCCAGCT ATGGGATAAA CGAGCCGCCC CACGTGACTG ACAAGGGGCT GAGGATAAGG
CCGGTGAATT TGGGCGAGTT CGCCTACTGC CCAGTCTGCG GCGAGTACAC ATATCTAGGC
ATATCCTACG GCGACTACAA AGAGGCCCCC CTCTGCGGCC ACACGCCGGA GCGGATAAGC
GGCTCCTTCC TCCGCGGGGT GATAATAGAG GGGTTGAGGC CGCCAAAAGT AGTAATGAGA
CCCGAGGTAT ACGACGTAAT TGTGAAGTGG TGGAGGGTCT ACGGCTACCC CTACGTGACT
GACAAATACT TAAAGATAAA AGAGCAGGAG CTTGAAGTGG AGCTCCAGTA G
 
Protein sequence
MYAPLEPHGG RLVYNVVEDR DKAAGMAAGL TKLEIEPTLG PDGAPIRNPY REVMSIAYGF 
FSPVEGFMTR NEVESVLRER RLLSGWLFPF PLIFDVDEEK LKTAGVKEGD SVLLTLKGRP
FAVLNVEEVW KLPDRKELAD AVFGTPEKNG EVVKRRFDEK HPGWLIYRMM RPVALAGKVA
VVNPPRFKEP YSRFWMPPRV SREYVRQRGW KIVVAHQTRN VPHIGHEMLM KRAMFVAGGD
RPGDAVLVNA IIGAKRLGDY VDEAILEGHE ALNKAGYFHP NRHVVTMTLW DMRYGNPLES
LLHGIIRQNM GATHHMFGRD HAATGDYYDP YSTQYLWTRG LPSYGINEPP HVTDKGLRIR
PVNLGEFAYC PVCGEYTYLG ISYGDYKEAP LCGHTPERIS GSFLRGVIIE GLRPPKVVMR
PEVYDVIVKW WRVYGYPYVT DKYLKIKEQE LEVELQ