Gene Pars_1361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1361 
Symbol 
ID5054181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1224088 
End bp1225659 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content53% 
IMG OID640468907 
Producthypothetical protein 
Protein accessionYP_001153576 
Protein GI145591574 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.26972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0191854 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTAAGA TCGGTGTAGT TGTCAAATCG CCGTCTCTGT ACTACTACGT CTTTAAGCCG 
TTTAGAGGCG TTGAACTTGA CGTAGGCTCT TTTGTGGCGA CGGAGATTGA TGGAGTTAGA
GTCATTTCTA GGGTAGTAGC CATACGGCAT AGGAACGCCG TGGTGGACCC GCGCCTAATT
GCTCACTTCG ACGAAGAGTC TACAGTAAAG GAGATTAAAG AAACTCTTGG AATAGAAGAG
GCGCTGTACT ACACAGAGGC CAAAGCCGTG GTGCTCGGCG CCAGGAGCGG CCGGAAAATC
CTCAAACCGC AGAAGCCCGT CAAGCCTCTG AGCTACGTGT ACAGCACCAC ACCCGAGGAA
CTTGAACAAT TCTTCGCGCC TGGAGAAGAA GGAACATATA TACCAATAGG AAAGATAAAA
GGCACTTCTA TTCCTGCATA CATAGACGCT GAGAGGCTAG TCACGCACCA CTGCGCAATT
CTAGCCAGCA CCGGCGCCGG GAAGAGCTAC CTCGCCGGTG TGATAGTGGA GAGGCTCTCG
GCGTTGGACA TCCCAATAGT AGTGATAGAC CCCCATGGAG AGTACTCCTC TATGGCGGTG
CCGGCCACAG AAGAGGGCAA GCACGTATCG GAAAAGGTGA GGATTTTCGT CGTGGGCAAA
ACAGACGTCA CTCACCTCGA CCAAGCCTTT AAGAAACGCT ACGGCATCCC CCGCACATAC
ACGAGAATCG GCCTAAACCC ACGTAGCATT CCCCTACGCA CCCTAGAAAA GATCCTAGAC
CTACTATACG GCCTTACGGA CGCACAACGA CGAATACTTG AAGAGGGGTG GCAAAGCGCC
ACCAGCTACG GCGAGCGACA ACCTCTCACA TCCGTAGAAG AGCTAATAAA AGAAGTCCTA
GAAGGCGGCA AACACGCAGC CCCGCCCGGC TTTGCGGGGG AGATGTCACT AAGAGGACTT
GAAGGACGTC TAAGAGCCCT CTTCTACACA AGCCCCGTAT TCATTACGAG ATACGGCGAG
ACGTATCAAG GAGAACCAAT CAAGCTAATA GACCCCGAGA TGTACCTCAC CACTCCGTCA
ATACACATCT TCGACATTTC CGGCCTCGAC ATCCTCGACC AGCAACTCTT CCTCGCCGTA
CTCCTAGACC AGCTCTACAG AGTATCCACA CTGAGGAAAA ACCTCACAAC ACTCCTCATA
ATCGAAGAAG CCCACAACTA CGCCCCGGCG GCCGGCACTT CAGTAGCCAA AAGCTACATA
GCAAAAATAG CAAGAGAAGG CAGGAAATTC GGCCTGGGGC TGTGCCTCAT CACCCAACGG
CCTACAAAGT TGGACCCAGA CGTCGTATCC CAGGCAATGA CCCAGATATT CAAAAGAATG
ATAAACCCCC ACGACTTGCG ATACGTAGCC ACAGTCGCGG AGCACCTAGA CGACCCTAGG
CCGCTGAGAA CCCTAGACGA GGCAGAAGCA GTAGTAACAG GAATCTCAGT CCCAGTGCCG
CTAATGATAG TAGTTGACCA AAGGTGGACG CAACACGGCG GAGTAACCCC AAGCATAAGA
AGACAAGTAT AA
 
Protein sequence
MAKIGVVVKS PSLYYYVFKP FRGVELDVGS FVATEIDGVR VISRVVAIRH RNAVVDPRLI 
AHFDEESTVK EIKETLGIEE ALYYTEAKAV VLGARSGRKI LKPQKPVKPL SYVYSTTPEE
LEQFFAPGEE GTYIPIGKIK GTSIPAYIDA ERLVTHHCAI LASTGAGKSY LAGVIVERLS
ALDIPIVVID PHGEYSSMAV PATEEGKHVS EKVRIFVVGK TDVTHLDQAF KKRYGIPRTY
TRIGLNPRSI PLRTLEKILD LLYGLTDAQR RILEEGWQSA TSYGERQPLT SVEELIKEVL
EGGKHAAPPG FAGEMSLRGL EGRLRALFYT SPVFITRYGE TYQGEPIKLI DPEMYLTTPS
IHIFDISGLD ILDQQLFLAV LLDQLYRVST LRKNLTTLLI IEEAHNYAPA AGTSVAKSYI
AKIAREGRKF GLGLCLITQR PTKLDPDVVS QAMTQIFKRM INPHDLRYVA TVAEHLDDPR
PLRTLDEAEA VVTGISVPVP LMIVVDQRWT QHGGVTPSIR RQV