Gene Pars_1558 details


Gene Information

Locus tag: Pars_1558
Symbol:
ID: 5054991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1409447
End bp: 1411219
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 54%
IMG OID: 640469099
Product: putative ATPase RIL
Protein accession: YP_001153764
Protein GI: 145591762
COG category: [R] General function prediction only
COG ID: [COG1245] Predicted ATPase, RNase L inhibitor (RLI) homolog
TIGRFAM ID:
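
The start, end, gene length and protein length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (neither is stated in the record). The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon of the CDS is a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp as listed for Pars_1558.
print(feature_lengths(1409447, 1411219))  # (1773, 590)
```

Under these assumptions the result matches the listed 1773 bp and 590 aa.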


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0000149119
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
GTGGTACGCA TCGCCGTCGT TGACGTAGAC TCTTGTCAGC CTAAGAAATG TGGTCATGAG 
TGTGTGAAGT ACTGCCCGGT GAACAAGACG GGGAAGGTGG TGTGGATAGA CGAGCAGACA
AAGAAGGCGG TAATATCGGA GGCTCTCTGC ATTGGCTGCG GCATCTGTGT CCATAAGTGC
CCCTTTGAGG CAATTACGAT TGTCAACCTC CCCGACGAGT TGGAGAGAGA CTGCGTGCAC
CGCTATGGGC CCAGCGGGTT TAAGCTTTAT AGGCTCCCGA TTTTGAAAAG AGGCAAGATA
GTGGGGGTTC TGGGACGCAA CGCCCTCGGC AAGACAACTA TGGCGAGGAT ATTGGCCGGG
GAGCTGGTGC CCAACTTGTG TAATCCTGAA GGCGGGAGTA GCCGCGAGGA GGTGATTAAA
CAGTTCAGAG GTACTGAGCT CCACACCTAC TTCTCTGAGC TTTATAACAA CAAGCTTCGT
GCTGTCCACA AAATTCAGTA CATAGAGCTG ATTCCCCTCT ACTTAAAGGG TACAGTTGGA
GATATTATTA AAAAGGCGGG GATAAGGGAT GAGCTCGTCA AGCGCTTCGG CTTAGACAAG
CTCGTGTCTA GGGAGATCAA CAAACTGTCC GGCGGGGAGT TGCAGAAGCT TGCGATTGCC
GCCGCTTTGT CGAAAGACGC CGACGTCTAC ATATTCGACG AGCCTGCCAC CCACCTAGAC
GTGGTGGAGC GCGTGAAGGT GGGGGACGCC ATCAGGGAGT ACACTCAGAA TAAGTACGTC
CTAGTTGTAG AGCACGACTT GACTGTGCTG GATTTTCTCG CAGACAACGT GGTGATTGTA
TACGGAAAGC CGGGGGCTTA CGGCATAGTG TCACATCCCG GGGGGGCGAG GGAGGCCGTG
AACGAATACC TCTCGGGGTA CATCTCCTCA GAGAACATGA GGATTAGGGA TAGGCCTATA
AAGTTCGAGG CTAGACCGCC GGAGAGGAAG AGCGGAAAGG CCGCGCGGCT CGTGGAGTGG
GACAACATAC AAGTCTCCCT AGGAGACTTC CAGCTGGAGG TATCTGCATC GTACATAGCG
AAGGGGGAGG TGGTGGGGGT CGTGGGGCCC AACGGCATTG GAAAAACGAC ATTTCTCAAA
GTGCTGGTAG GCGAGGTTAA GCCGCAGAGC GGCACAGTAA GCTCGTCTCC GCGGATTAGC
TACAAGCCGC AGTACATTAG GGACATCGCT GTGAAAAACC AAGACGTGCC TGTGAGCCTC
TGGCTTGCGC AGCAAGCCGG CGACTACTCC GACAACCCAA TATGGCCTGA CCTGAACAGC
GGCTTTAACC TAACGCCTCT TCTCGGGCGG AAAATGGGGG AGCTTTCTGG CGGTGAGCTA
CAGAGAGTTG TAGTCGCGGC TTCGCTCCTA AAAAAGGCAG ACATATACGT GTTAGACGAG
CCAATGGCGT ATCTAGACGT GGAGCAGAGA ATAACCGTTG CCCGTACAAT AAGGCGTATC
ATCGAAGAAA GCGAAGTAGC GGCGCTGGTG GTGGAGCACG ACATCGCCAT GCTGGACTAC
ATGTCCAACG CCGTTATGCC ATTCATAGGC GAGCCCGGTG TTAGGGGCTA CTCCCCGGGG
CCGACTGACA TGAGGACTGG GATGAACATG TTCTTAAAGT GGGCGGATGC GTCCTTTAGA
AGAGACGTGC GATCGGGCAG GCCTAGGCTG AACAAGCCGG GATCCGCCCT TGACAGAGAG
CAGAAAGAAA AGGGCGAGCT TTACTATATG TAA
 
Protein sequence
MVRIAVVDVD SCQPKKCGHE CVKYCPVNKT GKVVWIDEQT KKAVISEALC IGCGICVHKC 
PFEAITIVNL PDELERDCVH RYGPSGFKLY RLPILKRGKI VGVLGRNALG KTTMARILAG
ELVPNLCNPE GGSSREEVIK QFRGTELHTY FSELYNNKLR AVHKIQYIEL IPLYLKGTVG
DIIKKAGIRD ELVKRFGLDK LVSREINKLS GGELQKLAIA AALSKDADVY IFDEPATHLD
VVERVKVGDA IREYTQNKYV LVVEHDLTVL DFLADNVVIV YGKPGAYGIV SHPGGAREAV
NEYLSGYISS ENMRIRDRPI KFEARPPERK SGKAARLVEW DNIQVSLGDF QLEVSASYIA
KGEVVGVVGP NGIGKTTFLK VLVGEVKPQS GTVSSSPRIS YKPQYIRDIA VKNQDVPVSL
WLAQQAGDYS DNPIWPDLNS GFNLTPLLGR KMGELSGGEL QRVVVAASLL KKADIYVLDE
PMAYLDVEQR ITVARTIRRI IEESEVAALV VEHDIAMLDY MSNAVMPFIG EPGVRGYSPG
PTDMRTGMNM FLKWADASFR RDVRSGRPRL NKPGSALDRE QKEKGELYYM
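
The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It uses the amino acid assignments of NCBI translation table 11 (the table listed for this gene) and the convention that an alternative start codon such as GTG is read as methionine when it initiates translation. The helper name `translate` and the start-codon set are assumptions written for this example, not part of the record. Only the first and last lines of the gene sequence are translated here.

```python
from itertools import product

# Codon order follows the NCBI layout: first, second, third base over "TCAG".
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons for translation table 11 (assumed from the NCBI table).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a DNA string; whitespace is ignored, a stop codon ends it."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGGTACGCA TCGCCGTCGT TGACGTAGAC TCTTGTCAGC CTAAGAAATG TGGTCATGAG"
last_line = "CAGAAAGAAA AGGGCGAGCT TTACTATATG TAA"

print(translate(first_line))  # MVRIAVVDVDSCQPKKCGHE
print(translate(last_line))   # QKEKGELYYM
```

The two outputs agree with the first 20 and last 10 residues of the protein sequence above. Note that the start-codon rule is applied to the first codon of whatever string is passed in, so fragments that happen to begin with one of the listed codons would need separate handling.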