Gene Pars_0983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0983 
Symbol 
ID5055103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp873968 
End bp875818 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content54% 
IMG OID640468539 
Producthypothetical protein 
Protein accessionYP_001153215 
Protein GI145591213 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.495624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000164054 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAATAG GCTACGTCGT GGCCACGGCG ACGCCGTTTG AGTTCGTGGC GACGCTGGAT 
CCCGAGAGGC CTGTTAGTCT GTACGACTAC GTGGTGGTTG ACCACGTGGA GCTCGACAAC
GCCTCCGGCG AGCTTGTAAA CGTCAGTTTA CTGGGCCAGA TAGTGAAGCT TTACCGCGAC
CCCTACTCGG TGAAGAGGGA TCTGCCGCTC TACACCGTCA TACAGGAGGT CTCTAGTAAT
ATTTTGGAAG TTCAGATTGC CAAGGTCAAG GTGCTTGGCT ATGTGCTAAA CGGTGAGTTG
AGGCAGCCGA AGCAGCCGCC GAGGATAGGT TCGCCGGTCT ACTTGGCGGA GAACGAACAA
ATCGCCGAGC TGTTTAAGGT GGAGAACGGG CTGTGTGTCG GCAAGCTTGC AAGCCGCGAT
GTGGCTGTGT GTCTAGATAT AAACGGTATT AGGAGACACC TTGCGGTAAT TGCGGCGACG
GGCAGTGGCA AGACTTGGTT TTCGGTGGTG TTGATAGAGG AGTTGCTGAG ACGAGGGGCT
AAAATTGTGG TCATAGACCC ACACGGCGAA TACGTAGCAA TAAAAGACTC AATACACCGC
CTAGGTCCCT TCACTGCGAG GGTTGTGAAG GTGTCGAAAC ACCACGTGGG GGACTTAATG
TACAAGATAG GTGTTCTTGA CAGTGATCCA GAAGCGTTGG CAAACGCCGC GGGCGTACCG
CCTGGCGCTA AGAAGATAAG ATATGCGATC TACCTCGCAT GGTCCTATGC GAAGAAGGTT
AGGAAAGCCA CTGGGGAAAA AGTCGGCTTG GCCTTTATGA AAAGAGTCCT ATACACAGCC
ATGAGGGGGG AAAACGCCTT GCAAAAACTT TTCCAGCAGT ACAAAGGAAT CAACGACGGC
GCTCACAAGG CCGAGGGAGA TTTTCCCCTA AGCGATTTAA AGCAACTCGC CGCCAAGGAC
AGACACGCCA TTTTCAGCGC GTTGACGTAT TTAAAAAAGC TGTCTAGGCT GGGAGTCTTC
TCGTCTAGGT CAACCCCTCT CTCGAAGCTT CTGGGCGACA TTACGATTAT CAACCTGGCA
GGGGTAAACG AGGAGGTCCA GGACTACGTG GTGTCGCACT TGGTGAATAG GCTCTTCCAA
GCTAGGGTGA ACCACGTCAG GGGGTTAAAG GGGTACCAAC TCCCGTGGCC CATAGTCTTG
TTCGTAGAGG AGGCTCACAG ATTCGCCCCT CCAAAGGCAC TAAGAAAGAC GAGGTCTTAC
GAGGCCTTGT CCCGGGTCGC CTCAGAAGGG CGCAAGTTCG GCGCCTACCT CGTAATTATA
AGCCAGAGGC CTAGCAAGGT CGATCCTGAC ATAATTAGCC AGTGCCAGAG CCAAGTAATA
ATGCGGATAG TCAACCCCAA AGACCAAGAG GCGGTTAGAG AGAGTAGCGA ACTGTTGGCG
CAGGAGTTTC TAGAAAACCT GCCCGGGCTG GACGTGGGCG AGGCTGTGGT GTTGGGACCC
ATCGTGAAAC TCCCCGTAGT GATAAAGGTG AGGGACAGGG TGCTTGAATA CGGCGGATCT
GACATAGATC TCACAACGGC GTGGAAGGTG GATAAGACCG CCGACGTGGC GCAGATGTGG
AGGAGGATAT TCAACAGCCC GCCTCCTCCA AGCGTTATGC TGTCGGCATC TAGAATGAGG
CTACTCCACA AAAAGAGGGA GGGGAATAAA ATCGTCATTA AGCTCCTCGA CGGGGATAAG
GAAGTGGACG TGGTAATCGA GGGCGGCTCC CCCCGCTGTA GTGTCTGCGG CGTCGGCAAG
CCGTGTAGCC ACGTGTATAA GGCACTTGAA GAGGCGCTAG AGGTGGTATG A
 
Protein sequence
MRIGYVVATA TPFEFVATLD PERPVSLYDY VVVDHVELDN ASGELVNVSL LGQIVKLYRD 
PYSVKRDLPL YTVIQEVSSN ILEVQIAKVK VLGYVLNGEL RQPKQPPRIG SPVYLAENEQ
IAELFKVENG LCVGKLASRD VAVCLDINGI RRHLAVIAAT GSGKTWFSVV LIEELLRRGA
KIVVIDPHGE YVAIKDSIHR LGPFTARVVK VSKHHVGDLM YKIGVLDSDP EALANAAGVP
PGAKKIRYAI YLAWSYAKKV RKATGEKVGL AFMKRVLYTA MRGENALQKL FQQYKGINDG
AHKAEGDFPL SDLKQLAAKD RHAIFSALTY LKKLSRLGVF SSRSTPLSKL LGDITIINLA
GVNEEVQDYV VSHLVNRLFQ ARVNHVRGLK GYQLPWPIVL FVEEAHRFAP PKALRKTRSY
EALSRVASEG RKFGAYLVII SQRPSKVDPD IISQCQSQVI MRIVNPKDQE AVRESSELLA
QEFLENLPGL DVGEAVVLGP IVKLPVVIKV RDRVLEYGGS DIDLTTAWKV DKTADVAQMW
RRIFNSPPPP SVMLSASRMR LLHKKREGNK IVIKLLDGDK EVDVVIEGGS PRCSVCGVGK
PCSHVYKALE EALEVV