Gene Pars_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1053 
Symbol 
ID5054236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp937187 
End bp938758 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content59% 
IMG OID640468609 
ProductATPase 
Protein accessionYP_001153283 
Protein GI145591281 
COG category[R] General function prediction only 
COG ID[COG0714] MoxR-like ATPases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.603764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGCACGG GGTTTGAAGA CTGCGGGGCG TTTTTCGTCG AGGCGTCCGG CGATGTGTCC 
CACGTGGGGA AATTCCTGTG GAGCCCCGCC GACAAGCCGT GGAAAGCGAT GGAAGAGATT
AGGAGTGGGG ACTGCGTCCT TCACTACGTC ACTGAGAAGG CCGGCGTCAA GGGGTTTGTG
GGGGTTTCCA AGGCGGCCGA GCCGGCGAAA CAGGTCGACA AGGCTGAAAG CGAGAGGCTT
TTCCAAAGCC TCGGCGTGGA GCCGAGCTAC TACAGCCAGT GGCTTTCGAA GTACGATAAG
TTCTACTTCG TCCCCCTAAA GGGCTTCCGA CGTTTTGACA AGCCGCTGGG GCTGGAGGAG
GTGGTGGCTA TGGGCATAAG CGTCTTTGAG GTAGTGCCGC AGAACTACAT TAGGAAGACG
CCTTATGGGA AGAGGATTTT AGAGGCGGCG TGTAGGCAAA GCGGTCCGCG CAGTTGCGGA
GGTGTGGACG CGGCGCCTAG TGTCGAGCTG GAGAAATTCG TGGCTCTGCT AATGCTGGCG
GGGAAAAACG TCTTGTTTGT GGGGGCGCCG GGGGTGGGGA AGACGCTCCT GGCGCGTAGG
GCGGCGTGCT TCTTCACGGA GTGCCCCCCG GTGGTTGAGG CGGGGCGGGA GGACTTGGCC
TACGAGGACT TGGCCATCCG CTACGCGGTG GGGCCCGACG GCAGAGTGGA GAGGCGGCTG
GGCTCCCTTG CAAGAGCCGT TGCCGAGAGC TGGGCATCGC TGAGGAGAGG CGGGGGGCCG
TGTCATTTCA TAATAGACGA GATAAACCGC GCCAACATAG ACGTGGTGAT GGGCCGCTTC
TTCACCGTCC TGGACATGGA GCACAGGTCT GTGGAGATTC CGGAGCTGGA GGAGGCCGGT
GTCGACCCAC CGCGTGTGCC GCTGTCTTTT AGGGTATTCG CCACGATGAA CGTCGTCGAT
AGGGGCCAGC TGTTTAGGAT GAGCTTCGCC CTTCTGCGGA GGTTTGCCTA CGTATACGTA
CTGCCGCCTC ACAAGAAGAT AGAGCCAGAG AAGTCGCCGC GGGAGACGGC GAAGGAGCTA
GGCTTATACC GCCCATATGC GGAGGCGGCT TACAAGGCCC TCACCTTGAA AAGCTACTTA
GAAAACGACG TGGCGACTCT TATCAGACTC CCAATCCCCC AGCCAGAGAA GATAGCGGAA
GAGGCTGATA GGCTCGGGAT ACTGCACCTC GTGGACCATT TGTTAGAGAG CGCGGACAAA
ATCGGCTTGG AGGTAGGCCC CTCTATGATC GTCGACGTGT TGAAGGCTGT GGCCGTCTAC
GCCTCTGCCC CGCCGAGCCT AAAGCCCAGA GAGGAAGTCT TTGTAGACTA CGTTGTCTCC
TCGCTGGTGC TTCCATACTT CGCCGCGGCA ATTCCGAGAA TAAGACAGAA GGCGCTGTAT
ACGTCAAAGG CCTTTGAAGA GGCCAGGGAG CTGAACGAGG TGGCGTCTAA AATTAGGGAG
TGGCTCGGGG AGAGGTCGGC TTCTTATCAC GTGGCGAGAG GGCTTCTCTA TGAGCTACCG
GCTAAGGTGT GA
 
Protein sequence
MCTGFEDCGA FFVEASGDVS HVGKFLWSPA DKPWKAMEEI RSGDCVLHYV TEKAGVKGFV 
GVSKAAEPAK QVDKAESERL FQSLGVEPSY YSQWLSKYDK FYFVPLKGFR RFDKPLGLEE
VVAMGISVFE VVPQNYIRKT PYGKRILEAA CRQSGPRSCG GVDAAPSVEL EKFVALLMLA
GKNVLFVGAP GVGKTLLARR AACFFTECPP VVEAGREDLA YEDLAIRYAV GPDGRVERRL
GSLARAVAES WASLRRGGGP CHFIIDEINR ANIDVVMGRF FTVLDMEHRS VEIPELEEAG
VDPPRVPLSF RVFATMNVVD RGQLFRMSFA LLRRFAYVYV LPPHKKIEPE KSPRETAKEL
GLYRPYAEAA YKALTLKSYL ENDVATLIRL PIPQPEKIAE EADRLGILHL VDHLLESADK
IGLEVGPSMI VDVLKAVAVY ASAPPSLKPR EEVFVDYVVS SLVLPYFAAA IPRIRQKALY
TSKAFEEARE LNEVASKIRE WLGERSASYH VARGLLYELP AKV