Gene Pars_0515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0515 
Symbol 
ID5054448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp466225 
End bp467865 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content58% 
IMG OID640468077 
Producthypothetical protein 
Protein accessionYP_001152762 
Protein GI145590760 
COG category[T] Signal transduction mechanisms 
COG ID[COG3848] Phosphohistidine swiveling domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.292882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTACC CACTGAGGCA GAGCAGAAAG CCGGAAGAGG GCTTCTGGTA CCGCGACGTG 
GTGCACTTCG GCGACGCTCC CCTATACCCC TTGGACTCCT ACTTCACGGT GTCTATGATG
GACTTAGCCC AGTCCTATTA CTACGGCAGG TATTTCTCCA TGCCCACCTC CTCGGGGAGG
GACACGGCGT TGGTGGAAGG GAGGCCCTTT AGGACTTCAT ACCCCCCGAG GCCTTTTGCG
GATATATTTG AGCGGAGGGC CAGGGAGTAT TTGGAGAATT GGGACGCGAA ATACGCCGAG
TGGAAAAAAG AAGTCGTGGC GATAATTGAG GAGATGTCTA AACTGCCCGT GGATCTCACC
GAGGGCGTGG ATTTGAACGG CGCGGCGCCG TATCGGGTAA TTGAGAGCTG GCTCAAGCTC
TACCTCCTTT GGCTGAGGCT TTGGTTTAAA CACTACGAAT TCCTAATGTT GGGCTACCTC
ATTTATCAAT TGTTTTATAA GTTTATAAAG ACGTTTTTCC CCGACGCGCC AGATCACCAC
ATTTCAGAAA TGCTGGCCCA ACGCGACATT GACACCTTTA GGCCTACAAA AGAGCTGGAG
AGGTTGGCGG AGCTGGCGCG CGAGTTGGGA ATCGCCGAGA GATTAGCTGC GTTTAGCAAC
GCGGCTGAGA TGGAGAGATC TTTCGCCGAG AGCGGCGATC CTAAGGAGAG GAAGTGGCTT
GAGGAGTGGA ACGCCGTGAA ATACCCCTGG TTCTATATAT CCACAGGGAC GGGCTTTTTA
CATTGGGAGG AGAGGTGGAT AGACAACCTC GACATCCCCT TTACCTATTT AAAAAAGTTG
TTGAAGGAAG GGCCTTCCCG CAAGCACGGG GAGAGGGGCG GGGTGCTGGC GAGGGGCTAC
GCCGATTTAT TGCCCGAGGG CTACCGAGGG GTGTTTTATA AATACCTGGA GGCGGCCCGG
AGGGCCTACC GCTACATAGA GGAGCACAGC TTCTACGTGG AGCACCTGGG GTTCACGGTG
GGGTACAGGA AGATAAGGGA GTTCGGCCTC TTGCTGGCCA AACTGGGCGT GTTGGAGAGG
GAGGACGACA TATGGTATTT GACCTGGGGC GAGGTTCTGG AGGCGTTGTT AGACGGGCTG
ACCGGCTGGT GCAACCTCAC GGGACCGGCG GCGCATAAAG TTCTCCGTAT GCGCATAGCC
GAGAGGAAGG CGTTGTTGGA GAAGATGGCC TCCTCCCAGC CGCCGACGCA TATAGGGGAG
CAGGGGGAGG TGTCGGACGC CAATTTGGCC TTGCTCCACG GCGTCGGCAG GAAGACCGGC
GGCGACGTGG TGGCCGGGAT AGCCGCATCG CCCGGCAGAG CCAGGGGGCG GGTAGTGGTG
GTTAAAAGCC CGCGGGACTT GGAGAAGGTG GTCGAGGGGT GCGTAGTGGT GACGTCTACT
ATTTCGCCCA CTTGGATCCC TGCGTTGAGG CTGGCCGCGG CCGTGGTGTC GGAAAGCGGA
GGCGCCATGT CGCACGCGGC GATAATAGCT AGAGAGCTCG GGAAGCCGGC CGTCGTCGGG
GCAGCGGGGG CCACCTCTCT TTTCAAAGAT GGCGACGAAG TGGAGGTCGA CGGCAATATA
GGCGTGGTGA GGCGGGTATG A
 
Protein sequence
MMYPLRQSRK PEEGFWYRDV VHFGDAPLYP LDSYFTVSMM DLAQSYYYGR YFSMPTSSGR 
DTALVEGRPF RTSYPPRPFA DIFERRAREY LENWDAKYAE WKKEVVAIIE EMSKLPVDLT
EGVDLNGAAP YRVIESWLKL YLLWLRLWFK HYEFLMLGYL IYQLFYKFIK TFFPDAPDHH
ISEMLAQRDI DTFRPTKELE RLAELARELG IAERLAAFSN AAEMERSFAE SGDPKERKWL
EEWNAVKYPW FYISTGTGFL HWEERWIDNL DIPFTYLKKL LKEGPSRKHG ERGGVLARGY
ADLLPEGYRG VFYKYLEAAR RAYRYIEEHS FYVEHLGFTV GYRKIREFGL LLAKLGVLER
EDDIWYLTWG EVLEALLDGL TGWCNLTGPA AHKVLRMRIA ERKALLEKMA SSQPPTHIGE
QGEVSDANLA LLHGVGRKTG GDVVAGIAAS PGRARGRVVV VKSPRDLEKV VEGCVVVTST
ISPTWIPALR LAAAVVSESG GAMSHAAIIA RELGKPAVVG AAGATSLFKD GDEVEVDGNI
GVVRRV