Gene Pars_1462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1462 
Symbol 
ID5055496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1318396 
End bp1320465 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content63% 
IMG OID640469002 
Producthypothetical protein 
Protein accessionYP_001153671 
Protein GI145591669 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.947543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCATA TATTGGCAAT AATGGCAATA ACCGCCGTCG CCCTCGCTGT AGTGTACGTC 
TCAGAGGGCG GAGACGTGGT GTACTACTCC ACGCTGGGGG GATTCGGCGC GTTAAACGGC
CTTGGCCAGA GGCTGTGGCA CGTGGACGCC CCCGGCGCTT TAATAGCCAC AGACCCCATG
GGCTCTTGCC TAGCCGTCGC GCACCCCCTC GGCAACGCCA CGAGCTGGCT CGGCACCCGC
GTGGCGCTGT ACGTCAGAGG CGCGGCGTTG TGGTCAGCCG TCTTGAAGCT AAACGCCTCT
GCCATAGCCA CCGACTGCAA CCGAATAGCA GTAGGCACCA TGGACGGCCG GATTGTCGAG
TTGCAAAACG GAAGAGTCGC CTCTGAGAAA ACAGTCGGAG TGCCCGTAAT ATCGCTGGTC
TACGACGGCG GTGCTCTTAG ATACGGCGTG TGGAGGCCGG GCTACGTGGA GCACCCCCTC
CGCTGTGGAT ACACCGTGGC GCTGGCCAAG AGGGATAAGC CGTACGTGGT GGTAGACGGG
AGGGAGTACG TCGGCTTCGG CGAGTTGCTC TCGCTAAGCC CGCCCGCCGC CGTCTCGCAA
AACTGCGTCT TGACCTTCGC CGCTGAGGGC GCCGTGTACT GGGGCTCCGC CGCCATCCCT
GTTAAAGAGC CTGTATACGC CGTTGCCATC TCCGGCGACG GCAACGTCTT GGCGGTGGGC
TTCGCCGACA GGGTGGAGCT CTACCGCGGT GGGCAGCTGG CGGCGTCTAT ACAGGCAAAC
ATGCCTAGAT CCCTCGCCCT TGACTTCCGG GGCTTTACGC TCGCCGTCCA AGACGACTCC
GGGGTGCAAG TATACTCCTT CACCCAGAGA GAGGTGGAGG TGGTTGGTTG TCCCTACGGC
GTGATCAAAG CCGGCGTAGC TACTTACAAC GTCACTGGGA GGGCCGTGGT GTACGTGCCC
AGGGGGGCGG AGCTAACGCC GTTGCGCATC AACTTCACAG ACGGGGTCTG CGCCCCCGCC
GGCTTCGACG GCCGCGTCGT CAGCTACCAG CGGCTTTACA GAGTCGAGGT GACGCCGCCT
GCCAAGGGCC CCGAGCTGGC CGCGGGCCCC ACCGCCTACG CCGCCCCCCT CGAAGCCGAG
GTGAGGGCTA AAACCGGCGT TTTAAAGGCG TATTTAGCCG GGTGGCTTGT CGGCGGGAGG
AGGATGCCGC CGGTTCCAGT GTTAACTGTA GACGTGAGGA ACGCGACGGC TGTAGCCCCG
CTCTACAGAC TAGACGTGGC GGCGGAGTTA GTTGAAGGCG GAGTAAAACG GGTGTTAAAA
GGCGTGGCGG CTTACGACAG CAAGGGGGCG CCTATGGCGC CGGTGGAGGA CTACGTCTAC
CCAGGCCTCC CCGCATATGT AGAGACTTAC TACGACGAGT ACTACCTCCT CCGCGCGGAG
GCTTATACAC GTAGCACGTT CAATGCCACA GAGCTCTGGC TTAAGCCGGG CCAGACCGCG
GTGATCTACG CCGACGAAGT CGTCGACTTT GGCAACTCCA CTAGACTCGT CTTCACCGGG
TGGAGCGACG GGTCTAAAGA GCTTAGACGC GCCGTGGGGC CGGGGACATA CACGGCTAGG
TACAAGGTGC AGTATCTCGT GACGTTTAAT GCGCCTAACT ACACCGCGGC CGTCTGGGCA
GACGCGGGGG CTAAGCCGCC GGCGCCTAAG CCCCCTGAGA AGCTCTACGA CGACGGCAGT
ACTAGGATTT GGTTCAACGG GTGGCAACTG CCGGAGAGAG TAGACGGGCC GCTTAACGTC
ACCGCCAACA CGGCGAGGGA GTACCGCGTC GTGTTGAAAT ACCCTTGGGG CCAGGAGGAG
AGGTGGCTCC CCCACGGCTA CGCCCTGCTA CCGCCGGACC GCAACCGCTA CAACGTCTTC
TGGCGCTTCT CCCACTGGGC CCCGAGCGAC GTAGTCGCGG GGCCGGGCGT TTACGAGGCG
GTGTACCAGC TGGACGCCTT TGCAGTTGCG GCCATCGCAT CTGTGGTAAT TATAACCGCC
GCAGTTGCGC TTTGGCTAAA GAGGCGGTGA
 
Protein sequence
MRHILAIMAI TAVALAVVYV SEGGDVVYYS TLGGFGALNG LGQRLWHVDA PGALIATDPM 
GSCLAVAHPL GNATSWLGTR VALYVRGAAL WSAVLKLNAS AIATDCNRIA VGTMDGRIVE
LQNGRVASEK TVGVPVISLV YDGGALRYGV WRPGYVEHPL RCGYTVALAK RDKPYVVVDG
REYVGFGELL SLSPPAAVSQ NCVLTFAAEG AVYWGSAAIP VKEPVYAVAI SGDGNVLAVG
FADRVELYRG GQLAASIQAN MPRSLALDFR GFTLAVQDDS GVQVYSFTQR EVEVVGCPYG
VIKAGVATYN VTGRAVVYVP RGAELTPLRI NFTDGVCAPA GFDGRVVSYQ RLYRVEVTPP
AKGPELAAGP TAYAAPLEAE VRAKTGVLKA YLAGWLVGGR RMPPVPVLTV DVRNATAVAP
LYRLDVAAEL VEGGVKRVLK GVAAYDSKGA PMAPVEDYVY PGLPAYVETY YDEYYLLRAE
AYTRSTFNAT ELWLKPGQTA VIYADEVVDF GNSTRLVFTG WSDGSKELRR AVGPGTYTAR
YKVQYLVTFN APNYTAAVWA DAGAKPPAPK PPEKLYDDGS TRIWFNGWQL PERVDGPLNV
TANTAREYRV VLKYPWGQEE RWLPHGYALL PPDRNRYNVF WRFSHWAPSD VVAGPGVYEA
VYQLDAFAVA AIASVVIITA AVALWLKRR