Gene Pars_0071 details


Gene Information

Locus tag: Pars_0071
Symbol:
ID: 5055762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 59049
End bp: 61013
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 50%
IMG OID: 640467649
Product: hypothetical protein
Protein accession: YP_001152338
Protein GI: 145590336
COG category:
COG ID:
TIGRFAM ID:
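
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. Both are assumptions, not statements from this record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 59049
end_bp = 61013
gene_length_bp = 1965
protein_length_aa = 654

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # 1965 655 654
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.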


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.207619
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTTTGG ACACTTATTT AAAAAGGTAT CAGGTAAACA CCGTGGAAAA ACTCGCACCG 
ATTCTCCTAA TAGCGCTGGG GATATCGCTA ATCGCAGTAT ACTTCGCAAT TCCGCCCAAA
GCCCCGTTCA CCACCACGGC GACGGCGGCC ACGAACACAG TCTTCAAAAC CACATCCACA
AGCCCCTCCA TAACGCAGAC CATAGAAACA TACACCAGCA CTGCCCAGAC AACGCAACCA
CCGCCGCAAC CGCCCTCCAC ACCGCCGGTC GCTAGGCCGC CGGTATTACT GCCATTGTTC
AGGGTGGACG TTGTGGCACC TGACGTAGTA AACACAACGC GGATGCCAAT ACAGATAAAC
TACACAGTAG TTGTCAGAAA CGTAGGCAAC GGCACAGGCT CTGTTTTAGT TGGCGGCAAA
CACTACGTCA TAGACCCCGG CAAAGAGGTT AAAGTAAACG CGACGGAGAC AATCACATTT
CCAGGTACCT ACACCTTAGA AGTAGAAGTT AACGGCACGC CGTATTCAAA GACGGTCAAG
GTATTTTACT ACACGCCGGT TCTAGAAGCG GAACCTGTAA AAGTAAACGT AACCACCCTC
CCCACAAACA TAACCGTGGC TGTGTTGGTT AGGAACAGGG GAAATCTAAC CGCAGTAGTT
GAAGGCGTGG AGATAAGACC GGGAGAGGCA AGGACAATAA ACAAGACCAT AACAGTAACC
GCGGCCGGTT ACTACTTCAT CAACGTAAGC GGCGTCAACG CCCCTATCGC CGTTAGTTAC
TACACTCCAA AACTAGAGTG GAAAATAGGA GGCCCAGAAG AGGTGGAAGC AGTGCCAGGA
GAGAGCTCCT CGGCCTGGCT GTGGCTAAAG AATGTCGGCA ACGTCACAGC TAAATTCGTC
GTAGACGGCA GACAAATTGT GTTACCGCCT GGTAGCGCAG TAAACCTAAC AAAGTCAGTC
ACCGTGTCCA CTGCTGGATA CTATACAGTA GAGTTCGTAG TCAGAGGCCA ACTCAACGCG
ACTATGAAAC ATGCAGTAAA AGTAAAGATA ATTGCAACAC GCGTAGAGCT CATAACGTGG
TCTCCTGAGC TTAGAAGGAG GTGGCCACAG CCAGGCTCAA CTGAGTCTAT TACGCTGAGC
GTGCCTAACA AAACAGTGGC CATGACGTGG GGGTACATCA TCTCGACAAA TGCAACTAGG
AGAAGCACGA CCATAGTAGT CGAAGACCCA GACGGTGTAC AGCAATACCA ACTCACGCCA
GGCGCCGCCC TTTCAAAAAA CTTTACAATA GTCATGGAAG CGCCGGGCGA GAGAACAGTG
GCTATCAAGG TTAATTCGAC TACCTACGGC CTAGTTGTAT CTCTAAAACT CACCCCACCA
AAAGTGACAG TAAGAGATAT TACAAAAATA GATTTCTCTG ACAGTAGACC ACTACTTGCC
ATCAAAATTA GTTGCAGATA TGCAGACATA TCTTTTGATA TCTTAGAAGT ATCAGGGACG
CTCTTATTCA CGCAAACCGG CCGGTCTATT TCCGGCACGA TAATAGTTAG ATCAGCAAGA
GGCGTCGATA CTGGTAGCTA TTCGGGCCAG GCAGAGGGGG GGAGGGGATT TCTGAATCTT
AACTTGCTTG GGAGGAACGT ACATGTAGAA TTCTCATTGC AACCGGTCAT CATAACGAGA
GTGGAGGTCG ACGGGACGCC GTACGACTGC AAAGTCCCGC TAGAGCTGAT ACCCACGATC
CTCTATGGCG ACAAGCCCAC CGCCTCTGAC GAGCCGGCAG ATCGATACGC AATGAGATTA
ATATCGGCAT TTGCGAGGGG AGACAACGGA GCACCGCAGT GGGCGGTATG GAACGGGGAG
TACGTAGAAG TTAGGGACAG AGAGGGACAT GTGATGAAGG TGTATTTCGA AAAGGGCACT
GTTAGAATAG AAGGCCCCCT CCAGGCTTAT ATCGTTATAT CCTGA
 
Protein sequence
MSLDTYLKRY QVNTVEKLAP ILLIALGISL IAVYFAIPPK APFTTTATAA TNTVFKTTST 
SPSITQTIET YTSTAQTTQP PPQPPSTPPV ARPPVLLPLF RVDVVAPDVV NTTRMPIQIN
YTVVVRNVGN GTGSVLVGGK HYVIDPGKEV KVNATETITF PGTYTLEVEV NGTPYSKTVK
VFYYTPVLEA EPVKVNVTTL PTNITVAVLV RNRGNLTAVV EGVEIRPGEA RTINKTITVT
AAGYYFINVS GVNAPIAVSY YTPKLEWKIG GPEEVEAVPG ESSSAWLWLK NVGNVTAKFV
VDGRQIVLPP GSAVNLTKSV TVSTAGYYTV EFVVRGQLNA TMKHAVKVKI IATRVELITW
SPELRRRWPQ PGSTESITLS VPNKTVAMTW GYIISTNATR RSTTIVVEDP DGVQQYQLTP
GAALSKNFTI VMEAPGERTV AIKVNSTTYG LVVSLKLTPP KVTVRDITKI DFSDSRPLLA
IKISCRYADI SFDILEVSGT LLFTQTGRSI SGTIIVRSAR GVDTGSYSGQ AEGGRGFLNL
NLLGRNVHVE FSLQPVIITR VEVDGTPYDC KVPLELIPTI LYGDKPTASD EPADRYAMRL
ISAFARGDNG APQWAVWNGE YVEVRDREGH VMKVYFEKGT VRIEGPLQAY IVIS
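
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It uses the codon assignments of NCBI translation table 11 (the table named in this record), which are the same as the standard code, and it reads a recognised start codon such as GTG as methionine at the first position. The function name `translate_cds` and the constants are hypothetical names chosen for this example, and this is not necessarily how the database derived the protein sequence.

```python
# Sketch only: translate_cds and the constants below are illustrative names.
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError for non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_block = "GTGAGTTTGG ACACTTATTT AAAAAGGTAT"
print(translate_cds(first_block))  # MSLDTYLKRY
```

The output for the first 30 bases agrees with the first ten residues of the protein sequence listed above.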