Gene Pars_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0129 
Symbol 
ID5055510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp114277 
End bp117627 
Gene Length3351 bp 
Protein Length1116 aa 
Translation table11 
GC content57% 
IMG OID640467708 
Producthypothetical protein 
Protein accessionYP_001152396 
Protein GI145590394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.294302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCCGT CGCCTAGGAG AGGTCCGAGG GGTCAGATAA TTCTGCTCAC CGCCCTAGTC 
CTAGCTTTCG CCGTGGCGAT CGTCATGACG GCCCAGATAT CGTCCAGCCT AGGCCAAACT
GCATCCACCT ACCAGACGGG GTATGGCATA TACGCAAAGG CGTGGCCCGA GGCGGTGGAC
ATGGCGGATG CCCACGCCCT TCAAGTCTCG TTGAAGGGGG CATCGCAGAT CTCCACAGGC
GCCCTTTCGT CGGGGCTGTA TTCATATTGG CCTACTGCCG GCAGGTGGCT TACGTACAAC
GCCACTTATA TTTACCTTAA CAAGTCCCTT GCGGCGGTTA AGGAATCGTT CCTTCCCCTG
GGGGCTCAGC TGGACTTCGG CGCGGTGGTG CGCTACTACG GCTTCAACGG GAGTCTCGGG
CCCTACCCGC TGGCGGCTCC CCTTACGAGG TACAACGTGA CGCTTGTGAG GCAGGGCTAT
TTGACTATCC CCGGCGGGGC GCTGTACAAG CTGACGGTGT ACTGGCAGGC GCCTGACTAC
TCTGGATACT ATGTAGTGAC ATACGGCGCA CATGTCTCCG AGCCTGGCGC AGTGGGCGAG
ACGTACAACG TTGCGCTGGC CGAGGTTCTA CAAGCCTTTA GCAGAGCGGG GTACGACCTA
AAGAGGAGGC TCTTCTTCTT CCTCGTATAT TCCGACTCCG GCGGCTGTAA AGTTAGGCAA
CTGCCCTGGT GGGCAGAGAT TAAGAAATGT CCCCTGTGCC TCCTCCACGT CAGACTGCCC
AACGTAGGGG AGATTGGGAT GGGCGATCCG ATACTGCGTA GTGGAGCCGC CTCGGAGATC
CTCGCAGTCC TCCTGCCTGC CGACTTGGCA GGAAACCAGC TTGCCCTGTG GCTCTCCGGG
CCCTGTTCTC AAGGCGGCGG TCTGACCACG CAACAGATTA CTGTTAACAT AGATGCTAAC
GACAACCCGT ACGCAGTGTT CGCCCAGACC GTCCAATACG ACGGCCCCAC GCCGGTTAGC
TGGTCAAACT GGTTGTTCTG GGGGGCGGCG GATATTCAAG GCTGGAGGGG GGCTACTGTT
AGTGCGGCAG ATTGCTATCC CAATCCAACA GCTGGAGGTG TAAGAGGCAG TGGGGTCAAC
CTAATCTACG ACAGCGACTT TGGCCAGGTT GCTGAGGTGT GGGCGGAATA CTACGCGGGG
ACGAGGAACT GGGCTTACAA CCTCTACATC CCCACGCCTG TATCAATATC TGGCTGGCCG
GGGTTTAGGG CCGAGGCGTT AGTTAGGCCT ATGTCGTTGG GAACAATTAG GCCGGCGAGG
TTCGACCTCT ACTACTATAC AAGCAACCCA GGTACCACCC ACCCGATATG TGCAAACCTA
GTGGCTGACC AAGCAGTATC GCCGGTGTTG GTGTGGACGA ATTGGCGTCA AAACAGCGGC
GTCTGGAACG GCAGGACTTG GGACGACGCT GGTAGTGTAT ACTTCGACTC TTCGTGGTAC
CTCTTCACTA TTTACGGATC GAGCGACTCG ATAATCTACC AGATATACAG CTACAACTCC
ACGAGGGCTT TGAAGGCAAT CGCCACGAAG CGCGTGCAGG ATACGCCTTG GGGGGCCACC
AGCTGGCAGT TCTACATCGT CTTAGGAAGC GCTATTGTCG ACAACCCTGC GAGCACGACC
GCCAGCTGGG TTGAGAGGGC TCGCTACGCA TATGTAAGGC TGAGGCCGTG GGTTGAGCCC
CCGCCTGGCG TCATGTTGAC GACGCTCGCC GCGCCGCCAT CGGTGAGGCC GAGCAGAGTA
GATATCGTGG CCTCTAGGCT TGCCAATGTC TCGTCAAGAG TTGGGATGAG TGGGCTTATT
GGTCTTGCGG GGATTTCGGA CTTAAGGATA GAGAGGTCGG TTTCACTAAA TGCTTCAATC
ACCTCGGTTG CCCCGCCGGC GGTTGGGCCG AATCAGAACA CCTACACATA CAGTGTGGAG
GTGACCTCAT CAATTCCCAG AGGCTCGCTG GCGGCCTCTT TTACGCTGTT CTACAGCCTC
GGGAGCACCT ATGTGAACTC CACGTGCCCC TCAGACTTGT GCTCGGCGAC GCTTATCCGG
TACTTGGGGT TTGACGGCGC CCGGGACAGG GCTGTGTACC AAGTAAGCGT CACGGTGCCG
GCGTGGGTCA GCCACTCAAT AATCGTAAAT GTATTTGGCA CAAAAGTCGC ACTATCTCCA
ACGGCGCCTA GGCTGTATGT AGTAGGCGCG GGCGGGAGGT CGTGGTATAT CATCAATGAG
GGAGACGGTA CTGCTATTTT TATATTCCCC TGGAGCGGCG GCGCCTCGCC CGTTTACAGC
TACAGCCCCG ACGACCCTAG ATACGTGGGG GTAAACCACG TCACAGACGG GTCAAGAAAG
TGGACCGTTG TCCTAGTGCC TCCTGGAACC GTTATGAACA TAACCTTCGC CGCACCAGCC
GATGCGAGCT ACTTAAGAGA GGCGCTGGTG CCGTGGCAGA GGGCCTACTG GGCGGGACAG
GTTAGCCCGC CGTGTCCGGA TCCCAGATTC GTGAGAGTTT ACATGCCGGG AAACGCCACT
TTGAGGAGGT ACTTCGTCAT GATACCCCGG GATTTAGGCC TACAAAGGAA TTCACGCCCC
TCGCCGCCTG TTGCACATGC CTTCATAGGC GGGGGCTGGC GGCAGATCCC CACCTATAGA
GACGATGCGG GTATGCTGTG GCTTAGATTA GATGTACAGA GCTTCTCGCC GAACCAGCTG
ACTGGCAGAG CTGTCCTCGT GGCCTTATGC AGCTCGCCCA ATAGCAACCC CAGCGGAGAG
TCGTTTTTTG GTTTCTATTA CCAATCTACG AGATCTGGCC TGGCACTACT GCCGCCGCTT
GGCAATTATC CCGACGGCTA CACCGTCGTG GTGTGGCTAA AAAGCAGACG TTCAGTTGCA
CTAACTGACG TAGCCACTAC TCCAAGCTTC ACGTGCACCC CCACCAGGAA GATAATTGGA
ATGCACTACG TAGGCTCTCC CACTGGGTAT TACTGGAACT ACGACGGGTT CTGCTTCGAT
AACCACATAT GGGAGCAGAA AGGCGACGAT ACCTCCGACG CCACCTATGT GTATATGATA
TCTGTGAGCC AGTCAACTGT GTTGTACCAA ATAGCTACTG GATTCAACAC GGGGCTGTGG
TGGCTGAGGT CGTACCCCAA CAGAGCGCCG GCGATAGGCA ACCAGCCGCT GTTCTACTTC
TCGTATGTTT ATGCGGACGA TGCGTTTAGC TTTACCGTCG TGTCGTTTAC GTGGCCACGG
CCCTACTTCT ATCTGGACTT TGAAAGAAAA GGTCCCTACG CGACGACATG A
 
Protein sequence
MSPSPRRGPR GQIILLTALV LAFAVAIVMT AQISSSLGQT ASTYQTGYGI YAKAWPEAVD 
MADAHALQVS LKGASQISTG ALSSGLYSYW PTAGRWLTYN ATYIYLNKSL AAVKESFLPL
GAQLDFGAVV RYYGFNGSLG PYPLAAPLTR YNVTLVRQGY LTIPGGALYK LTVYWQAPDY
SGYYVVTYGA HVSEPGAVGE TYNVALAEVL QAFSRAGYDL KRRLFFFLVY SDSGGCKVRQ
LPWWAEIKKC PLCLLHVRLP NVGEIGMGDP ILRSGAASEI LAVLLPADLA GNQLALWLSG
PCSQGGGLTT QQITVNIDAN DNPYAVFAQT VQYDGPTPVS WSNWLFWGAA DIQGWRGATV
SAADCYPNPT AGGVRGSGVN LIYDSDFGQV AEVWAEYYAG TRNWAYNLYI PTPVSISGWP
GFRAEALVRP MSLGTIRPAR FDLYYYTSNP GTTHPICANL VADQAVSPVL VWTNWRQNSG
VWNGRTWDDA GSVYFDSSWY LFTIYGSSDS IIYQIYSYNS TRALKAIATK RVQDTPWGAT
SWQFYIVLGS AIVDNPASTT ASWVERARYA YVRLRPWVEP PPGVMLTTLA APPSVRPSRV
DIVASRLANV SSRVGMSGLI GLAGISDLRI ERSVSLNASI TSVAPPAVGP NQNTYTYSVE
VTSSIPRGSL AASFTLFYSL GSTYVNSTCP SDLCSATLIR YLGFDGARDR AVYQVSVTVP
AWVSHSIIVN VFGTKVALSP TAPRLYVVGA GGRSWYIINE GDGTAIFIFP WSGGASPVYS
YSPDDPRYVG VNHVTDGSRK WTVVLVPPGT VMNITFAAPA DASYLREALV PWQRAYWAGQ
VSPPCPDPRF VRVYMPGNAT LRRYFVMIPR DLGLQRNSRP SPPVAHAFIG GGWRQIPTYR
DDAGMLWLRL DVQSFSPNQL TGRAVLVALC SSPNSNPSGE SFFGFYYQST RSGLALLPPL
GNYPDGYTVV VWLKSRRSVA LTDVATTPSF TCTPTRKIIG MHYVGSPTGY YWNYDGFCFD
NHIWEQKGDD TSDATYVYMI SVSQSTVLYQ IATGFNTGLW WLRSYPNRAP AIGNQPLFYF
SYVYADDAFS FTVVSFTWPR PYFYLDFERK GPYATT