Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0129 |
Symbol | |
ID | 5055510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 114277 |
End bp | 117627 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640467708 |
Product | hypothetical protein |
Protein accession | YP_001152396 |
Protein GI | 145590394 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.294302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCCGT CGCCTAGGAG AGGTCCGAGG GGTCAGATAA TTCTGCTCAC CGCCCTAGTC CTAGCTTTCG CCGTGGCGAT CGTCATGACG GCCCAGATAT CGTCCAGCCT AGGCCAAACT GCATCCACCT ACCAGACGGG GTATGGCATA TACGCAAAGG CGTGGCCCGA GGCGGTGGAC ATGGCGGATG CCCACGCCCT TCAAGTCTCG TTGAAGGGGG CATCGCAGAT CTCCACAGGC GCCCTTTCGT CGGGGCTGTA TTCATATTGG CCTACTGCCG GCAGGTGGCT TACGTACAAC GCCACTTATA TTTACCTTAA CAAGTCCCTT GCGGCGGTTA AGGAATCGTT CCTTCCCCTG GGGGCTCAGC TGGACTTCGG CGCGGTGGTG CGCTACTACG GCTTCAACGG GAGTCTCGGG CCCTACCCGC TGGCGGCTCC CCTTACGAGG TACAACGTGA CGCTTGTGAG GCAGGGCTAT TTGACTATCC CCGGCGGGGC GCTGTACAAG CTGACGGTGT ACTGGCAGGC GCCTGACTAC TCTGGATACT ATGTAGTGAC ATACGGCGCA CATGTCTCCG AGCCTGGCGC AGTGGGCGAG ACGTACAACG TTGCGCTGGC CGAGGTTCTA CAAGCCTTTA GCAGAGCGGG GTACGACCTA AAGAGGAGGC TCTTCTTCTT CCTCGTATAT TCCGACTCCG GCGGCTGTAA AGTTAGGCAA CTGCCCTGGT GGGCAGAGAT TAAGAAATGT CCCCTGTGCC TCCTCCACGT CAGACTGCCC AACGTAGGGG AGATTGGGAT GGGCGATCCG ATACTGCGTA GTGGAGCCGC CTCGGAGATC CTCGCAGTCC TCCTGCCTGC CGACTTGGCA GGAAACCAGC TTGCCCTGTG GCTCTCCGGG CCCTGTTCTC AAGGCGGCGG TCTGACCACG CAACAGATTA CTGTTAACAT AGATGCTAAC GACAACCCGT ACGCAGTGTT CGCCCAGACC GTCCAATACG ACGGCCCCAC GCCGGTTAGC TGGTCAAACT GGTTGTTCTG GGGGGCGGCG GATATTCAAG GCTGGAGGGG GGCTACTGTT AGTGCGGCAG ATTGCTATCC CAATCCAACA GCTGGAGGTG TAAGAGGCAG TGGGGTCAAC CTAATCTACG ACAGCGACTT TGGCCAGGTT GCTGAGGTGT GGGCGGAATA CTACGCGGGG ACGAGGAACT GGGCTTACAA CCTCTACATC CCCACGCCTG TATCAATATC TGGCTGGCCG GGGTTTAGGG CCGAGGCGTT AGTTAGGCCT ATGTCGTTGG GAACAATTAG GCCGGCGAGG TTCGACCTCT ACTACTATAC AAGCAACCCA GGTACCACCC ACCCGATATG TGCAAACCTA GTGGCTGACC AAGCAGTATC GCCGGTGTTG GTGTGGACGA ATTGGCGTCA AAACAGCGGC GTCTGGAACG GCAGGACTTG GGACGACGCT GGTAGTGTAT ACTTCGACTC TTCGTGGTAC CTCTTCACTA TTTACGGATC GAGCGACTCG ATAATCTACC AGATATACAG CTACAACTCC ACGAGGGCTT TGAAGGCAAT CGCCACGAAG CGCGTGCAGG ATACGCCTTG GGGGGCCACC AGCTGGCAGT TCTACATCGT CTTAGGAAGC GCTATTGTCG ACAACCCTGC GAGCACGACC GCCAGCTGGG TTGAGAGGGC TCGCTACGCA TATGTAAGGC TGAGGCCGTG GGTTGAGCCC CCGCCTGGCG TCATGTTGAC GACGCTCGCC GCGCCGCCAT CGGTGAGGCC GAGCAGAGTA GATATCGTGG CCTCTAGGCT TGCCAATGTC TCGTCAAGAG TTGGGATGAG TGGGCTTATT GGTCTTGCGG GGATTTCGGA CTTAAGGATA GAGAGGTCGG TTTCACTAAA TGCTTCAATC ACCTCGGTTG CCCCGCCGGC GGTTGGGCCG AATCAGAACA CCTACACATA CAGTGTGGAG GTGACCTCAT CAATTCCCAG AGGCTCGCTG GCGGCCTCTT TTACGCTGTT CTACAGCCTC GGGAGCACCT ATGTGAACTC CACGTGCCCC TCAGACTTGT GCTCGGCGAC GCTTATCCGG TACTTGGGGT TTGACGGCGC CCGGGACAGG GCTGTGTACC AAGTAAGCGT CACGGTGCCG GCGTGGGTCA GCCACTCAAT AATCGTAAAT GTATTTGGCA CAAAAGTCGC ACTATCTCCA ACGGCGCCTA GGCTGTATGT AGTAGGCGCG GGCGGGAGGT CGTGGTATAT CATCAATGAG GGAGACGGTA CTGCTATTTT TATATTCCCC TGGAGCGGCG GCGCCTCGCC CGTTTACAGC TACAGCCCCG ACGACCCTAG ATACGTGGGG GTAAACCACG TCACAGACGG GTCAAGAAAG TGGACCGTTG TCCTAGTGCC TCCTGGAACC GTTATGAACA TAACCTTCGC CGCACCAGCC GATGCGAGCT ACTTAAGAGA GGCGCTGGTG CCGTGGCAGA GGGCCTACTG GGCGGGACAG GTTAGCCCGC CGTGTCCGGA TCCCAGATTC GTGAGAGTTT ACATGCCGGG AAACGCCACT TTGAGGAGGT ACTTCGTCAT GATACCCCGG GATTTAGGCC TACAAAGGAA TTCACGCCCC TCGCCGCCTG TTGCACATGC CTTCATAGGC GGGGGCTGGC GGCAGATCCC CACCTATAGA GACGATGCGG GTATGCTGTG GCTTAGATTA GATGTACAGA GCTTCTCGCC GAACCAGCTG ACTGGCAGAG CTGTCCTCGT GGCCTTATGC AGCTCGCCCA ATAGCAACCC CAGCGGAGAG TCGTTTTTTG GTTTCTATTA CCAATCTACG AGATCTGGCC TGGCACTACT GCCGCCGCTT GGCAATTATC CCGACGGCTA CACCGTCGTG GTGTGGCTAA AAAGCAGACG TTCAGTTGCA CTAACTGACG TAGCCACTAC TCCAAGCTTC ACGTGCACCC CCACCAGGAA GATAATTGGA ATGCACTACG TAGGCTCTCC CACTGGGTAT TACTGGAACT ACGACGGGTT CTGCTTCGAT AACCACATAT GGGAGCAGAA AGGCGACGAT ACCTCCGACG CCACCTATGT GTATATGATA TCTGTGAGCC AGTCAACTGT GTTGTACCAA ATAGCTACTG GATTCAACAC GGGGCTGTGG TGGCTGAGGT CGTACCCCAA CAGAGCGCCG GCGATAGGCA ACCAGCCGCT GTTCTACTTC TCGTATGTTT ATGCGGACGA TGCGTTTAGC TTTACCGTCG TGTCGTTTAC GTGGCCACGG CCCTACTTCT ATCTGGACTT TGAAAGAAAA GGTCCCTACG CGACGACATG A
|
Protein sequence | MSPSPRRGPR GQIILLTALV LAFAVAIVMT AQISSSLGQT ASTYQTGYGI YAKAWPEAVD MADAHALQVS LKGASQISTG ALSSGLYSYW PTAGRWLTYN ATYIYLNKSL AAVKESFLPL GAQLDFGAVV RYYGFNGSLG PYPLAAPLTR YNVTLVRQGY LTIPGGALYK LTVYWQAPDY SGYYVVTYGA HVSEPGAVGE TYNVALAEVL QAFSRAGYDL KRRLFFFLVY SDSGGCKVRQ LPWWAEIKKC PLCLLHVRLP NVGEIGMGDP ILRSGAASEI LAVLLPADLA GNQLALWLSG PCSQGGGLTT QQITVNIDAN DNPYAVFAQT VQYDGPTPVS WSNWLFWGAA DIQGWRGATV SAADCYPNPT AGGVRGSGVN LIYDSDFGQV AEVWAEYYAG TRNWAYNLYI PTPVSISGWP GFRAEALVRP MSLGTIRPAR FDLYYYTSNP GTTHPICANL VADQAVSPVL VWTNWRQNSG VWNGRTWDDA GSVYFDSSWY LFTIYGSSDS IIYQIYSYNS TRALKAIATK RVQDTPWGAT SWQFYIVLGS AIVDNPASTT ASWVERARYA YVRLRPWVEP PPGVMLTTLA APPSVRPSRV DIVASRLANV SSRVGMSGLI GLAGISDLRI ERSVSLNASI TSVAPPAVGP NQNTYTYSVE VTSSIPRGSL AASFTLFYSL GSTYVNSTCP SDLCSATLIR YLGFDGARDR AVYQVSVTVP AWVSHSIIVN VFGTKVALSP TAPRLYVVGA GGRSWYIINE GDGTAIFIFP WSGGASPVYS YSPDDPRYVG VNHVTDGSRK WTVVLVPPGT VMNITFAAPA DASYLREALV PWQRAYWAGQ VSPPCPDPRF VRVYMPGNAT LRRYFVMIPR DLGLQRNSRP SPPVAHAFIG GGWRQIPTYR DDAGMLWLRL DVQSFSPNQL TGRAVLVALC SSPNSNPSGE SFFGFYYQST RSGLALLPPL GNYPDGYTVV VWLKSRRSVA LTDVATTPSF TCTPTRKIIG MHYVGSPTGY YWNYDGFCFD NHIWEQKGDD TSDATYVYMI SVSQSTVLYQ IATGFNTGLW WLRSYPNRAP AIGNQPLFYF SYVYADDAFS FTVVSFTWPR PYFYLDFERK GPYATT
|
| |