Gene Information |
Locus tag | Pars_1462 |
Symbol | |
ID | 5055496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1318396 |
End bp | 1320465 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640469002 |
Product | hypothetical protein |
Protein accession | YP_001153671 |
Protein GI | 145591669 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.947543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCATA TATTGGCAAT AATGGCAATA ACCGCCGTCG CCCTCGCTGT AGTGTACGTC TCAGAGGGCG GAGACGTGGT GTACTACTCC ACGCTGGGGG GATTCGGCGC GTTAAACGGC CTTGGCCAGA GGCTGTGGCA CGTGGACGCC CCCGGCGCTT TAATAGCCAC AGACCCCATG GGCTCTTGCC TAGCCGTCGC GCACCCCCTC GGCAACGCCA CGAGCTGGCT CGGCACCCGC GTGGCGCTGT ACGTCAGAGG CGCGGCGTTG TGGTCAGCCG TCTTGAAGCT AAACGCCTCT GCCATAGCCA CCGACTGCAA CCGAATAGCA GTAGGCACCA TGGACGGCCG GATTGTCGAG TTGCAAAACG GAAGAGTCGC CTCTGAGAAA ACAGTCGGAG TGCCCGTAAT ATCGCTGGTC TACGACGGCG GTGCTCTTAG ATACGGCGTG TGGAGGCCGG GCTACGTGGA GCACCCCCTC CGCTGTGGAT ACACCGTGGC GCTGGCCAAG AGGGATAAGC CGTACGTGGT GGTAGACGGG AGGGAGTACG TCGGCTTCGG CGAGTTGCTC TCGCTAAGCC CGCCCGCCGC CGTCTCGCAA AACTGCGTCT TGACCTTCGC CGCTGAGGGC GCCGTGTACT GGGGCTCCGC CGCCATCCCT GTTAAAGAGC CTGTATACGC CGTTGCCATC TCCGGCGACG GCAACGTCTT GGCGGTGGGC TTCGCCGACA GGGTGGAGCT CTACCGCGGT GGGCAGCTGG CGGCGTCTAT ACAGGCAAAC ATGCCTAGAT CCCTCGCCCT TGACTTCCGG GGCTTTACGC TCGCCGTCCA AGACGACTCC GGGGTGCAAG TATACTCCTT CACCCAGAGA GAGGTGGAGG TGGTTGGTTG TCCCTACGGC GTGATCAAAG CCGGCGTAGC TACTTACAAC GTCACTGGGA GGGCCGTGGT GTACGTGCCC AGGGGGGCGG AGCTAACGCC GTTGCGCATC AACTTCACAG ACGGGGTCTG CGCCCCCGCC GGCTTCGACG GCCGCGTCGT CAGCTACCAG CGGCTTTACA GAGTCGAGGT GACGCCGCCT GCCAAGGGCC CCGAGCTGGC CGCGGGCCCC ACCGCCTACG CCGCCCCCCT CGAAGCCGAG GTGAGGGCTA AAACCGGCGT TTTAAAGGCG TATTTAGCCG GGTGGCTTGT CGGCGGGAGG AGGATGCCGC CGGTTCCAGT GTTAACTGTA GACGTGAGGA ACGCGACGGC TGTAGCCCCG CTCTACAGAC TAGACGTGGC GGCGGAGTTA GTTGAAGGCG GAGTAAAACG GGTGTTAAAA GGCGTGGCGG CTTACGACAG CAAGGGGGCG CCTATGGCGC CGGTGGAGGA CTACGTCTAC CCAGGCCTCC CCGCATATGT AGAGACTTAC TACGACGAGT ACTACCTCCT CCGCGCGGAG GCTTATACAC GTAGCACGTT CAATGCCACA GAGCTCTGGC TTAAGCCGGG CCAGACCGCG GTGATCTACG CCGACGAAGT CGTCGACTTT GGCAACTCCA CTAGACTCGT CTTCACCGGG TGGAGCGACG GGTCTAAAGA GCTTAGACGC GCCGTGGGGC CGGGGACATA CACGGCTAGG TACAAGGTGC AGTATCTCGT GACGTTTAAT GCGCCTAACT ACACCGCGGC CGTCTGGGCA GACGCGGGGG CTAAGCCGCC GGCGCCTAAG CCCCCTGAGA AGCTCTACGA CGACGGCAGT ACTAGGATTT GGTTCAACGG GTGGCAACTG CCGGAGAGAG TAGACGGGCC GCTTAACGTC 
ACCGCCAACA CGGCGAGGGA GTACCGCGTC GTGTTGAAAT ACCCTTGGGG CCAGGAGGAG AGGTGGCTCC CCCACGGCTA CGCCCTGCTA CCGCCGGACC GCAACCGCTA CAACGTCTTC TGGCGCTTCT CCCACTGGGC CCCGAGCGAC GTAGTCGCGG GGCCGGGCGT TTACGAGGCG GTGTACCAGC TGGACGCCTT TGCAGTTGCG GCCATCGCAT CTGTGGTAAT TATAACCGCC GCAGTTGCGC TTTGGCTAAA GAGGCGGTGA
|
Protein sequence | MRHILAIMAI TAVALAVVYV SEGGDVVYYS TLGGFGALNG LGQRLWHVDA PGALIATDPM GSCLAVAHPL GNATSWLGTR VALYVRGAAL WSAVLKLNAS AIATDCNRIA VGTMDGRIVE LQNGRVASEK TVGVPVISLV YDGGALRYGV WRPGYVEHPL RCGYTVALAK RDKPYVVVDG REYVGFGELL SLSPPAAVSQ NCVLTFAAEG AVYWGSAAIP VKEPVYAVAI SGDGNVLAVG FADRVELYRG GQLAASIQAN MPRSLALDFR GFTLAVQDDS GVQVYSFTQR EVEVVGCPYG VIKAGVATYN VTGRAVVYVP RGAELTPLRI NFTDGVCAPA GFDGRVVSYQ RLYRVEVTPP AKGPELAAGP TAYAAPLEAE VRAKTGVLKA YLAGWLVGGR RMPPVPVLTV DVRNATAVAP LYRLDVAAEL VEGGVKRVLK GVAAYDSKGA PMAPVEDYVY PGLPAYVETY YDEYYLLRAE AYTRSTFNAT ELWLKPGQTA VIYADEVVDF GNSTRLVFTG WSDGSKELRR AVGPGTYTAR YKVQYLVTFN APNYTAAVWA DAGAKPPAPK PPEKLYDDGS TRIWFNGWQL PERVDGPLNV TANTAREYRV VLKYPWGQEE RWLPHGYALL PPDRNRYNVF WRFSHWAPSD VVAGPGVYEA VYQLDAFAVA AIASVVIITA AVALWLKRR
|
| |
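The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1318396
END_BP = 1320465
GENE_LENGTH_BP = 2070
PROTEIN_LENGTH_AA = 689


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2070
print(expected_protein_length(GENE_LENGTH_BP))  # 689
```

Both results match the Gene Length and Protein Length fields reported for Pars_1462.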