Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2307 |
Symbol | |
ID | 5056098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2063132 |
End bp | 2064742 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640469859 |
Product | hypothetical protein |
Protein accession | YP_001154503 |
Protein GI | 145592501 |
COG category | [S] Function unknown |
COG ID | [COG3356] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00335528 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGTCTT TTGAGAAGGG GTATAGCATT CTCTTTGGGC GTTCGCCGAG GAGGGTTGCG CTTTACGCAA CGGCCCTCTT AGCCTTCTTA GCGGCTTTGA AGGCGCTGTC CGCACAGCGG GCGCCACTTC TTTACGCCTT GTTCGGCGGC GTGATTCTCC TCATACTGCT CTCAGCGGAT CGCGCCGTGA TTAACCCGCG CAGATCTTAC TACGTCGCGG TTATATCGAC GCTGGTGGTC TCCTTCTTCG ATTTATTATT TCAAAAAGCT CCGCTGACCT TTGCCCTAGT CGGCGCCGTG ATAACCGCGG TGGTCCTGCA GTCGCTTAAA TGCAGGAGCT TTTGGTACAT CGCTCCACTC GTCGCGGTCT CAGCTATTTA CTACGCGGTG GGGGAGCTGT ACCTTTTCGC CATTTCTCTC CTTTACATAT CGGTGCTCCA GCTGACTAGG TTTGTTATAA ACAAGATGGT GAGGGGTCTC GACGCCATGT GCATGTTTTC GAGCTTTATC TACTCCGTCT TTGCAGAAGA TGACGTTTTG GAAGACGCCT TCAGGGAGTT GGGCAGGTTG GAAAGGGTGC CTCTCCACGT CTTTATCATC GGCGGGAGGC ACGTCGTCGT TGTGTCGGAC TTCCACCCAG GGCCGTTTAG GCACATCGGC GGCGGTATGC TGGTAGATGA GTTGCAGAAA GCGGTTGAGG GTATGGGGTA CAGCTTCACC TTTCTCCACG GCGTTGGTAG CCACGAGCGC GACCCCGTGG ACGGGGAATC CCTCAGGAGA ATAGTAAACG CGGTCAAGAC TGTCTTGGCC TACGGGCGAA ACGGAGCCCC GCCCAGGGGG ATCTATCCGC AGAGCCACAT TGTTGGGGAC GTAAAGGTAG TGGGCCTCAG CCTCGGCGCA CCGCCGTACC TAGCAGTGGT GAGCAGGGTG AACTCCGCCT CGGACGACAT CCCCACCTGG GTTAGCCGGC TTGTGGACAC CGGCGCGTAT ATACTAATCG ACGCACAGAA CAAATTCGAC GGCGCGGTGC AGTGGCGCGA GGTGGACGTG GCGTCGCTCT CCAAGGGGCT GAAAGCCCTC CAGGAGGCCC CGCAGTGCCG CGTCTTCAAA ATCGGCGTGG GCAAAGTAAG TGCGCACCAC CTCGATGTCC TGGGCTACGA GATTGGGCCG GCGGGGATAT CGGCAATAGT AGGCGAGTGC GACGGGGCGA GGAGCTTGCT GGTAGTTTTT GACGGGAACA ACCTACACAG CGAGTTGTAC AACAAGATCG TAGACACGTT CGAGAGCCGT GGCTACAAGC TGGTTGAAGT AGTAACCACC GACACTCACA GGGCCACGGG AATTGGCATC GGCAAGGGAT ACCGCATAGT GGGCGAGCGC ATAGACCATG GACAGATCTT AAAGGCCGTA GAAGAGGCTG TGTCCATCGC CGAGAGATCG CTCGGCGACC ACAACGTAGA CTACAAGAGG GTAGAGGTTG AGGCGTACGT CTTGGGTGAG GAGGGCTTTA GGAAGATCCA AGACGCCGTG AGGATGTACA AGAAAGTCGG GGTGTTGATC GCGGCGGTTG TATTCGCCCT GCCAATTCTC CTAATTTCGC TTTTAGCATA A
|
Protein sequence | MRSFEKGYSI LFGRSPRRVA LYATALLAFL AALKALSAQR APLLYALFGG VILLILLSAD RAVINPRRSY YVAVISTLVV SFFDLLFQKA PLTFALVGAV ITAVVLQSLK CRSFWYIAPL VAVSAIYYAV GELYLFAISL LYISVLQLTR FVINKMVRGL DAMCMFSSFI YSVFAEDDVL EDAFRELGRL ERVPLHVFII GGRHVVVVSD FHPGPFRHIG GGMLVDELQK AVEGMGYSFT FLHGVGSHER DPVDGESLRR IVNAVKTVLA YGRNGAPPRG IYPQSHIVGD VKVVGLSLGA PPYLAVVSRV NSASDDIPTW VSRLVDTGAY ILIDAQNKFD GAVQWREVDV ASLSKGLKAL QEAPQCRVFK IGVGKVSAHH LDVLGYEIGP AGISAIVGEC DGARSLLVVF DGNNLHSELY NKIVDTFESR GYKLVEVVTT DTHRATGIGI GKGYRIVGER IDHGQILKAV EEAVSIAERS LGDHNVDYKR VEVEAYVLGE EGFRKIQDAV RMYKKVGVLI AAVVFALPIL LISLLA
|
| |