Gene Information |
Locus tag | Pars_0071 |
Symbol | |
ID | 5055762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 59049 |
End bp | 61013 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640467649 |
Product | hypothetical protein |
Protein accession | YP_001152338 |
Protein GI | 145590336 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
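The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based, inclusive start and end coordinates and a CDS that ends with a single stop codon; the function names are hypothetical and not part of this record or any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(59049, 61013)
print(length, protein_length_aa(length))
```

For this record the sketch reproduces the listed Gene Length (1965 bp) and Protein Length (654 aa).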
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.207619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTTGG ACACTTATTT AAAAAGGTAT CAGGTAAACA CCGTGGAAAA ACTCGCACCG ATTCTCCTAA TAGCGCTGGG GATATCGCTA ATCGCAGTAT ACTTCGCAAT TCCGCCCAAA GCCCCGTTCA CCACCACGGC GACGGCGGCC ACGAACACAG TCTTCAAAAC CACATCCACA AGCCCCTCCA TAACGCAGAC CATAGAAACA TACACCAGCA CTGCCCAGAC AACGCAACCA CCGCCGCAAC CGCCCTCCAC ACCGCCGGTC GCTAGGCCGC CGGTATTACT GCCATTGTTC AGGGTGGACG TTGTGGCACC TGACGTAGTA AACACAACGC GGATGCCAAT ACAGATAAAC TACACAGTAG TTGTCAGAAA CGTAGGCAAC GGCACAGGCT CTGTTTTAGT TGGCGGCAAA CACTACGTCA TAGACCCCGG CAAAGAGGTT AAAGTAAACG CGACGGAGAC AATCACATTT CCAGGTACCT ACACCTTAGA AGTAGAAGTT AACGGCACGC CGTATTCAAA GACGGTCAAG GTATTTTACT ACACGCCGGT TCTAGAAGCG GAACCTGTAA AAGTAAACGT AACCACCCTC CCCACAAACA TAACCGTGGC TGTGTTGGTT AGGAACAGGG GAAATCTAAC CGCAGTAGTT GAAGGCGTGG AGATAAGACC GGGAGAGGCA AGGACAATAA ACAAGACCAT AACAGTAACC GCGGCCGGTT ACTACTTCAT CAACGTAAGC GGCGTCAACG CCCCTATCGC CGTTAGTTAC TACACTCCAA AACTAGAGTG GAAAATAGGA GGCCCAGAAG AGGTGGAAGC AGTGCCAGGA GAGAGCTCCT CGGCCTGGCT GTGGCTAAAG AATGTCGGCA ACGTCACAGC TAAATTCGTC GTAGACGGCA GACAAATTGT GTTACCGCCT GGTAGCGCAG TAAACCTAAC AAAGTCAGTC ACCGTGTCCA CTGCTGGATA CTATACAGTA GAGTTCGTAG TCAGAGGCCA ACTCAACGCG ACTATGAAAC ATGCAGTAAA AGTAAAGATA ATTGCAACAC GCGTAGAGCT CATAACGTGG TCTCCTGAGC TTAGAAGGAG GTGGCCACAG CCAGGCTCAA CTGAGTCTAT TACGCTGAGC GTGCCTAACA AAACAGTGGC CATGACGTGG GGGTACATCA TCTCGACAAA TGCAACTAGG AGAAGCACGA CCATAGTAGT CGAAGACCCA GACGGTGTAC AGCAATACCA ACTCACGCCA GGCGCCGCCC TTTCAAAAAA CTTTACAATA GTCATGGAAG CGCCGGGCGA GAGAACAGTG GCTATCAAGG TTAATTCGAC TACCTACGGC CTAGTTGTAT CTCTAAAACT CACCCCACCA AAAGTGACAG TAAGAGATAT TACAAAAATA GATTTCTCTG ACAGTAGACC ACTACTTGCC ATCAAAATTA GTTGCAGATA TGCAGACATA TCTTTTGATA TCTTAGAAGT ATCAGGGACG CTCTTATTCA CGCAAACCGG CCGGTCTATT TCCGGCACGA TAATAGTTAG ATCAGCAAGA GGCGTCGATA CTGGTAGCTA TTCGGGCCAG GCAGAGGGGG GGAGGGGATT TCTGAATCTT AACTTGCTTG GGAGGAACGT ACATGTAGAA TTCTCATTGC AACCGGTCAT CATAACGAGA GTGGAGGTCG ACGGGACGCC GTACGACTGC AAAGTCCCGC TAGAGCTGAT ACCCACGATC CTCTATGGCG ACAAGCCCAC CGCCTCTGAC GAGCCGGCAG ATCGATACGC AATGAGATTA 
ATATCGGCAT TTGCGAGGGG AGACAACGGA GCACCGCAGT GGGCGGTATG GAACGGGGAG TACGTAGAAG TTAGGGACAG AGAGGGACAT GTGATGAAGG TGTATTTCGA AAAGGGCACT GTTAGAATAG AAGGCCCCCT CCAGGCTTAT ATCGTTATAT CCTGA
|
Protein sequence | MSLDTYLKRY QVNTVEKLAP ILLIALGISL IAVYFAIPPK APFTTTATAA TNTVFKTTST SPSITQTIET YTSTAQTTQP PPQPPSTPPV ARPPVLLPLF RVDVVAPDVV NTTRMPIQIN YTVVVRNVGN GTGSVLVGGK HYVIDPGKEV KVNATETITF PGTYTLEVEV NGTPYSKTVK VFYYTPVLEA EPVKVNVTTL PTNITVAVLV RNRGNLTAVV EGVEIRPGEA RTINKTITVT AAGYYFINVS GVNAPIAVSY YTPKLEWKIG GPEEVEAVPG ESSSAWLWLK NVGNVTAKFV VDGRQIVLPP GSAVNLTKSV TVSTAGYYTV EFVVRGQLNA TMKHAVKVKI IATRVELITW SPELRRRWPQ PGSTESITLS VPNKTVAMTW GYIISTNATR RSTTIVVEDP DGVQQYQLTP GAALSKNFTI VMEAPGERTV AIKVNSTTYG LVVSLKLTPP KVTVRDITKI DFSDSRPLLA IKISCRYADI SFDILEVSGT LLFTQTGRSI SGTIIVRSAR GVDTGSYSGQ AEGGRGFLNL NLLGRNVHVE FSLQPVIITR VEVDGTPYDC KVPLELIPTI LYGDKPTASD EPADRYAMRL ISAFARGDNG APQWAVWNGE YVEVRDREGH VMKVYFEKGT VRIEGPLQAY IVIS
|
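The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the NCBI ordering of the standard codon table (which translation table 11 shares for amino acid assignments) and treats a recognized table 11 initiator in the first position as methionine, which is why the leading GTG appears as M in the protein. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical helpers, not part of any library.

```python
# Codon table built from the NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiator codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str, first_codon_is_start: bool = True) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used to group bases
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and first_codon_is_start and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 20 bases and last 15 bases of the gene sequence in this record.
print(translate_cds("GTGAGTTTGG ACACTTATTT"))
print(translate_cds("ATCGTTATAT CCTGA", first_codon_is_start=False))
```

The two fragments translate to the first six and last four residues of the listed protein sequence (MSLDTY and IVIS); the second call disables start-codon handling because that fragment comes from the end of the gene.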
| |