Gene Information |
Locus tag | Pars_1690 |
Symbol | |
ID | 5054282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1525553 |
End bp | 1527397 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640469231 |
Product | hypothetical protein |
Protein accession | YP_001153893 |
Protein GI | 145591891 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
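
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1525553
end_bp = 1527397

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1845 614
```

Both values match the reported Gene Length (1845 bp) and Protein Length (614 aa).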
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000220008 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGAGGG TTGTCACCGC CTTTGACCTG CTGGCGTCTG TGGCTGAGAT GTCGCGTCTG GCAGGTGGGA AGCTGGAAAA CGTATACAGA ACCGGCGCCG GGTACCTTTT CAAATTCGCC GGGGGCTTCG TGGCTGTCAC CAAGTTCAGG GTTTCCCTGA CCGGCATCGT CCCCGAGAAG ACGCACGAGG GGGCTGAGAC GTTGAGGGGG CTTTTCCGCG ACGAGAGGCT CCTCGCAGTC TCTATGCCCC GCTTCGACCG GATTGCGGAA TTCGTCTTCC CCACCGGAAG GCTGGTGGCC GAGCTCTTGG AGCCGTTTAA CATAGTCGCA GTCCGCGAGG GCAGGGTTGT CTGGCTCATG CACAGCTACA AGGGCAAGGA TAGGGCCGTC GTACCAGGAG CGGCCTACGC CTACCCGCCT GCCGTCTTCG TGGACGCCTT GTCAGCCGAC GTGGAGGAGT TAGCCAAGGC CATAGACCCC AGCGACCTTA GGCGTAGCCT AATTAGGAGG CTGGGCACCG GCCCGGAGCT CGCAGACGAG CTGATAGCCC GGGCGGGGGA GTCTCCCCGG GACATCGCGG CGGAGTTTAA GACGCTCATC GAGAGGGTGC GGGCCGGGGC TCTGGAGCCG ACGGTCTGCA TCAAAGACGG CGTCCCCGTA ACTGTCATGC CGATTAGGCC CGTCTCCCTC AACTGCGACG AGTACAAAAG CTTCGACTCC TTCTGGTCAG CCCTGGACTT CTACTTCTCC CCCATGGAGC TGGAGGCAAC GGCGGCTCAG GCAACGCAGG GCATAGCCCA GAGGCGTAAG AGGCTGGAGG CCTCAATCAA GGAACTGGAG GAGAAAATTC CTGAATACAG GAGCGAGGCG GCCAAGCTCA AGGCGGTTGC CCACAAGCTC CTTGTGTATA AGGTAGAGAT AGAAGAGGCG CTTGCCGGCA GGGAGTCCAG TATACGTGTA GTAAACGTAG ACGCCTCCAA GATTAGGATA GAGTTGCCAG AGGGAGGGGG CGTAGAGCTC AAGAAGGGCC TACCCCTAGG CCGCCAGATC ACTGAGCTTT TCGAAAAGGC GAAGGAGCTG GAAGAGAAGG CGCGGAAGGC GGAGCAGGTG TTGGAGAAGC TCAGGAAGGA GCTCTCTGCC CTCGAAGAGC AACAACGCCG AGCGGAGGAG GCGCTGAAGG CGTCGGCCAA GGTGGTGGCT AAGAGGAGCT GGTTTGAGAA ATTCCACTGG ACGGTCACTA CTGGGAGGAG GCCGGTGATA GGCGGCAGAG ACGCGTCGCA GAACGAGGCG GTGGTTAGGA AGTACCTGAA AGACCACTAC TTCTTCTTCC ACGCCGACAT ACCCGGCGCC TCCGCCGTGG CGGCCCCACC CATGGATGAT CCGCTTGAGA TCTTGCAAGT GGCCCAGTTC GCCGCGGCGT ACAGCAGGGC GTGGAAAATC GGCATCCACG CCGTCGACGT ATACTACGTA AGGGGGGAGC AAGTGTCGAA GCAACCCCCC TCCGGCCAAT ACCTGGCCAA AGGATCCTTC ATGGTGTACG GTAAGAGGGA GTACGTAAGG CACATCCGCC TAGAGCTGGC GGTGGGCTGT AGAAGAGACG GCGACATCTA CAGAGCCGTG GCGGCCCCAC CGAAGTCGGC CCCCCTACTC GCCGAGAGAT ACGTGGTGGT GACCCCCGGC AATAAGGAAA AGGGGAAGCT GGCCAAAGAG CTGGCCGAGA AGTGGGGCGG TTGCCCCGTA GACGAGATAG CCGCCGCTCT TCCCGGGCCA TCCCGAATTT CGGAAGAGGG GCGCGGCGCG 
CCGATACCGT GGGACGAGGT GGAACAAATA TTTGCTACGT GGTGA
|
Protein sequence | MKRVVTAFDL LASVAEMSRL AGGKLENVYR TGAGYLFKFA GGFVAVTKFR VSLTGIVPEK THEGAETLRG LFRDERLLAV SMPRFDRIAE FVFPTGRLVA ELLEPFNIVA VREGRVVWLM HSYKGKDRAV VPGAAYAYPP AVFVDALSAD VEELAKAIDP SDLRRSLIRR LGTGPELADE LIARAGESPR DIAAEFKTLI ERVRAGALEP TVCIKDGVPV TVMPIRPVSL NCDEYKSFDS FWSALDFYFS PMELEATAAQ ATQGIAQRRK RLEASIKELE EKIPEYRSEA AKLKAVAHKL LVYKVEIEEA LAGRESSIRV VNVDASKIRI ELPEGGGVEL KKGLPLGRQI TELFEKAKEL EEKARKAEQV LEKLRKELSA LEEQQRRAEE ALKASAKVVA KRSWFEKFHW TVTTGRRPVI GGRDASQNEA VVRKYLKDHY FFFHADIPGA SAVAAPPMDD PLEILQVAQF AAAYSRAWKI GIHAVDVYYV RGEQVSKQPP SGQYLAKGSF MVYGKREYVR HIRLELAVGC RRDGDIYRAV AAPPKSAPLL AERYVVVTPG NKEKGKLAKE LAEKWGGCPV DEIAAALPGP SRISEEGRGA PIPWDEVEQI FATW
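
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes NCBI translation table 11 (the table listed in this record), in which the amino acid assignments match the standard code and GTG is one of the permitted start codons, read as M at the first position. The function name `translate` and the helper constants are hypothetical, and only the first three 10-base blocks of the gene sequence are used here.

```python
from itertools import product

# Codon order used by NCBI genetic code listings: bases T, C, A, G,
# with the first base varying slowest and the third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a DNA string, ignoring the spaces between 10-base blocks."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        # Stop at the first stop codon.
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
prefix = "GTGAAGAGGG TTGTCACCGC CTTTGACCTG"
print(translate(prefix))  # prints: MKRVVTAFDL
```

The output matches the first 10-residue block of the protein sequence above.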
|
| |