Gene Information |
Locus tag | Pars_0980 |
Symbol | |
ID | 5054440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 870144 |
End bp | 871439 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640468536 |
Product | hypothetical protein |
Protein accession | YP_001153212 |
Protein GI | 145591210 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
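
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length of an unspliced CDS includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 870144, End bp 871439.
length = gene_length(870144, 871439)
print(length)                  # 1296
print(protein_length(length))  # 431
```

Under these assumptions the results agree with the Gene Length (1296 bp) and Protein Length (431 aa) fields.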
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000000197684 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCTCCC TCCTAAACGT AGACTACAAC GACGAGCTCA CAAGGCTGAG GGTACACGAA ATTGTAAAGT ACTTCCAGAG ACTTGGAGTA CCCATAAAAC TTTCGAAAAT ATCCAACGAC ATAATTGCAG ACTCGTTCTA CGTCCACTAC AGGACGCCCA TTCTCAAAGA CGGCCCAGAC AAACAAGAAG AGTCTCTGTG GCACGTATTT ATCAAGACCT ACACATCGTC GGACGTATAC CACGAAATAT CTAAGATCAG CCGCTATAAC TACCAAGTAT CCAAATCAGC ATCTGTCAAG TTGCTGAGAG CTTACAACAG CTTACTGTCT AGGATCGAAA GAGGCGCCGT AGAGGGCTTC GAGGACCAGA AACAAGATTT TAGAGACCTC AGCGAGAACC AGCAACTGCG CAACGAGATA AGCAACCTCC TCCGCTTCTA CATGGGCAAT GTGCGGAACA TAGAAAAGCT GAGGAAATCC ATGACGAAGG CACTTGGAAA CGAGGTGGGC AAGGAGACTG CAGAGCTTCT GTTCGACATA GACATAGACC CCTACAGGGC GAGACTGGCG AAAATTTTAG AATCCCTAGT AGAGATGCTC TCCGCAGTCA AAGAAGAGGT CGACCAGGGA GATGTACAAG AACGGCGTGG GGTCATCTCC GGCGTGACGC GGATAAGGAC ATACAGCGAT CTACAAAAAG CTACGAACCT AAGCAAGGCA ATATACCTCC AGTCCAGGGA GCTCTTCGGC TACAAGCTCG CCACAAAGTC GCTTTCCATT TACGACTTAG CTCTCGACAC AAGGGACAGG GTATACCTAC TGGTTGACAA GTCCGGGTCC ATGTTCTACA GCCTATACGA CGGCGTAGCC ATGGACATGA CGCAGAAAAT AACCTGGGCA ACCGCCTTGG CTATTGCCGT GATGAAGAAG AGCAAGAGGA CCGTATTGAG GTTTTTCGAC CAGATGGTCT ACCCCCCCAT AACCAACGTG AAAGACATCA TCCGGTCGCT ACTCCGCGTC CTACCCCTAG GCGGCACAGA CATCACAGCC GCCGTCCACA CAGCCGTCAG AGACGCAAAA CAACAAAGCC TACACAACTA CAAGCTGGTA ATAATTACAG ACGGCGAAGA CGACATGATC CACCCAGAAG TTCTAAAAAT GGCAAAGACA GCCTTCAGAG AAGTAAAGGC AGTGTTAGTG GGCGGCACCA ATTCCGTGAT TGAGTCGTAT CTTCCAACAA TAAAAGTAAA TACAGCTAGT CCAGAATCGC TGAAAACAGT CCTAAAGAAT ATTTAA
Protein sequence | MGSLLNVDYN DELTRLRVHE IVKYFQRLGV PIKLSKISND IIADSFYVHY RTPILKDGPD KQEESLWHVF IKTYTSSDVY HEISKISRYN YQVSKSASVK LLRAYNSLLS RIERGAVEGF EDQKQDFRDL SENQQLRNEI SNLLRFYMGN VRNIEKLRKS MTKALGNEVG KETAELLFDI DIDPYRARLA KILESLVEML SAVKEEVDQG DVQERRGVIS GVTRIRTYSD LQKATNLSKA IYLQSRELFG YKLATKSLSI YDLALDTRDR VYLLVDKSGS MFYSLYDGVA MDMTQKITWA TALAIAVMKK SKRTVLRFFD QMVYPPITNV KDIIRSLLRV LPLGGTDITA AVHTAVRDAK QQSLHNYKLV IITDGEDDMI HPEVLKMAKT AFREVKAVLV GGTNSVIESY LPTIKVNTAS PESLKTVLKN I
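
The GC content field in the gene table is a percentage of G and C bases in the gene sequence. The sketch below shows one way such a value could be computed from a sequence written in space-separated blocks of ten, as in this record. It is an illustration only: the function name `gc_percent` is hypothetical, and how the source database rounds the reported value is not stated.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Toy input only (the first eight bases of the gene sequence), not the full record.
print(gc_percent("ATGG GCTC"))  # 62.5
```

To check the record itself, the full Gene sequence value would be passed in place of the toy string.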