Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2204 |
Symbol | |
ID | 5055971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1975543 |
End bp | 1976553 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640469756 |
Product | hypothetical protein |
Protein accession | YP_001154402 |
Protein GI | 145592400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.692047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGAGGA TCAAGTTGCG GCTGGCGCCC GGGCTCGAGG TCCTATTCAC CGATAGGGAG CTGGCCCTAA ACAAGGTAGA GGAGTGGGCA GCAAGCGGCA CTTTCCCGGT TCAAGTAGTC TTCGGGCCTG AGGGCTGTGG GAAGACGGCT TGGTTGAAAC AGTCTGCGGT GTTGTTGAGG AGGCTGGGCT TCGACGTGGT GTACCTCAAC CCATTGGAGA AGGAGTTATA TGCCGAGCTC GATTTGCCCG ACGTGAAGAG GCGCATAGCG GAGATTCTAA GAGAAGCCAC CGACGAGGCT TGGGCCAGAG CCGTCTGGGC CGCCGTGGAC CTGGCCAAGG AGCTCATCCG GGCGGGGAGG CGCAAGGTAG CCGTCCTGAC CGACGACGTG TTCCAAGCAA TCGGCTTCAA CAAAGCCGCG GTATATGTCA AAGGCCTCCT CGGCCTCATA GAGTACCCGC CGAGAAGCGT AGACGTCATA GTAGCCGTAG TGGCGACGAG TGAAGGCGTT ACGAGGAGAG AGATCGGGCG GCATGACTGG GCCGACCTAC TACCCATGTG GAACATGCCG CGCGACGGCT TTAAACAGCT CTACGACCAG ATACCCGGCG AAAAGCCCCC CTTTGATGAG GTGTGGAGGA CGACTGGCGG AAACCCTCGG TTGTTACAGG TGCTCTACAA GGCTGGGTGG GACGTGAAGG CGGTGTTACA GAGGGTCGTG GATAGACGAA AGATAGAAAC TCTAGTCAAC GGCCTAGGCT ACGGCGAAAG GGAGCTCTTG AGGAGCGCCG TCGAGGACCC CGATTCGCTG CTCACTAGAG AAGGCATACC CCTAATGGAC AAGCTGGTGG ATCTAAACCT AATTGTCGAC GCGATACCTC CGCGAGACCC CCACTTCTGG GCCGGGGAGC CACCGCCCGA GAGGGATCCC GAGTTAGGGA TCGGCAGACG GGTCGCCTGG CAGACGCCGC TGCACAGAGA GGCCGTGAGG AGCGCCCTGG AAAGCGCATA A
|
Protein sequence | MERIKLRLAP GLEVLFTDRE LALNKVEEWA ASGTFPVQVV FGPEGCGKTA WLKQSAVLLR RLGFDVVYLN PLEKELYAEL DLPDVKRRIA EILREATDEA WARAVWAAVD LAKELIRAGR RKVAVLTDDV FQAIGFNKAA VYVKGLLGLI EYPPRSVDVI VAVVATSEGV TRREIGRHDW ADLLPMWNMP RDGFKQLYDQ IPGEKPPFDE VWRTTGGNPR LLQVLYKAGW DVKAVLQRVV DRRKIETLVN GLGYGERELL RSAVEDPDSL LTREGIPLMD KLVDLNLIVD AIPPRDPHFW AGEPPPERDP ELGIGRRVAW QTPLHREAVR SALESA
|
| |