Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0904 |
Symbol | |
ID | 5054634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 800795 |
End bp | 802231 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640468461 |
Product | hypothetical protein |
Protein accession | YP_001153137 |
Protein GI | 145591135 |
COG category | [S] Function unknown |
COG ID | [COG2855] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAACAT ATGCCATGGC TCAGCAACGA AAGATTGATT GGAGTTCTTT GTGGAAGAAA GAAGATTGGT GGGCCCTGTG GCTTGGGCTT TTTGTTTTCT TACTGGCGTG GTTCCTCCTA TTGGGCTGGG TGCCTAAGAC CAGCGTGTGG ATTGACCCAT CAAAAAGTAT ATCAACAGCG AGCAAGGAAT TTGCATACCT CGGCGGCTGG AGCCTCATAT TGCTCTACTT CTTCACCCTA GTGGTGTTGT CCATAGCGGC TGCGCTCATG AAATACGACG TGAAGGCGTT TGCAGTGGGC TATACCGTTA TTTTCTGGCT TTCCTACCTC ATGTGGTGGT TTAGCAACTA TGCTTACATC GCCGCCACTC CTGACGTATG GCCTAGATAT GGAATTAACT GGAGCCTCAG CCTAACTGGC GAGGCTGGCT GCATATTTGC ATTAGTTCTC GGCCTTATAA TAGGCAACAC CGTGAGGAAG CTACCCAAGC CGCTTGAAGT AGCAGCCAGG CCTGAGTGGT ATATCAAGAC TGCCATAGTC CTACTAGGCG CAGTGGTTGG CGCAAAGGCG CTTCAAAATA TGACTGTCGC CGCTGAGGTT TTAACTAGAA GCCTCATAGC AATTGTAGCT GCGTATTTAA TTTACTGGCC AATTTCATAC TTAATATCAA GAAAAATTGG CTTAGATAAG CAGTGGGCTG CGACGCTGGC TTCCGGAGTC AGCATCTGCG GAGTCTCTGC GGCCATAGCC ACTGCGGCGG CAATTGGGGC CCCCGCCGTG ATTCCAGGCA CAATTGCATC TATCATAGTC ATATTTGCAG TAATCGAGTT GATAATTCTC CCCTGGGTGG CGGCTCAGAT TTTGACATGG GCCCCTTTGG CCGCGGGGGC GTGGATGGGT CTTGCCGTTA AGACGGACGG AGCTGCGGCA GCCTCAGGCG CTGTTACAGA TGCATTAATA AAAGTTAAAG TGCCTGAAGC CGCAGGATGG GTCACTGCAA CGGCGGTGAC TGTTAAAGTA TTTATAGACA TTTGGATTGG ACTGTGGGCA TTTGTACTAG CCCTGTGGTG GGTTACGAGA GTAGAGAGAA AGCCAGGAGA GAAGGTTCAA GCCGTAGTCA TTTGGTATAG ATTTCCAAAA TTTGTAATTG GCTATTTCGT TACAATGTTT GCAATATTAG CCCTGGCCTC TTTCATACCT ATTAAAGACG CCATTTCTCT TGCCAGTGCC GTAACGGGAC AGTCGGACGT GTTAAGACAG TTCTTCTTCC TCATAACTTT CACCTCCATA GGGCTAACTA CTAACTTTAG GAAGTTTAAA GAAATCGGCG CCGGCAAGGC CGTGGTGGCT TACTTTATAT CCTTGCTGGT TATAATATTC ATAGCCTTAG GTCTCGCCGT GGCTTTCTTT GCAGGATTGC CTCTGCCCAA GTCCTAA
|
Protein sequence | MLTYAMAQQR KIDWSSLWKK EDWWALWLGL FVFLLAWFLL LGWVPKTSVW IDPSKSISTA SKEFAYLGGW SLILLYFFTL VVLSIAAALM KYDVKAFAVG YTVIFWLSYL MWWFSNYAYI AATPDVWPRY GINWSLSLTG EAGCIFALVL GLIIGNTVRK LPKPLEVAAR PEWYIKTAIV LLGAVVGAKA LQNMTVAAEV LTRSLIAIVA AYLIYWPISY LISRKIGLDK QWAATLASGV SICGVSAAIA TAAAIGAPAV IPGTIASIIV IFAVIELIIL PWVAAQILTW APLAAGAWMG LAVKTDGAAA ASGAVTDALI KVKVPEAAGW VTATAVTVKV FIDIWIGLWA FVLALWWVTR VERKPGEKVQ AVVIWYRFPK FVIGYFVTMF AILALASFIP IKDAISLASA VTGQSDVLRQ FFFLITFTSI GLTTNFRKFK EIGAGKAVVA YFISLLVIIF IALGLAVAFF AGLPLPKS
|
| |