Gene Information |
Locus tag | Pars_1111 |
Symbol | |
ID | 5055485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 997811 |
End bp | 998860 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640468667 |
Product | hypothetical protein |
Protein accession | YP_001153341 |
Protein GI | 145591339 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
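The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon; the helper names `span_length` and `coding_codons` are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 997811
end_bp = 998860
gene_length_bp = 1050
protein_length_aa = 349


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1050
print(coding_codons(gene_length_bp))   # 349
```

Under these assumptions the reported gene length (1050 bp) and protein length (349 aa) are consistent with the listed coordinates.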
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000841175 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTGGGGGT TGGGGGGCCG GCGGGGTTTG TGGGGGTTTG TTTTTATTTG GGGTGTTGTG TGTTTTGTGG ATGAGTTGTT TGTGTTGGCT GGGTTGCTTG GGGTGTTGGG TGAGAGTGCG GGGGAGCCGT CTGTGGAGCT TGTGGGGGCG CGGGGGGAGG GGGGTGTGTC TTCTTGTGGG TACTACGTGG TGAGGCGTGG GTTTGGGGTG CCGCCGGGGG ATGTGCATGG GTTGGATAGC CACACGTCGG TAGTGGAGTT TGAGGGGGTG TCTGTTATTG TGGCTACTGG GGCGTTGGTG GGGCGGGTTT TGGCCACGGT ACCGGGGGTG GCGGCGAGGT GGCTGGGGGT TAGGCTGAAT TTTAGGGGTG GGGTTGAGTT GCTGGAGGGT TCTGGGCTGT ATTACAGGTC GGAGGTGCTT GGGGTGCCTT TCGATGCTGG GTTTGACTTG GAGGCGGCGA GGGATGAGGT GAGGTACCAC GTGGAGGGGG TGTTGGCGGG GGTGTGGGAC GGGGGTGGGG TGTTGCTGGT GGATGGGCCG GTGTTTAGGG TTCCCGACGT GTACCAGAGT GGTGGGGGGT TTTTCCGGCT GTATCTGGAG TTGGCTAGGG CGAGGGCGGC GTTGCTGAGG GGGGCGGTGG GGGTTGTGAA GAGGGTGGAG CGGTCTCGGT ACTTGGCGAG GTGTGCCGGG GTTGGGTCGG ACGACGAGGT GGCGGCGCGG CGGCTTTTGA ACAACTCGCC GGGGTACGTG GGGCCGGTGG TGGTGGAGTG GGAGGGGCTT CGTAAGTACT TGTTCTACGT GGCTGTGCCT GCGCCGAGGG GGGTTCGGGT GTTTCGGGTG GAGGCGCTTG AGGAGGGGCT TGCGGAGGAG GCGGCGTCGT GGCTGGGGTC GCTTTCAGAC GCCTCTGGGT TGCCTCTGCC GCTGGCTGTG GCGGATAGGG TGGCGCGGCG GCTGAACGCG GCGGCTGTGA AGCTCCTCTA CGCCGCTTCG CCGGTGGAGC CGACGTACCG GGGGCTTGAG GTGGTGCAGG CGGCGCTGGG GGAGCTGTGA
|
Protein sequence | MWGLGGRRGL WGFVFIWGVV CFVDELFVLA GLLGVLGESA GEPSVELVGA RGEGGVSSCG YYVVRRGFGV PPGDVHGLDS HTSVVEFEGV SVIVATGALV GRVLATVPGV AARWLGVRLN FRGGVELLEG SGLYYRSEVL GVPFDAGFDL EAARDEVRYH VEGVLAGVWD GGGVLLVDGP VFRVPDVYQS GGGFFRLYLE LARARAALLR GAVGVVKRVE RSRYLARCAG VGSDDEVAAR RLLNNSPGYV GPVVVEWEGL RKYLFYVAVP APRGVRVFRV EALEEGLAEE AASWLGSLSD ASGLPLPLAV ADRVARRLNA AAVKLLYAAS PVEPTYRGLE VVQAALGEL
|
| |
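The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG can serve as an alternative start codon that is read as methionine. The sketch below illustrates this on the first 30 bases only. The codon dictionary is deliberately partial (just the codons in that prefix), the start-codon set is a subset of those allowed by table 11, and the function name `translate_prefix` is hypothetical.

```python
# Partial codon map: only the codons that occur in the 30-base prefix below.
CODONS = {
    "GTG": "V", "TGG": "W", "GGG": "G", "TTG": "L",
    "GGC": "G", "CGG": "R", "GGT": "G",
}

# Subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short coding prefix, reading an initial start codon as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as Met
        else:
            protein.append(CODONS[codon])  # KeyError if codon is not in the partial map
    return "".join(protein)


# First three 10-base groups of the gene sequence from the record.
gene_prefix = "GTGTGGGGGT TGGGGGGCCG GCGGGGTTTG"
print(translate_prefix(gene_prefix))  # MWGLGGRRGL
```

The result matches the first ten residues of the listed protein sequence. Translating the full gene would need a complete codon table, for example from an established library, rather than this partial map.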