Gene Information |
Locus tag | Pars_1041 |
Symbol | |
ID | 5055955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 927073 |
End bp | 927993 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640468597 |
Product | hypothetical protein |
Protein accession | YP_001153271 |
Protein GI | 145591269 |
COG category | [S] Function unknown |
COG ID | [COG4034] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
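The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, but both agree with the reported values. The function names are hypothetical, chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 927073
END_BP = 927993


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 921 306
```

Both results match the Gene Length (921 bp) and Protein Length (306 aa) fields in the table.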
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.966516 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTTTGT ATTTGGCGAT TGGCGGGGGT GGAGATGTGG TCTTCGCCGC TGCTCTAGCC GGGGAGGAGG CGGTGGGGCA GTTGCCCTGG GAGCGCTTCG TGGTGGACCC CGAGCCCGGG CCCGTGCCGC CAAGTGCTCT GCGCGAAGTA GTAGACCTCG GAGGAGGGCT ATACCTCGCT ACGCCGAGGA GCTACGCGGA GAGGGGCGGG AGGATATTCA AGACTCAGGG CATGTGCGTC GCTGAGGTTT TGAAAAAGTC CGTCTACCTG GCAGACCCCT ATAAAAAGCC GAGCGAGCTG TCCAAAACTT TGTCGCGGTT TAACAGAATC ATCGGCGTCG ACGTTGGGGG CGATGTGTTG GGCATTGGAT GTGAGGACTC TCTGAGGAGC CCCCTCTCCG ACGCCTACGG CCTGGCGGTG TTGGCAAAGG CAAAGGAGCT GGGCATCGAG GCCGAGGTCT GGGTCATGTC GCTGGGGGCC GACGGCGAGT TGAGCAGGGA GTATCTCATG AACAGGTTGG CCGAGGCGGC CCGGGCAGGC GCACTGCTTG GGTCTGTGGG TATCGGGAGG GCCCAGATGG AGGTCTTAGA GTCGCTGGTG GGGAGGTGCG TTACGGAGGC GTCGGCGGTG GCTGTGAGGG CATTCCGCGG GGAGCACGGG CCGTCTACGA TGCGCGGCGG GACTAGACAG ATCGTGGTAG ATGCCTGCGC CTTGGTGGGA TTCCGCTTCG ACCCATCGGC GCTTTTGGCC CTTAACAAGG CGGCTAGGCT AATTTACCAG CACGACGCTC CTATCGACGA GGCGGCGGAG ATCCTCTTAG CGCACGGGAT GCCGACAGAG CTCCACTTAG AGCGGCTTTT GGCAAAGGGC ATGACCTTAG GGGAGGCGGT GGAGGAGTTG AAGGGGATGA AGCGTTGCTA G
Protein sequence | MSLYLAIGGG GDVVFAAALA GEEAVGQLPW ERFVVDPEPG PVPPSALREV VDLGGGLYLA TPRSYAERGG RIFKTQGMCV AEVLKKSVYL ADPYKKPSEL SKTLSRFNRI IGVDVGGDVL GIGCEDSLRS PLSDAYGLAV LAKAKELGIE AEVWVMSLGA DGELSREYLM NRLAEAARAG ALLGSVGIGR AQMEVLESLV GRCVTEASAV AVRAFRGEHG PSTMRGGTRQ IVVDACALVG FRFDPSALLA LNKAARLIYQ HDAPIDEAAE ILLAHGMPTE LHLERLLAKG MTLGEAVEEL KGMKRC