Gene Information |
Locus tag | Pars_0030 |
Symbol | |
ID | 5054620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 22809 |
End bp | 24329 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640467610 |
Product | hypothetical protein |
Protein accession | YP_001152299 |
Protein GI | 145590297 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
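The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 22809
end_bp = 24329

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1521 0 506
```

Under these assumptions the result agrees with the listed Gene Length (1521 bp) and Protein Length (506 aa).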
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTAGGAG CCATAGCTAA ATGGCAACTG CGGGGAATTG AGAAGACCGA CCTCGATCCG CTGAAGTTTC GCAATACAGC AAGGCTCTTC ACCACGTTGG GTCTAGGCGT TGTAGCCGCA GGAGCCGTGT TGCTGGCGTT AGGCTTCCCG CTTGGCCCTC TCATCCTGGG CACCGGCGCT ATGGTGTTTA TGATGCCGAA GTTGATTGTG TCTGTACGAT CTGGATCGAT ACGCGAGATG CTTTCCACAG AGATGGTGTT CTTCACAGCG TTGGTGCAAA TGGGCTTTGC CACAAAGGCG CACCTCAACC TCCTTATTGA GCGCATGACG CGGTACAAGG AGCTCCCTGG CGTTAGGACT TTGGCTATCG CCGCGTGGAA CAACGCCAGG ATGATAGGGA TGGAGCCTCT AGACTCGATG AAGAGAGCCG TGGAGAAGCT AGCCCCCCAA AAACTGGTGT ACCGGTTCAA CTCTCTCTAC ACGTCGCTGA GGATAGGCGA GGACATCGTA GCCAAGCTTT CCCTCTACAT GGATATGGAC ATCGTGGAGT TCTCAACCGC GATGCAGAGG CGTATGGATA GGCTCACCTC CTTAATATCC TCTCTCGTCG TCGGCCTTGC CATGTTGTCG GTGACAGCCG TGGTTATAGG CAGAGGAAAC CCGGCAATGC TTATACTAAT GGGCGGCCTC CTCCCCGCCG TCCTCGGCGT CGTGATGGCG CTGTTTATTA ACGTACCTCT CATGAAAATG GACTTCCGCC TGCTCCCCCT CATCTTTGGC ATAGTCTCAT CTGTGGTGAT GATTGTCATA GGCCTCACGC CAATTGCCCA GTACGCCCCC TTGCTCGCAT TCGCCGTTGC GCTAGCTGGA TGGGTAATTA CTCGCGAAAA GATCGAAGCT AGAGCCTTTG AAAGGGGGTT TATGGACTTC GTCTATACGG TATTCGACGA GTTGAAGAGG GCGCCCTCTG TCTACCGCGC CGTGGAGAAC GCAATAACCT TTGGCGACTA TGGCCCCTTT AACAAGAAGG CAGCTGCCAT ACTCCAGACC ATGAAGGTGG GGGATCACAA GCTTGAAGAC GTTGTGTTAA AAGATATGCA ACCCGTAATG TCCGTAGTTC TCAGGATGCT GTTTGATATC TACCGCCTCG GCACACTGCC ACGCGCAACA ATTGACCAGC TCCAGAACTT TGTGGTGAAG CTCTTCGAGT ACAGAAACGA AATAGGAAAA ACACTTAACA TAACCAGGTT TTTAGCACTG GCCGGCGCGG CTATGGTGGC CTTTGTAAAC ACCTCGATGA TAAAGCTGAC CGAGGCTATG TCCAAGATCT CGGGGGGCGC AACGCTGGGC GGCGTCGGGA TGAGTATGTT ATACTTCGCG CTTGGGATTA TGGCCATCGG CTACTACTTC CTATTCTCTA AGATAAGCTT CTCAACCAGA GGCGGCTTGT TGTATCTGGC GATGTTATTC CTAGCTATCT TCGTGGCTTC TTTCGCCGTA GGGGCGTTCT TAAGAGGGTG A
|
Protein sequence | MLGAIAKWQL RGIEKTDLDP LKFRNTARLF TTLGLGVVAA GAVLLALGFP LGPLILGTGA MVFMMPKLIV SVRSGSIREM LSTEMVFFTA LVQMGFATKA HLNLLIERMT RYKELPGVRT LAIAAWNNAR MIGMEPLDSM KRAVEKLAPQ KLVYRFNSLY TSLRIGEDIV AKLSLYMDMD IVEFSTAMQR RMDRLTSLIS SLVVGLAMLS VTAVVIGRGN PAMLILMGGL LPAVLGVVMA LFINVPLMKM DFRLLPLIFG IVSSVVMIVI GLTPIAQYAP LLAFAVALAG WVITREKIEA RAFERGFMDF VYTVFDELKR APSVYRAVEN AITFGDYGPF NKKAAAILQT MKVGDHKLED VVLKDMQPVM SVVLRMLFDI YRLGTLPRAT IDQLQNFVVK LFEYRNEIGK TLNITRFLAL AGAAMVAFVN TSMIKLTEAM SKISGGATLG GVGMSMLYFA LGIMAIGYYF LFSKISFSTR GGLLYLAMLF LAIFVASFAV GAFLRG
|
| |
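The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first few codons. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid ordering (translation table 11 assigns the same amino acids as the standard table and differs in its permitted start codons), and it assumes the first codon is a valid initiator and is therefore read as methionine. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding sequence; blanks between blocks are ignored."""
    dna = "".join(dna.split()).upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if residues:
        # Assumption: the first codon is an initiator (here GTG),
        # so it is read as methionine rather than valine.
        residues[0] = "M"
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "GTGTTAGGAG CCATAGCTAA ATGGCAACTG"
print(translate_cds(first_blocks))  # MLGAIAKWQL
```

The output matches the first block of the listed protein sequence. Applied to the full gene sequence, the function would also emit a trailing `*` for the terminal TGA stop codon, which the record omits from the protein.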