Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1059 |
Symbol | |
ID | 5056257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 942836 |
End bp | 944041 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468615 |
Product | hypothetical protein |
Protein accession | YP_001153289 |
Protein GI | 145591287 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACCTG AAATTCTGCG GCTGGCGTGG CAGGCTTTGT GGGAAAGGAA GGGCCGCACC ATAGGGGCTG TGGTGGGGGT GGTCATCGCC TTCACCGCTT TGAGCTACGC CCTTCTGCTA GGCCAGACGT TTAAAGACAA CGTTGCTCAT TACTTCACTT CCAACTTCCA GATGAACGTG CTCTACGTGA TGGGGTCTCA GTTCACAGAT GCAGATGTCA GCACCATATC AACAATAAGC GGGGTGGAGC TGGCTGTGCC CATAACCTCG GCGAGGGGTG CAGTGAGGGT GCCCGGCACT TCGGGCCAGA CCCCCGTGAC CGTCTACGGC GTGCCCCCCT CTCTTATTTC CCAAGTTCTG CCCCCCACGT CGCTGTACGA CGGGGAGCTG ATTGTAGGGT CAAACCTCGC CATGGTGGGG TACTACGTGG CCTTTGACCG TTCTACTGGC CAGCAGAGGG TCGCAGTCGG GTCACCGCTA TCCCTCGCTA TCGGCAGGAG GTCAACCACT GTGGTAGCCT CGGGCATAAT GGCCACTGGT GCCTTGGGCT TTGTAGACAC TACGCGGGGG GTGGTGATGG ACATAAACAC CTTCCGCCAG CTTACCGGCA TCACCACCTA CAATCTCGTG ATGGTGTACC TAAAGGACGT ATCCCAGATA GACGCCGTTT CAAACGAAAT CAAGGCCAAC TTCCCCAACG TAGACGTGGT GTCGCCCCAG GCCATCCTCC AGACAATAAA CAGCTTCCTA ACCGCCTTCC AGCTCTTCCT CGGCCTCATC GCCGGGGTCA GCACCGTGAT CACCGCCCTT TGGCTATACG ACACCATGTC CATCAGCGTC GTGCAGAGGA CAAAGGAGAT AGGGATACTG AGAGCCCTGG GCTTTAGGAA GATGGACGTA ATGGCCATGT TCCTCGCCGA AGCCTTCATA ATAGCGGCTA TAGGAGTATT AGTAGGTCTC CTCCTCATAA TTCCACTGTC CCAGATGGGG CTACCGCTGT TAGGGGGAAT GCAACAACAG TCCATGTCGG CTGGCGGCGC CTTTAGGCCG CCCCAAGGGG GCTTTAACAT ATCGTCGCTT GTGCTAGACC CCGTGGTCTT GGCCGCTACC GCGGCGCTCG TGGTGGCGAT AAACCTAGTC GGTGCCCTCC TCCCAGCCTA CAGGGCAGGG AGACTCGACG TCGTGTCGGC GCTTAGGTAC GAATAG
|
Protein sequence | MLPEILRLAW QALWERKGRT IGAVVGVVIA FTALSYALLL GQTFKDNVAH YFTSNFQMNV LYVMGSQFTD ADVSTISTIS GVELAVPITS ARGAVRVPGT SGQTPVTVYG VPPSLISQVL PPTSLYDGEL IVGSNLAMVG YYVAFDRSTG QQRVAVGSPL SLAIGRRSTT VVASGIMATG ALGFVDTTRG VVMDINTFRQ LTGITTYNLV MVYLKDVSQI DAVSNEIKAN FPNVDVVSPQ AILQTINSFL TAFQLFLGLI AGVSTVITAL WLYDTMSISV VQRTKEIGIL RALGFRKMDV MAMFLAEAFI IAAIGVLVGL LLIIPLSQMG LPLLGGMQQQ SMSAGGAFRP PQGGFNISSL VLDPVVLAAT AALVVAINLV GALLPAYRAG RLDVVSALRY E
|
| |