Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1764 |
Symbol | |
ID | 5055674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1582955 |
End bp | 1585330 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640469309 |
Product | hypothetical protein |
Protein accession | YP_001153967 |
Protein GI | 145591965 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1361] S-layer domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0673103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.000629319 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAACCGA TATATCGGTT AATATTAATA ATAGCCTTTG CAATGATAGC GAATGCCCAG GTAGCTACGC CTAGTATGCA GCAGTCTTCT TTTAACGTAG GCGTATCGTA CTCCCCGCCG TATATATATC CCGGCTCTAT TGTCCAACTA TATATAACAG TTATATCTAT ACAACCTGTG TCTAACTTAT ACATAGATAT AAATTCCCCT TTTAAAGTAT TAACAGGCTC TACAATTCAA ATAAAACAAA TATCACCCGG CACGCCGGCA ACGTTCACAG CAGTTATTCA GGTACCCCTC GACGCGCGGC CTGGATACTA CACTATTAAG GTAGTAGCGT ATACTCTTAT GTACTCGGCA GAGTCAAGCA TCGATATTGA AGTGCTGCCC TTTGACTTCT CTTCTCTTGT CGTGGCAAGG CCTATGGCTT ATCTACCTGG CCAGCCTGTT CAGCTACCCG TTTTGCTATT TAACCCCACG GCTGACTACA TAAGAGCCCG CGTGGCAATA AATGGGTCTG CCGTTTCACA ATACCTAAAC TCTTCGCTTT CGTGCGATAC TCTTATACCT CCTAGGTCCA ACTCTACGTG TTTTCTAAAC TTTATAACAC CTAGCGGGTT GAAGCCGGGG TTCTACAACG CTACCTTAAC AGTTACTTTG AGTAGCATAT CGGGGTACGC CGGTGCTGTA ACTTTTAGAA AGACGGTACA ACTGCCGATC ATCAGCGGCG TTGACGTGAA CGTGATGGCG TCGCCCACGC AGCCTGTAGT TGTAGGCTCC CCCACATTGG TGACGTTGGC GATATCTCAG GGAAGTTCAG TGCCTTTGCA AAACGTCACT ATTCGAGTCC TAAGAGGCGA TGGTGTAGAG GTCTTAAACT CCAACCCAGT TACCGTACCT TTGCTCACGC AAATACAGCT ACCCGTCCAG CTTGTAGTAA GTAAGTACGG GAATGTGAAG ATACCTGTGG AGATATGTCA CTACTCGTCC TGCGGTATTA AATACGCCGA GTTGTATGTG CCAATCCCAG GGCTTTCCGT AAATGCTGTT TTTACCCCGT TTCAAGGATA TCCAAGCTCG TTGGTGCAGT CTACGTTCAT CATAGCGACG AACTACACGA TATCCAATGT CGCCGTGGAA ATCCGCGCCC CATTCAAAGT GTTGACAAGT CCCGCTGTTT CGTTGCCGTT TCTGTCTCCC CAGTCTCCCG CCACTATAAA CGTGGTTTTT GAAATTCCTA AAGAGACCAA GCCTGGGGTT TACCCCGTCA ATGTGACAGT AGCCGGCGCC GTGTATAAGT TTTACTACGA AGTATTGAAA CCTGAATTTA GCGTCGTGGC GACGTTTAAC CCGCCGGTAG CGTATCCAGG TGCCGTTGTT TCCGGCAACC TGGTCGTGAC TTCTCCTTTT AATGCAAAAG ACCTAGATAT ATCCATATCG ACTCCTCTAT CTCTCATATC TCCGGCAGAG TACCGCTTAC CCTATCTGCC GCAGGGACAG CCATACACGG CTTATATAGT CCTGCAAGTG CCTGAGGGAA CGCCGCCTGG GAAGTACCCC CTTACGGTGA CTGTAAATGG GGAGAACTAC ACGTTCTACG TCAACGTCGG CGCGCCAAGT GTGGTAATTC AAAACGTAGT AATTACCCCT CCTAAAATAC TTGAAGGCAT TGCTACGGCG CAAGTGGCTG TACAAGTCCT AAATACGGGC CCCGTCGTTG CGCGGAATGT CACAATTACA TTGTTAAACT CTACTATTGG GAAGCAGAGC TATACGTTGG ATTACCTCCC GCCCGGATCG CCGATCACTT TGACGTACTA TGTCGATATT TCAAAACTCC CACCTGGCCA TTACGACGTT GTTGCGCAAG CTAGTTGGAG CGGCGGCGTT TACGTAGCTA GGGGGTCTCT AGATGTGGCT AAGAAAAACG CCCTTAAAGT GGACTACAAG GTATACAACG TCGCGCCAGG CTCAACAGCG GTTTTAGTAC TTAACATAAC GAATCTAGGC CCCGACGATG TTAAGAATTT AAGAGTGTCG TTTACCCCCT CGCAAGTATT TGAACTACAC GCCTCTAATA TCGCAGACGT GGCGACTGCC GGCGTTAGAA TGCTGGGAGA CCTGGCGCCG GGAGCCTCTG TCAGTACTGC GTTTTTACTA GATGTGTCCG ATAAGGCGGT TCCCGGGATC TACGCTATAA CGCTGGTGGC TACGTGGAAC CAGACAGGCG TCTTTATGCC CAGCGTGCAG TATATAAACA TCCCTATTGA AGTGAAAAGC GGCATCGACA TGTTTGTCGT TATACCGCTT GTCTTGACCA TATTGCTCAT CGTAGTGGGG CTGGTTACTG TAGCGCGTAG AAGGCGCCGT GGATAG
|
Protein sequence | MKPIYRLILI IAFAMIANAQ VATPSMQQSS FNVGVSYSPP YIYPGSIVQL YITVISIQPV SNLYIDINSP FKVLTGSTIQ IKQISPGTPA TFTAVIQVPL DARPGYYTIK VVAYTLMYSA ESSIDIEVLP FDFSSLVVAR PMAYLPGQPV QLPVLLFNPT ADYIRARVAI NGSAVSQYLN SSLSCDTLIP PRSNSTCFLN FITPSGLKPG FYNATLTVTL SSISGYAGAV TFRKTVQLPI ISGVDVNVMA SPTQPVVVGS PTLVTLAISQ GSSVPLQNVT IRVLRGDGVE VLNSNPVTVP LLTQIQLPVQ LVVSKYGNVK IPVEICHYSS CGIKYAELYV PIPGLSVNAV FTPFQGYPSS LVQSTFIIAT NYTISNVAVE IRAPFKVLTS PAVSLPFLSP QSPATINVVF EIPKETKPGV YPVNVTVAGA VYKFYYEVLK PEFSVVATFN PPVAYPGAVV SGNLVVTSPF NAKDLDISIS TPLSLISPAE YRLPYLPQGQ PYTAYIVLQV PEGTPPGKYP LTVTVNGENY TFYVNVGAPS VVIQNVVITP PKILEGIATA QVAVQVLNTG PVVARNVTIT LLNSTIGKQS YTLDYLPPGS PITLTYYVDI SKLPPGHYDV VAQASWSGGV YVARGSLDVA KKNALKVDYK VYNVAPGSTA VLVLNITNLG PDDVKNLRVS FTPSQVFELH ASNIADVATA GVRMLGDLAP GASVSTAFLL DVSDKAVPGI YAITLVATWN QTGVFMPSVQ YINIPIEVKS GIDMFVVIPL VLTILLIVVG LVTVARRRRR G
|
| |