Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aboo_1226 |
Symbol | |
ID | 8828188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Aciduliprofundum boonei T469 |
Kingdom | Archaea |
Replicon accession | NC_013926 |
Strand | + |
Start bp | 1174107 |
End bp | 1178972 |
Gene Length | 4866 bp |
Protein Length | 1621 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | CUB protein |
Protein accession | YP_003483596 |
Protein GI | 289596900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.287366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGTG GAAGATTTGG TAAATTGGGG AGTATCCTAG TGGTAACTTT AATGGTTTTA ACAGCGTTTG CAGTGATAAT CAGTGTACAA CCATCCGTTC AGGCAAAGCA AGCAAATATG CAGGGAGTAC ACTTCTTATA CAGGGATGTG CACATAAATG TAAAAGATTT GAAATATGTG ACAGGCCCAA ACGGTGGGAG ATATATTGTA GGCAAGGGTA TGAGGGAATT AACTAACCCT GGAGATCCTG CAGTACCTGT GAAAATAATA AGCTTTACCT TGCCTGCAGG TGCAAAGAAC ATAAGAGTCA ATTTGCAAAA TATATGGATG ACTTCTTACG GAAAGCTGAA AATATCTCCA ATACCTGCAC CTGCATTGAA ATCAGGAAGG GCTTTCCCGG CTAAGTTTTC ACCACCAAAA TACAATGAGA AAGTATACAA GTCCAGCAAA TATTATCCCG ATAAAAACTA CGATTTTACA ATAAGCAAGA CAATGGATAA GACGATAGTG AATGTTTACA TATATCCTGT AAAGTACAAT CCAGTTACAA ACGAAGTAAA AGTAATGACC CATGCAAAGG TGGTTGTTTC CTATTCTCCA GGCTCTCTTA AGAGCGGCAG TGCTCCTATT AATGTAGAAA ACATAATAAT CACCTCTCCA CAGCTAAAGG CAGCAGCGCA GAGACTTGCA GATTTCCATA ACAGCACAGG AGTAACATCA TGGGTAGTTA CAACTACTTG GATTGCAAAG AACTACCAGC CTGCACCGAA TCCGCCAGTA AACGGATATG CAAATGCTAC CACAGATCCT TATGGTTTGC TCGGTGGTCA GGCCCTACCT CTGGAAAAGA ATATGATAAT AGGATACAAT TATACATTGG CAAAGAAAAT AATAGCATTC TTGCAGAATG AGAGTTATGG GAGTAATGTA GAATACATAA CAATATTTGG TAATGCAAGA GCAGTACCGC CAAGCTATTA CTGGACTGAT CAGTATATGT ACCTACTGGC ATATTATGGA TTATCAGATA TGTATGATGC ATGGATACCC ACGGATGCTT TCTATGCCCA GCCAAACTAC AATTCTACAT ACTTCAGCTA TGAGCCACAA TTCTTCATAG GTAGAATTCC CGTTAATCCT TTGACTGCAA ACAAGGTTGT TAATAAAATA ATATATTACG CAACTCATAA AACTGCAGGT ATAGAGAATG TCACACTCTC AGGAGGACAG GTATTTGAAA CCCCATACTT CCTTGGAGAG ACAGGAGTTC TAGAGCCACT TAACTATGGA TGGCTTGATG GAGCAAAGGT CACTGAGTAT TTCCACACAC TAAGAAATTA CACATATAAT AATTTCATAA AGATGATGTA CAACAGCGAT ATGATAATTG AAATAACCCA TGGCTCAGGA TTTTCATTCT GGCACCACAA CGATGAAGTT AGTGCTTGGG ATTTCAGCAT GAATTCATCC TATGGAAGCC TGCCAATTTA CATATCTGGC TCTTGCCTAA ACGGTGCTTG GGATGAAGAG ATGTATCCTT CAGAGTTTGC ATCTGGAATA AATGGCGGTA CATCAATAGC TGAAAAGATG CTTTATGCAC CAAATGGCAT AATTGCATAC TATGGAGCAG ATAGAGAGGC ATATGGAAGC ACTTTTGCCT ACTTTGATAA TGGGACATTG GTAGCTCCAA ATGACTTTGG GGACTTGATA ACAGAAGATG GTACTGTGGC AGGATATTTC CTAGCTACAC ACTATTATGG TTATGCAACC CTTGGTATGA TGTACTATTA TGCTCTGAGT CTTTACAACT ATTGGCTAGG TGAAAATCTC ACAAGTGTTG ATCCATGGGA TACTGGAATG AATGATAATC CTTGGGCAAG GTCTTACTTT GAATACTCAC TCATAGGAGA TCCCGCATTG AAAGTATCTG GAAACGGAGC CAGTTATCCA TCTTACAATG TGCCAAGTGT TATAATACCC AACGCGGTTT ATAATTCAAA TGATGTGCCA ATAATTACAA GAGGAAAGGA AGTAAGCGTG AATATAAGTA CCGATTCTCC AATAGTTAAA GTTGAGCTTC TTTATCTAGA GCATGGATAT GGTTCTGGTC CTTGGGGAGC ACAGCCAACT TATTATGATT TCATACTTGA TATGAAAACT TTGCATCCTG TGGCGACAAA CAATGGGGTA AATAACTTTA CATATTCGTT TGTGCCAAAT AGAGAAGGAA CCTATATACT ATCCGTATAC AGTGCAGATG GAAAGAACAC AAGGTTCTAT ATGTCTTGCA GCTCTCCAAT GCCACCTCTG AGTGCTACGG AAAGAACAAA TTATAATGGG AGAGCCTATA ATGAGAATAT AAAGAGTACC GCCCCAGATG TTCAGGTTAT AACTCTTCTA GACTATGGAG ATGTTGCATT TGGAAAAACC ACGAATGTTA CTGCGGTTGT ATATAATTCT GGAAATGCTA CAGCAACTAA TATAACGGTT CATTTCTATC TGGAGAACTA TGTTCTTATA TTCCAAGAAG GTAACCTAAG CCATCCCTTG GTCTGGCTTG GGAATGCAAC TATATCTTCT CTTGCACCTG GTGAAGTTGC TTATGTAACG ATACCTTGGA AAGCTGTTAA CTTAATACCT TTGGGAATGG ATCCAACAAA TTCTTCAGTT AGGTGGCAGT ATGTAGTTGC GAGTGCTGAT GTCGCTGGCG ATACAAATTT GAATAACAAC GCGATGTGGG CTTTGTTCCA TGTGCATTTG CCTTTAGATG TCTGGGTTCA AAAAGTATTC ATCCAAAAGG ACCCAGTTGT AGGAGAGCCC AATAGTATCA CATTTGAAAT AACAAACATA GGCACAACAA CCACTGCCAC AAGCAATGTA ACTGTTATGG ATTACTATGG TACAATAAAA ACAGTGTCTG TGACTCTTGC ACCTGGACAA ACAAAGTTCA TAACCGTGCC TTGGACACCA AAGGTTGCAG GTGAGGATAT AATTGAAGTA ATGGCAAGCA CTCCAGGAGA TCAGAATCCA AGGAATGATG TTAGCTTTTA TTATCCAAAT GGATACTGGT CAATTGCAAC TTTCAAGGTA TTGAGTTACG ATGTAGAGCC AGTTATGGCA TATCAGCGTG ATGATAATAT AAATATTGAG TTATACAACT TTGGTCCCTT AGCTTCTAAT GGTACTAAAG TTGATCTGTG GGAAATGGAA AATGTTACAG ATATAAATAT ACAATCTCCG CATCCATATC CAAACAACTA TGATAATGTT TGGACAATAA CTGTTCCAAA TGCAACAAAG ATAGCTTTGC ACTTTAATTA TCTATACGTT GAGCCTGGAT GGGATTATGT GTATATTTAC AACTCCACAG GCAAGCTACA AGTATCTTAC ACCGGCTTCT ATAACGATCT GTGGACTCCG TTTATAAACG GCTCTACTGT GTATGTAGAG CTAGTATCTG ATGAATCTGT TAATTATACT GGATTCTATA TAGACGCATA TTCTACAGGG ATCTCTCATA TTGGTAGTAC CCAGGTGGGT GCTATAGGTT CCGGAGATTA TATAACTGCT ATTATGTCTC TAGGTAATAC TACTGCTGGG GTACACTATT ATAAGATATC TACAAATACT CCGGGCGAGA GTGCAACACT GACCACAGGA GGTTCAAGCA ATAATATTAT TTATCAAACA ATTAATGTTA CAGATACAAT AGCTCCCCAG ATAGAATCTT TCCATTTGGC GAATACCATT ACAAATGATA GAAGTCCAGA GCTTAACTTT ACATGGTACG ATGAAATTTA TAGTGGATTC TTCAAGGTGA GTGTACAGAT AGATGGTATA ACAATACCTG CGAGGGTAAT TGCTAATGGC AATAGTGGAA ATGTAATTGC AAATGTTCCA TTCCTGCTAG CTGATGGAAA GCATACTGTG CAAATAACTC TTGTAGATAA TGGAGGGAAT AAAGTTACTG AAAGTTGGAC ATTTACAGTG GATGCTACAC CGCCAAGTTT GAAGATAACC ACATCTACAA ATACTCCTAT AACATATACA TCTACCTTCT GGATAAATGG AACCACAGAA CCTGGTGCAA CGGTGACCAT AAATGGAGTA AATGTTCCTG TTGATCCAAA TGGAAACTTT GCATATAAGA CTACCTTGGT CAACGGTGAA AATGTGTTCC GGGTTGTGGC CACTGACGAG GCAGGCAACT CTGCCACGGA AATAGTTACT GCTTTATATT TGCCTCAGAT ACCAGAAATA TTGAATGCGA TTAATGCTAT TAACTCGGAG ATAAGTAATT TACAGAGTCA GGTAACTACT ATGAAAGATG AAATCTCAGC TCTCCAAGGC AAGATATCTA CTATCCAGGG TGACATATCC ACAATTAACA GTGATATAAA TAATATAAAA TCCCAGCTTG ATACCCTAAA TTCTAGAATT AATACACTGC AAAATGATTT AAAAGAGAAT GTTACTGCTC TTAACAAAGC AATTGAGGAG TTAAATACAA CATTGGTAAA CAAGATTGAA CAGAACATAA ACAACTTGCA GAGCCAGATT AACGATTTGA AGAACCAAGC AAATGATTTA CAGAGCAATA TACAGCAGAT TAACACAAAT ATAAGTGATA TACAGAGTAA GAACAATGAG CAAGATAAGG CAATAAGCTC TAGCAATAAC ATAGGTTATG CAGGTATAGT TTTGGGAATT ATAGCACTGA TAATAGCGAT AGTGGCGGTT GCAAGAAAGC CTAAGGTACC CATGCAGAAG GAGGAAAAGA AAGAAAGCAC TAAAGAAAAT CCTAGTGAAG AAAGTGAAGA AGATATAGAT GAAACTGAAG AGGGCTCTGA GGAAGAAAAA GAATAA
|
Protein sequence | MKGGRFGKLG SILVVTLMVL TAFAVIISVQ PSVQAKQANM QGVHFLYRDV HINVKDLKYV TGPNGGRYIV GKGMRELTNP GDPAVPVKII SFTLPAGAKN IRVNLQNIWM TSYGKLKISP IPAPALKSGR AFPAKFSPPK YNEKVYKSSK YYPDKNYDFT ISKTMDKTIV NVYIYPVKYN PVTNEVKVMT HAKVVVSYSP GSLKSGSAPI NVENIIITSP QLKAAAQRLA DFHNSTGVTS WVVTTTWIAK NYQPAPNPPV NGYANATTDP YGLLGGQALP LEKNMIIGYN YTLAKKIIAF LQNESYGSNV EYITIFGNAR AVPPSYYWTD QYMYLLAYYG LSDMYDAWIP TDAFYAQPNY NSTYFSYEPQ FFIGRIPVNP LTANKVVNKI IYYATHKTAG IENVTLSGGQ VFETPYFLGE TGVLEPLNYG WLDGAKVTEY FHTLRNYTYN NFIKMMYNSD MIIEITHGSG FSFWHHNDEV SAWDFSMNSS YGSLPIYISG SCLNGAWDEE MYPSEFASGI NGGTSIAEKM LYAPNGIIAY YGADREAYGS TFAYFDNGTL VAPNDFGDLI TEDGTVAGYF LATHYYGYAT LGMMYYYALS LYNYWLGENL TSVDPWDTGM NDNPWARSYF EYSLIGDPAL KVSGNGASYP SYNVPSVIIP NAVYNSNDVP IITRGKEVSV NISTDSPIVK VELLYLEHGY GSGPWGAQPT YYDFILDMKT LHPVATNNGV NNFTYSFVPN REGTYILSVY SADGKNTRFY MSCSSPMPPL SATERTNYNG RAYNENIKST APDVQVITLL DYGDVAFGKT TNVTAVVYNS GNATATNITV HFYLENYVLI FQEGNLSHPL VWLGNATISS LAPGEVAYVT IPWKAVNLIP LGMDPTNSSV RWQYVVASAD VAGDTNLNNN AMWALFHVHL PLDVWVQKVF IQKDPVVGEP NSITFEITNI GTTTTATSNV TVMDYYGTIK TVSVTLAPGQ TKFITVPWTP KVAGEDIIEV MASTPGDQNP RNDVSFYYPN GYWSIATFKV LSYDVEPVMA YQRDDNINIE LYNFGPLASN GTKVDLWEME NVTDINIQSP HPYPNNYDNV WTITVPNATK IALHFNYLYV EPGWDYVYIY NSTGKLQVSY TGFYNDLWTP FINGSTVYVE LVSDESVNYT GFYIDAYSTG ISHIGSTQVG AIGSGDYITA IMSLGNTTAG VHYYKISTNT PGESATLTTG GSSNNIIYQT INVTDTIAPQ IESFHLANTI TNDRSPELNF TWYDEIYSGF FKVSVQIDGI TIPARVIANG NSGNVIANVP FLLADGKHTV QITLVDNGGN KVTESWTFTV DATPPSLKIT TSTNTPITYT STFWINGTTE PGATVTINGV NVPVDPNGNF AYKTTLVNGE NVFRVVATDE AGNSATEIVT ALYLPQIPEI LNAINAINSE ISNLQSQVTT MKDEISALQG KISTIQGDIS TINSDINNIK SQLDTLNSRI NTLQNDLKEN VTALNKAIEE LNTTLVNKIE QNINNLQSQI NDLKNQANDL QSNIQQINTN ISDIQSKNNE QDKAISSSNN IGYAGIVLGI IALIIAIVAV ARKPKVPMQK EEKKESTKEN PSEESEEDID ETEEGSEEEK E
|
| |