Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0432 |
Symbol | |
ID | 6377352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 504418 |
End bp | 507426 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642681597 |
Product | hypothetical protein |
Protein accession | YP_001957576 |
Protein GI | 189501859 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0267452 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAC ATTATAGTGT AGCTCGTCAA TTTATGGCGT GTATTCTATT TGTAAGCCTA TGCTTACAAA GCTGTACAAA TATTTATGTC CCATCAAACC TTCCAATAAA AGAGGGCGAA AAATACGTTA GTGAGAGGGT TGTCCGGCAG CTAATAGGCA AACAATCGAT TACCAAAGAA GGGCATATCG TTACTTTTTA TCAGCAAGGA AACCAACTGC GTGCGGAAGT TGGAGAAAAT TTACCACAGG GATTCGATAA AACGTATACT GTGCCTGTGT ATATAGAGGA AGAAATGAAG CTAGCACAGG TAGCTAGTTT AAACGAAGAA GCTCAAAAGA AAAACATTCA TGTGGTTTTT CCTAAAAACC AACAGGAAGG GTATGTATAT GTAGGCCACA CAGGTTTAAT GGGAGGAGGT AAGGATAAAA AAAGCAGGAA GCAAGTATTG CCAGGAGAAG AAGAGGTAGA AGAAGAAACA GAGGAAAAAT CCCCCAGAAG TACTGAAACA GAAGGTTATC AGGTAGCAAT CCAACCAGCC TTTTCGCCTG AGCACCAACA AAAGTCCACA CCTCGTTTGA TCTTAAGTTT AGATGGAGGA GGTATTCGAG GGCTACTAGA AGCAGATGCG TTAAATTACA TAGAAAAAGT ATTAGCAGAA AGAATTATAA ATCATTTTGG TGATCGATCT GCTCCAAAAC CTGATGTGCG CTTAGGTGAA TATTTTGACT TAATTGCAGG CACTTCCACA GGTGGTATTA TTGCTTTAGC TATGCGTATT TTAGACCTTG CTACCAATCG GCCACGTTAT AATATGGAAA TAGTATCAGG AATATATAAA GATAAAGGAG GCAAAATCTT TTATGGTAAT AACAAACTTT GGAAACTATT GTGCCAAGCA AAATCTAATA TATATAATCC TAAGCCTTTA GAGGATATTT TAACAGAGTA CTTTGGCAAT GCAACTTTAC AAGATTTATG TGATCCTGTT TTAATTACTA CGTATGATAC AGATAAACCT GGTATTTATC TTTTTAAAAG CTCTGATACA AAAAATGGTG CAAGCAAAAA CTTTTATGTA AAGGATGTTG CTAGGGCTAC TTCAGCAGCT CCTACTTATT TCCCTCCAGC ACAGATTAGT TCTATAAGTG GAGAAAAATA TTGTTTTATA GATGGAGGAG TTGCTGCGAA TAATCCCGCT CTCTACGCTT ATACATATGC TAAGGATAAC TTATACCAAA ATTCTCGTTT CCATCTTATT TCTTTAAATA CAGGAACATC CCCAAAACCC AGCTTAGCAC GTACAGCAAG TAAAGGTGGT GTTCTATTGG TACCTAAACT AATAGAGGTA GCTATGAATA GCAACAGTGA TGCTGTTGAG TCGTATACAG CATCTTTGAT TACTGAGAGA CCAGGAGACA CTTATACACG TTTAGAATTT GAAATTGATC ATCAGACCAC CAAAGCACTT GATAATGCTA GTAATAGCAA TTTAGAAAAG TTGGTAAAAT ATGCTTGCAA AACAGTGGAG AAAGAAAAGG ATGAAACCTT GAAGACTATA GTGGAGGCTA TAGTGGATAG GTTAAAAAAG TGCAATTATT ATGTTTTTCA CTCGCTTGTT AAGGAAGCCC GTGAGCAACT ACAAAATGGT GAGGGTAGGG CTGATTTATC CAAGACATAC CTACAATCTT TGATGCTGCC GTATGTATGT GAGCGTGCTA CATGGGAAAT TGCACATGCT TTAAGCGTAC CCCACATGCC CAGTTTAACC TATTTAGATT TAAGTGGTAA TGAGCTTATA TCTAAAGGTA ATAGTTTAAC ATATTTAGAA AAGCTTAATA GTCTTATTTA CCTAGATCTT AGTAATACAG GTCTAACCAT AGATGGATTA GCAAAGCTAA AAGGTGCTAA GTTACATCTA GATATACTAA AAGTAAGAAA CAACCCAAGA TTAAACTGGA TAAAGGCTGG TAGGATTGCA AATGAAATTG AGAACTATAA AATTTCTTAT CTGGATAGTG ATGTTATGAG AAATTTGGCA AGTCACTATC AAACTCAAGG CCGAGACACT AGAGCGGCTC TAATGATTGA CCTAGCTGAT GATAAACATA CTACTGGTCC TGCTGAATTT CATTTAGGTA GGATGTATGA GAACGGATGG GATTTAGCTA AGAACTGGGA AAAAGCAATA TTATGGTATC AAAGAGCTGG TAACCAAAAC CATACAGAAG CACAATACAG GTTAGGTAGG ATATATGAAA ATGGCAGGGT AGCAAAAAAG GATGAGCAGA CGGCTGCTCA ATGGTATGAG AAAGCTGCTA TACAAGGAAA TAGAGTGGCA CAATATGCAT TATGCTCCAT GTATGAAAGA GCTGTTAGAC AAGGGTGCCC AAAGGTACAA TATAGCTTGG GAAAAATGTA TTATAATGGC TGGGGAGTAG ACAAAAATTA TCAGGAAGCA GTGGAATGGT ATCAAAAGGC AGCTAACCAA GGATATGCAG AAGCACAATA TCAATTAGGA TATATGTATG AATATCCCAA AGGGCTATTG CAAAATTACA AGGAAGCAGC CAAGTGGTAC CAAGCAGCAG CTAAGCAAGG TATAATAACT GCTCAGGTTA AGTTAGCAGA TATGTCTTAT TATGGACTAG GTGTTGATAA GGATGAACAA GAAGCATTCA GATGGTTTCA AAAGGCAGCT AATCAAGGAC ATGCAGCGGC ACAACTTGTT TTGGGGGTAA TGTATGTCAA TGGACGAGGT GTTACCAAGG ATGATGTTAA AGCTGTAGAA TGGATTGAAA AGGCAGTTAA TCAAGGAGAT GCAGAAGCAC AACTTGTTTT GGGGATAATG TATGCCAATG GACGAGGTGT TAATAAGGAT GAAGAACAAG CAGTAGCATG GTATCAAAAA GCTGCCGATC AGGGAAGTGC AGTTGCACAA TATATGCTGG AGCAGAGGTA TGAGAATGGA CGAGGTGTTA CCAAGGATGA TGTTAAAGCT GTAGAATAG
|
Protein sequence | MKQHYSVARQ FMACILFVSL CLQSCTNIYV PSNLPIKEGE KYVSERVVRQ LIGKQSITKE GHIVTFYQQG NQLRAEVGEN LPQGFDKTYT VPVYIEEEMK LAQVASLNEE AQKKNIHVVF PKNQQEGYVY VGHTGLMGGG KDKKSRKQVL PGEEEVEEET EEKSPRSTET EGYQVAIQPA FSPEHQQKST PRLILSLDGG GIRGLLEADA LNYIEKVLAE RIINHFGDRS APKPDVRLGE YFDLIAGTST GGIIALAMRI LDLATNRPRY NMEIVSGIYK DKGGKIFYGN NKLWKLLCQA KSNIYNPKPL EDILTEYFGN ATLQDLCDPV LITTYDTDKP GIYLFKSSDT KNGASKNFYV KDVARATSAA PTYFPPAQIS SISGEKYCFI DGGVAANNPA LYAYTYAKDN LYQNSRFHLI SLNTGTSPKP SLARTASKGG VLLVPKLIEV AMNSNSDAVE SYTASLITER PGDTYTRLEF EIDHQTTKAL DNASNSNLEK LVKYACKTVE KEKDETLKTI VEAIVDRLKK CNYYVFHSLV KEAREQLQNG EGRADLSKTY LQSLMLPYVC ERATWEIAHA LSVPHMPSLT YLDLSGNELI SKGNSLTYLE KLNSLIYLDL SNTGLTIDGL AKLKGAKLHL DILKVRNNPR LNWIKAGRIA NEIENYKISY LDSDVMRNLA SHYQTQGRDT RAALMIDLAD DKHTTGPAEF HLGRMYENGW DLAKNWEKAI LWYQRAGNQN HTEAQYRLGR IYENGRVAKK DEQTAAQWYE KAAIQGNRVA QYALCSMYER AVRQGCPKVQ YSLGKMYYNG WGVDKNYQEA VEWYQKAANQ GYAEAQYQLG YMYEYPKGLL QNYKEAAKWY QAAAKQGIIT AQVKLADMSY YGLGVDKDEQ EAFRWFQKAA NQGHAAAQLV LGVMYVNGRG VTKDDVKAVE WIEKAVNQGD AEAQLVLGIM YANGRGVNKD EEQAVAWYQK AADQGSAVAQ YMLEQRYENG RGVTKDDVKA VE
|
| |