Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0851 |
Symbol | |
ID | 6377039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1078070 |
End bp | 1080955 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642681990 |
Product | hypothetical protein |
Protein accession | YP_001957951 |
Protein GI | 189502234 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00404215 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAGAA ATTCTTCTCC ATTTCAAGTT TATATAGCTT GCATCTTACT TATCAGCTTT TTCTTACAAA ACTGCTGCGG AGAGTTTAAT AATAATCCAC TTATTTCGAC TCAAGAAAAG CAAACAGCAT CTATCCAAAC GCATTCGCAA TTAATTCTTA CTCCAAGAGC AGACATCGGG TCTTTGGCAG GCCAAGGACT AAAAGCCCAG GGGGGCCATG CGGTTACTTT TTATCAAGAA GCCGGTGAGT TACGAGCTAA TGTCGAGATG AATGTGCCTG AACGGTTTAG TAAAACCTAC GAAGGGTTAT TAGTAACCCT TGAGCAAGGA GCAGAAGTAA CGAGATTACC ACAGCTAAGT GAACAGGCGC AGGAGCGTCG CATCCAGCTT CAATTAGCCA AGGGCAACCA GCCAGCTAAA GTAGTGATCT ATAAAGGAGC AGGGCTCGTG GGAGGAATGC AGGAAGGAGA GGAGAAAGCA GATGAATCTA TACCTAACGA ATGCCTTTGC CCCATTACTC ATGAGCTGAT GGAAGAGCCT GTTATTGCAC GGGACGGGCA TACGTATGAG CGGGCGGCCA TTGAGCGATG GTTTTCAATG GGCAAACGCA CGAGCCCTGT CACAGGAGCT AAAATTGGTA GCACTAAGCT GATACCTAAT TATGCCATCC GCAGCCTGAT TAAAAGCTTA AAGACACACA ATTCTAGTTT AGCTAAGCCT ATTCTGGGTA TGCAAACTGC GAGACAGCAA TCTCTATCAG TACCATCTGG AGCTTGCTTT TTTGTGAGTG ATAGCGTTAT TGATGTACTT TTCTCTCAAT CCATAAGAGG AGATGTGGTA GCATTAAGTA AGCTTTTAAA TTTGAGTGAA AATTACCATG TCCATGCGCA GTATAAAGTA GGGGTTATGT GTGCAGAGGG GCGAGGTATA GCAAAAAATG CGGCCAAAGC AGTAGAATGG TATGAGAAAG CAGCTAAGCA AGGACATGCG GTTGCACAGT CTAATCTAGG ATGGATGTAT GCAGATGGGC GTGGTGTAGC TCAAAATTAT GCTAAGGCAA TAAAATGGTT TCAAAAAGCT GCCAACCAAG GACATGCAAG TGCGCAATAT AAATTAGGAT GGATGTATGC AGAAGGGCTA GGCGTAGTAA AAGATGCAAG GAAAGCAATA GAATGGTATG AGAGAGCAGC TAAGCAAGGA GATGCAAGTG CACAATCTAA TCTAGGCGTA AGTTATGCTA ATGGATGGGG CGTAGCAAAA GATGCAAGGA AAGCAATAAA GTGGTTTCAA AAAGCGGCGG ATCAAGGACA TGCAACCTCA CAATATAATC TAGCATGGAT GTATGCAGAT GGGCAAGGAG TAGTAAAAGA TACAAGGAAA GCAGTAGAAT GGTTTCAAAA AGCGGCTAAT CAAGGATATG TAAAAGCACA ATATAATCTA GGATGGATGT ATGCAGAAGG ACGAGGAGTA GACAAAGATG CAAGGAAAGC AATAGAGTGG TATAAAAAAG CTGCCAAGCA AGGACACGCG GATGCACAAT TAAAACTAGG AGCGAGATAT TTCAAGGGTG AAGGTATAGC TAAAGATTAT GCTAAAGCAA AAGAATGGTA TGAAAAAACA GCCGATCAAG GACATGCCCA TGCTCAATAT AATTTAGGAT ATATGTATGA AAAGGGATTA GGAGTAGCTA AGGATTATGT TAAAGCAATA GCATGGTATA AGCAAGCAGC CAATCAAGGA CATGCAAAGA GTCAATATGC CTTAGGAGTG ATATATATAG AAGGTCAAGG AGTAGCCAAA GATGTAAGAA AAGCAATAGA ATGGTATGAA AAAGCAGCCA ATCAAGGGCA TGCAGATGTA CAATTAAAAT TAGCAGCGAG ATATTTCAAG GGTGAAGGTA TAGCTAAAGA TTATGCTAAA GCAATAGAAT GGTTTCAAAA AACAGCCAAT CAAGGACATG CCAATGCGCA ATATAATTTA GGATATGTGC ATGAAAAGGG ATTAGGAGTA GCGAAGGATT ATGTTAAAGC AATAGAATGG TATGAGAAAG CAGCCAATCA AGAACATGCA AAGAGTCAAT ATGCCTTAGG AGTGATATAT GAGAGTGGAG AAGGAGTAGA AAAAGATGAA AAAAAAGCAA TAGAGTGGTA TGAGAAAGCT GCCAATCAAG GACATGCAAG GGCACAATTT AGCCTAGGGG TTATGTATGG AGAAGGAGAG GGAGTAGAAA AAGATGAGAG GAAAGCAGTA GAATGGTATG AAAAAGCAGC CAATCAAGGA CATGCAAGAG CACAATTTAA ACTAGGATGG ATGTATGGAG AAGGACGAGG AGTAAGTCAA GATTATGCTA AAGCAATAGA GTGGTCTGAA AAAGCAGCCA ATCAAGGACA TGCAAGAGCA CAATATAATT TAGGATGGAT ATATGAAAAT TGGAAGGGAG TAGCCAAAGA TTATGCTAAA GCAGTAGAAT GGTTTCAAAA AGCTGCTAAT CAAGGATATG CAAGAGCACA ATATAATTTA GCTCGGATGT ATGACCATGG ACAAGGTGTG GTCCAAAACT ACCAAGAGGC AGTAAAATGG TATGAAAAAA GTGTTGGACA AGGAAATAAC TATGCCAAGG CCTACCTAGG TCGTTTGTAT TATCATGGTT TTGGAGCTGA GAAAAATCTT TTACAAGCAA GTAAATTAAT CGAGGAGGCT ATTATCCATA TGAAAAGCAA GGCTGAGGAG GGTTGTATAG AAGCCCAATA TATAGTAGGA TGGATGTATC AATATGGCCT AGGGGTAATG CAAGACCACG TGGAGGCAGC AGTGTGGTAT AAGAAATCAG CTAACACTTA TCCAGCAGCA CAAAAAGCAT TAGATGAGTT GAACCATGCT GGCTAA
|
Protein sequence | MKRNSSPFQV YIACILLISF FLQNCCGEFN NNPLISTQEK QTASIQTHSQ LILTPRADIG SLAGQGLKAQ GGHAVTFYQE AGELRANVEM NVPERFSKTY EGLLVTLEQG AEVTRLPQLS EQAQERRIQL QLAKGNQPAK VVIYKGAGLV GGMQEGEEKA DESIPNECLC PITHELMEEP VIARDGHTYE RAAIERWFSM GKRTSPVTGA KIGSTKLIPN YAIRSLIKSL KTHNSSLAKP ILGMQTARQQ SLSVPSGACF FVSDSVIDVL FSQSIRGDVV ALSKLLNLSE NYHVHAQYKV GVMCAEGRGI AKNAAKAVEW YEKAAKQGHA VAQSNLGWMY ADGRGVAQNY AKAIKWFQKA ANQGHASAQY KLGWMYAEGL GVVKDARKAI EWYERAAKQG DASAQSNLGV SYANGWGVAK DARKAIKWFQ KAADQGHATS QYNLAWMYAD GQGVVKDTRK AVEWFQKAAN QGYVKAQYNL GWMYAEGRGV DKDARKAIEW YKKAAKQGHA DAQLKLGARY FKGEGIAKDY AKAKEWYEKT ADQGHAHAQY NLGYMYEKGL GVAKDYVKAI AWYKQAANQG HAKSQYALGV IYIEGQGVAK DVRKAIEWYE KAANQGHADV QLKLAARYFK GEGIAKDYAK AIEWFQKTAN QGHANAQYNL GYVHEKGLGV AKDYVKAIEW YEKAANQEHA KSQYALGVIY ESGEGVEKDE KKAIEWYEKA ANQGHARAQF SLGVMYGEGE GVEKDERKAV EWYEKAANQG HARAQFKLGW MYGEGRGVSQ DYAKAIEWSE KAANQGHARA QYNLGWIYEN WKGVAKDYAK AVEWFQKAAN QGYARAQYNL ARMYDHGQGV VQNYQEAVKW YEKSVGQGNN YAKAYLGRLY YHGFGAEKNL LQASKLIEEA IIHMKSKAEE GCIEAQYIVG WMYQYGLGVM QDHVEAAVWY KKSANTYPAA QKALDELNHA G
|
| |