Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1218 |
Symbol | |
ID | 6376753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1556591 |
End bp | 1558249 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642682316 |
Product | hypothetical protein |
Protein accession | YP_001958274 |
Protein GI | 189502557 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000594421 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAGAAG GAGAAGAGGA AGCAGCAGAA GATAAGCTAT CAGATGAGAA TATACCCGAC GAATGTTTTT GCCCCATCAC CCAGGAGATT ATGGAAGATC CGGTCATTGC TCAGGATAGC CATAGCTATG AACGATCAGC CATACAACGC TGGTTTGATG TGGGAAAGCG GGTCAGCCCT ATGACTGGAA AGAGGCTGCT TAGTACCGAG CTCATAGCTA ATTATACCAT GCGTAGTTTA ATTCAGGATA TAAAAGCACA GGTACCTGTT TTAACCAGAC ATAAGCTGGA TATACGTAAT ATTGAAGCAG CTATTAAACT CAGAGAAGAA GAGATAGAAG AAAAATTGAT ACAAAAGGGG CATTTAGTAG AAAAAGAAAG CCAGGAACGG TTAAGCTTAG AAGAAAAGCT ACAACAAAAA GAGATAGAGT TGCAACAACA TAAAAGAAAA TTGGAAGAAA AAACAGCTCA GCTTCATATT ATGGAAAAAC GAATTGGGCT ATTAAAAGAG CAAGTTAACT CTTTTATAGA AAGAGATAAA CAGATGCGTA CAACGATGCA AGAGTGTATA CTACAAATGC AGCAGTACAT GGTGCAGCCT GGTCCACCTG TTAGTAGTTC AAGCAGCAGT GTTTCTGCCT TTCAACAAAA AGTAAAAGAA GGAGTTCTAG AAGGCAATCA TAATAATAAG GCAGAACAAG ATTATCCAGC AGTTTCTGAA CAAAAACTTC GATATTTTTT AGACAGTAAG AGGTTGAAGG GAAAGGTTCT AGATGAGCAA GAAGCTATAA AACATTGTAA AGAAGGAGCT ACTATTGGGC ATATGTATGC ACAATATGTG CTAGGTAATA AGTATAGGAG CGGGCGACAG GGATTAAAAA GAAATTATGC TAAAGCTAAA AGATGGTATG AAAAAGCAGC TGAACAAGGA TATGCAGAGG CACAATATAA GCTAGGAGCT ATGTATGATA ATGGAGAAGG GGTAACAATA GACTTTATTG AAGCTAAAAA GTGTTATGAA AAAGCAGCTT GCCAAGGTGT GGCGGTTGCT CAAGCTAGGC TAGCAAGCTT ATACTATTAT GGACGAGGGG TTCAATTAAA TAGGGCTGAA GCAGAAAGAC TATGCTTACA AATAAGAGAG AAAATAGCCA TAGATGCTCA AAAAGGTGAT GCAGATTGCC AGCTTAGTTT GGGTTGGATG TATTATCATG GTTGTGGTAT AAGGAGGAAT TACTCAAGAG CTATGGCATG GTATCTGAAA TCTGCTAACC AAGGATGTGC AGCTGCCCAG AATAATTTAG GCGTTATGTA TGCGTATGAT TGGTTCGGAG CGATAAAAAA AGACTATACA AAAGCTAGGG AATGGTATCA GAAAGCAGCT GAACAAGGAT ATGCACATGC ACAATCTAAC CTGGGGGGGC TATATTATTC TGGGCAAGGG GTAGAGAAAG ATGATAGAAA AGCATGTGAA TGGTATCAGA AAGCAGCTGA ACAAGGATAT GCACATGCAC AATATAGCTT AGGCATAATG TATAGGAATG GATTTGGGGT AGGAAAGGAT AATATAAAAG CTATAGAATG GTTTCGAAAA GCCGCTGAAA AGGGCTATGA GGATGCACAA ATAATACTTA ATTCGGTAGT AATCCATTTT TCATCATAA
|
Protein sequence | MEEGEEEAAE DKLSDENIPD ECFCPITQEI MEDPVIAQDS HSYERSAIQR WFDVGKRVSP MTGKRLLSTE LIANYTMRSL IQDIKAQVPV LTRHKLDIRN IEAAIKLREE EIEEKLIQKG HLVEKESQER LSLEEKLQQK EIELQQHKRK LEEKTAQLHI MEKRIGLLKE QVNSFIERDK QMRTTMQECI LQMQQYMVQP GPPVSSSSSS VSAFQQKVKE GVLEGNHNNK AEQDYPAVSE QKLRYFLDSK RLKGKVLDEQ EAIKHCKEGA TIGHMYAQYV LGNKYRSGRQ GLKRNYAKAK RWYEKAAEQG YAEAQYKLGA MYDNGEGVTI DFIEAKKCYE KAACQGVAVA QARLASLYYY GRGVQLNRAE AERLCLQIRE KIAIDAQKGD ADCQLSLGWM YYHGCGIRRN YSRAMAWYLK SANQGCAAAQ NNLGVMYAYD WFGAIKKDYT KAREWYQKAA EQGYAHAQSN LGGLYYSGQG VEKDDRKACE WYQKAAEQGY AHAQYSLGIM YRNGFGVGKD NIKAIEWFRK AAEKGYEDAQ IILNSVVIHF SS
|
| |