Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0963 |
Symbol | |
ID | 6376959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1247651 |
End bp | 1252303 |
Gene Length | 4653 bp |
Protein Length | 1550 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642682089 |
Product | hypothetical protein |
Protein accession | YP_001958050 |
Protein GI | 189502333 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.424465 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAC GGTCTTACTT GTTTGGCTAC TTACAAGATT TTAATCTAAG AATCATGCTT TTACAATTTG GCAAATTACC CAGGTTATTA ACACAGTTAT TCTTTGTCCT TTGTATTCTT ACAGGGTTTA AAGGAGTAAG ACTGGACACA GTTTTCCAGC AATCAATTGC TTTAACAACT AGCTACAATC TGCTAGCTTT GCAACAATCC AGTATTTACC AGCCAGTGGC TAATAGCCTT GTAGAAGAGA AAAAAGTTAC AGTTAGCATA GCTAACAACA AGGAAATGAC AGAAGAGAAG AAGGAAACAC TTTTACCTTC AGAAGCAAGA AAACAAACGT TAGAGCCGCA TCAAGTTATT AGTCAATCTT TTGAACAAAT CCTAGCTGCT TTATATAATA ATGACCCACA CTTATACTCC ATCAACTTAA CCCACCAACA ATTAAACCTA GAAAAGCTTC TACAGCTTGA ACAAGCCATC GACAATAATC CTGTTATTGG TTACATACAA TGGGGTGAGC TGCCAGCAGA TTGTGAAACT ATTAAAGAAC AAATACAACA AAAGCTTATC AGCAACATAG CTGCTTATAC GTATTACCCC AATGACTATA TCCATGGGCT GTTGGCTTAT CAAGTCTATA GCAATCCTAA GCTAGGGCAA ACCATCGACC TTTCCCTTGT TAGTAAAGAA ATAGGACATC AATTCTCTCC TAGTATCCAT ACCTCCTGGC AAGTAGTACA GGTACAAGAT GACAGCATCC ATACTGGCTA TTATAGTGCT TTGTATGTCA ATGAAATTAC CCACCAAGCG GTACTGGCTT TCCAAGGTAC TAAAGTTGAA GGGCTGAAAG ACCTGCTCAA AAAGAATTCT GATCTTAAAG AAGATATTGA TGGTGTATTA GGTAATGCGA TTACTCAACA AAATGCCCTG GCTTATGTAG CTACAAAAAA TGCTGTGGAC TATGTCAAAG ACAACGAATA TAATTTATCC ATTACAGGTC ACTCACTGGG TGGTTACTTG GCAGAACTAG CCGTCGCTTT TTGTTATCGG GATTTTAGTT ATCGCCAAAC AAAAGGCATT GTTTTTGATA GCCCTGGTAC AGTTAATAAG CTCGATAAGT TTAAGTCAAA TGTCATCAAT AAAGCTACCA AGTTTTCTAT TGCTTCTCTG CCCATTGTTA CGTATCTCTC TGCTCCTAAC TTAATCAATA CCTGTAATGG TCATCCGGGA GAGGTGTATA GGGTCTATCC TCAGCTTATG TGGTCAGCGG AGATACAGAA ATGGATGAAA AGGGCCAGCA AGGTACCACT CATTGGTTCG AAAATAAAAG GAAACAACAA AGGTCTTTTA GCCCTTACAG GTCATAGCCT ATCCACCATA TTAGCCTTGT TTGATCCTAC TACTGGCAAA CCTAGCACGT ACATACGGAT AGCTGACTGG CCTAAAATTG ATACGGATAA TGTGACTTAT ATAGGGAAGA AAAAAGAAGA ATTAGAAAAA AAATCCCTAC TCTCAACGGG TCTCTCTATT TTGACTGGCA AAACAAATGT AATTGGCGGT GCTTTCCAAA TGCTACCTTC GCTGGTAGGT GGTACAGCTG CTAGCGTATT GACGGTGCTT AAAGATTACT TAAACATCAA TCAAGCTCAA TACTTGACTA CGCTAGCTTA CTTGGATGAT AACTACAAAG AAACCAAACT AAGTACACAA AAAGAGTTTA CCCTTCGCTA TAAAGGCCAT TATCGTATTA GCGACAAGGC TATGAATGAA CATATTCTAT ATACAGAGAA CTATGAAGGG GTTGATTGGT ACCTTTATGA GTTATATAAG TACAAAGACA AGCTAGCGCA ATCCCCCACA AGAGATATAA CTTTTGCTGT ACTAAAGAAT GTCGTGCAAG AATATGAGGT CGTCAGCTTA GATGAACAGC CTTACGTAAG ATTAACAGAA AAAAGAGGGC AGGTAGAAGC TTTGCGTGAC AAGATGCAAC GGTCGCTCAA GGTATTGTCA GCTGCTAGTA TTAAGAAAAC ACTAGAAGAC AGCAACATCG ATGCATTAAC CAAGCAGCTA GAAAAAAGGT CCATCAAACA GTTTAGCCAG CTTCACAACT ATATAGCACA AGCCAAATTA ACAACCTACC TAGCCAGAGA AGACAAGCAG CAAGAGTTAA GTCAAAAATT AGAAAAGTCA GGCATATGCG TAATATATGG GCATGGTGGT GTAGGAAAAA GTACACTCGT AGCCCAGTAT GGACATAACC AAAAAGGCAA GCAACTGGTA TGGTGGATGC AAGCAGAAAC ACCAGAAAAA CTGGCAAAAA GCTATCAAGA TTTAGCACAA GATTTAGGCA TTGACTATCA GAAGTTAGCA CAAGGATTCA AGCAATCGTA CGATAACTAC TTACCCGAGC TTAGCAAGAG GGTTTACAAT GCCCTAGAAG ATCGCAAACA GCCTATATTG CTTATCTTAG ATAATGCAGC AGACGCCAAC CTAATAGATA AATGTTTATT GTACCGACCT AGCTTAGTAC AAGTGGTTAT TACAACCAGA AAAGCAAGAG ACTTTAAAGC CTATAGCCAA GTAGAATTAG CTGCTTTCAA GCGAGAAGAA GGAGAAAAGT ATATTAAAAA TTGCTTTGAG TCCAGCTTAT ACAAACCCAG CGAGCAAGAA ATAGCAACTT TAATCAAAGA AGTAGGCCTG ATTCCTCAAA AGCTGGCCTT AGCTGTTGGC TATATCGGTC AAAAACGCTT AATGACCGTA CAAGGTTATA TTGAAAAGCT ACAAGCCATT AAGAAATCAG GTAAAAAAGG AGAAAATCAG TTTGTCTTGC CTGAGGTAAG CTTGGGACTA GAAAGCTTAG ATACTCAATC TCAGTTAGTG ATGCGCTATG GTGCTTACTT GGATCCGGAT TTTATTCCGT TATCCTTAAT AATTGCTCTG GTAGAGGTAA AGGATGAGGA AGTATTAGAG AACTTGTTAG CTACGCTAGA AGAATTGTCC CTCATAACAA TTATAAAGGG GCAAGGCAAT GAACTAGGCA TCCAAATACA CCGAGAGGTA CAAGCAACCA GTAAGCAGTA CCAAGGGTGG GATGCTGCAA ATAACCTCTC AGAAGAAACA TTGTTATCTT CTATTATCAA AGTACTGCAC GAACGTATGC CTTGGGTAAC GGAAGTACCT GACAATACAT GGGATCAAGC AAAAATATAT GCAAGCAATG TAGGCCATGT ATTATCCACT ATTGAACAAA CTATAAAACC AACCTATGTA CTTGCTGACT TAATAAGCAG GCTGGGTAAT TATAATCAAC AAGTAGAGCG CAACTTCAGC CAGGCATTAA AGTACAATCA ACAAGCGCTT GAAGTAAGAA GATCTTTATA TCCAGGAAGT AATCTTCAAG TAGCTGAGAC ATTAGATAAC CTTGGGGGTA CTTATAAGGT TCTAGGCCAG TATCAAGAAG CTTTAAAATA CTATCATCAA GCTCTTGAAG TAAAAAAGGA CTTATATACT GGTAACCATA TACACCTAGC TGAATCATTA ACCAATGTAG GATTAGCACA TAAAGCGCTA GGGCAATACC AGGAAGCTGC AACTTATTTA AAACAAGCCT TTGAAATGAG GCAGGCTTTA TATACAGGCA ACCATCCTCA CATAGCTGAA TCCTTACACA ATCTGGGAGC TATTTATAAA GCTTTAGGCC AATATCAAGA ATCATTAAAA TATTATCAGC AAGGGCTTGA AATGAGGCAG GCTTTATATA CAGGCAACCA CCCTCACATA GCACAATCGT TTAATAATTT AGGTTTGATC TATAAAGCTT TGGGACAGTA CCAAGAAGCA CTAAAATATC TAAAACAGGG GCTTGAAATG CGTAAAGCCT TATATACAGA CAAACATCAT CGAGTAGCAC AGTCATATAA TAATGTAGGA AGTGTTTATA AATCTTTAAA GCAATATCAG GAAGCTTTAA AATACTACCA GCAAGCACTA GATATGAAAA AGTCTTTGTA TATGGGCAAT CACCCTAGCA TGGCTATTTC ATTGAATAAC ATAGGAAATA TCTATACGGC TTTAGGCCAA TATCAAGAGG CATTGAAATA CTTAAAGCAA GCCCTTGAGA TGCGACAAGC ATTATTTACA GGCAATCATC CTCAAACATC TATTTCACTC AATGATTTAG GAGATTTTTA TCAAGCCTCA GGAGAATATC AAGAAGCACT CAAGTATTAT CAACAAGCAC TTACAATGAG GCAATCATTA TATACAGGCA ATCATCCTGA TATAGCTATT TCTCTTAATA GTATAGGGTA TGTTTACCAG ACTTTAGGTC AGCACCAAGA AGCTTTAAAA TACTATCAAC AAGCACTCAA TATGTGGAAA TGTGTGTATA CAGGTAACCA TCCCAAAATA GCTATTTCCT TAAATAATTT GGGGAGTGTT TATCAAGCTT TAGGGGAACA TCAGGAAGCA GTCAAGTATT ATCAGCAAGC CCTTGTTATG CGGCAAGCGC TGTATCCGTG CAATCACCCA GATATAGTTA TTTCACTCAA TAAGCTAGGA GATATTTATA CAGCTTTAGG CCAGCATCAA GAAGCGTTAA CATGTTATCA GCAAGCCCTT GTAATGCGGC AGGCGTTGTA TATAGATAAG TAA
|
Protein sequence | MKIRSYLFGY LQDFNLRIML LQFGKLPRLL TQLFFVLCIL TGFKGVRLDT VFQQSIALTT SYNLLALQQS SIYQPVANSL VEEKKVTVSI ANNKEMTEEK KETLLPSEAR KQTLEPHQVI SQSFEQILAA LYNNDPHLYS INLTHQQLNL EKLLQLEQAI DNNPVIGYIQ WGELPADCET IKEQIQQKLI SNIAAYTYYP NDYIHGLLAY QVYSNPKLGQ TIDLSLVSKE IGHQFSPSIH TSWQVVQVQD DSIHTGYYSA LYVNEITHQA VLAFQGTKVE GLKDLLKKNS DLKEDIDGVL GNAITQQNAL AYVATKNAVD YVKDNEYNLS ITGHSLGGYL AELAVAFCYR DFSYRQTKGI VFDSPGTVNK LDKFKSNVIN KATKFSIASL PIVTYLSAPN LINTCNGHPG EVYRVYPQLM WSAEIQKWMK RASKVPLIGS KIKGNNKGLL ALTGHSLSTI LALFDPTTGK PSTYIRIADW PKIDTDNVTY IGKKKEELEK KSLLSTGLSI LTGKTNVIGG AFQMLPSLVG GTAASVLTVL KDYLNINQAQ YLTTLAYLDD NYKETKLSTQ KEFTLRYKGH YRISDKAMNE HILYTENYEG VDWYLYELYK YKDKLAQSPT RDITFAVLKN VVQEYEVVSL DEQPYVRLTE KRGQVEALRD KMQRSLKVLS AASIKKTLED SNIDALTKQL EKRSIKQFSQ LHNYIAQAKL TTYLAREDKQ QELSQKLEKS GICVIYGHGG VGKSTLVAQY GHNQKGKQLV WWMQAETPEK LAKSYQDLAQ DLGIDYQKLA QGFKQSYDNY LPELSKRVYN ALEDRKQPIL LILDNAADAN LIDKCLLYRP SLVQVVITTR KARDFKAYSQ VELAAFKREE GEKYIKNCFE SSLYKPSEQE IATLIKEVGL IPQKLALAVG YIGQKRLMTV QGYIEKLQAI KKSGKKGENQ FVLPEVSLGL ESLDTQSQLV MRYGAYLDPD FIPLSLIIAL VEVKDEEVLE NLLATLEELS LITIIKGQGN ELGIQIHREV QATSKQYQGW DAANNLSEET LLSSIIKVLH ERMPWVTEVP DNTWDQAKIY ASNVGHVLST IEQTIKPTYV LADLISRLGN YNQQVERNFS QALKYNQQAL EVRRSLYPGS NLQVAETLDN LGGTYKVLGQ YQEALKYYHQ ALEVKKDLYT GNHIHLAESL TNVGLAHKAL GQYQEAATYL KQAFEMRQAL YTGNHPHIAE SLHNLGAIYK ALGQYQESLK YYQQGLEMRQ ALYTGNHPHI AQSFNNLGLI YKALGQYQEA LKYLKQGLEM RKALYTDKHH RVAQSYNNVG SVYKSLKQYQ EALKYYQQAL DMKKSLYMGN HPSMAISLNN IGNIYTALGQ YQEALKYLKQ ALEMRQALFT GNHPQTSISL NDLGDFYQAS GEYQEALKYY QQALTMRQSL YTGNHPDIAI SLNSIGYVYQ TLGQHQEALK YYQQALNMWK CVYTGNHPKI AISLNNLGSV YQALGEHQEA VKYYQQALVM RQALYPCNHP DIVISLNKLG DIYTALGQHQ EALTCYQQAL VMRQALYIDK
|
| |