Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1443 |
Symbol | |
ID | 6377493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1864314 |
End bp | 1867475 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642682512 |
Product | hypothetical protein |
Protein accession | YP_001958461 |
Protein GI | 189502744 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.639099 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAT CTTATACTAT AAATCAGCAA TTTATAGGTT GCCTTTTACT TGTAAGCTTG CTTTTACAAA GCTGTAGTGG TTTAGGTAAC CCATGTATGC CTATTGAGAA AAACAAGATA GCGCATATAC AAACTGATAC TTCTACAGCA ACAACTGCAT ATCAGATTGA TGTAACAGAA TCTGTACCTG TATTGGCAGA GGATTCTTCC ACTAGTGGCC AAGCTTTAGT CCAATTATCT ATGTACGATA ATAGTTTACC TGATATAGCA AAACAAGAAA TTAAAGCAAC TATAAGAAGT GAACAACTAT CTAATAATAT ACATATAGGC AAACAGATAC AGTTAATAAA TGCTCCAGTA AAATCAGTGC AATCATCTAT CCATTCATCT ATTAGCAAGA CTACTGAATT AATCAGGTCT AAACAGCATA TAAGAAAAAA CCAACATACT GCAGCAGAAA AAAGAAATAA ACATTTGACA GCTAGCTTAT TAAAATACCA GCAATACACA ATAAAAGGAG GGTATGAGAT ACAGTTCTCG CAGACTAAAG GAAAATTACA AGCCATAGTA AGAAAGTGTT ATCCTACAGG ATTTAGCGAG CAAGTATTAC CTGTTATTAT AACACCAGGA TTTAGCTTTA CAGAAAAAGA GGTAGTTAAT GAAGGCTGGC AACAACAATA TGTACATATT TTCAAGGATT ATGTATATGT AGGCCAAAGT GGTCTGCTGG GTGGCATGAA ACTAGGGTAT CGAGGATATG TAGAAGATGA GCTTCATACG AGTGCTGTGC CTGATGAGTA TTGTTGCCCA ATCACCAAAC AAATTATGGC TGAGCCTGTT ATGGCAGCAG ATGGTTATAC TTATGAGAAA AGTGCTATTG AGCAACATAT GAATGAGAAA GGAGCCATTA GCCCTTTTAT CCGAAAACCG TTAACCAGTA CGAACTTAAT ACCTAACCAG GGGCTAAAAA GAGCTATCCA AAATTATGTA GAAAAGAATA AGAAATTTTA TGAACAACAG TGTATTAAAG CAATACAAGA AGTTGATATT AATTCATTGC TACATTTAGA AAAGTTGGGT ATTAATATTG ATGTGGCTGA TGAAAATGGC TGGACATTGA TACACTATGT TATTTATCAA GCAAAAATAG AACACATAAA GTTTTGCTTA AATAGGAAAT TTAACATTAA TGCTGCTAGT GCAAGTTTAA AGGGATTATA TTTTCCACCG GATTTACCGC AATTAATTGC CGCTCAGTTA GAAGAAATCA ATATAAAAGC TGCCAAATAT AATCTTTCTG CAAAGATTAC TGAGCAAACG ATATTCCAAA TAATGAGAGA ACTAGAAAAC CAAGCTAAAG CAAATGATTA CTCAATTGAA AAAGGCCAGC AAAGCTGGGA GGCCTTTGAA AGGGATGCCA TAAAGGTTGA ACAAGCATCC TATAACAAAG CTTTTTTTTC CGGATCGGCC CAAAATGCTT GGCATATGAC TTCTAATAGA AGTTCCTATT TTTCAGGTTC AGAATCGCGT CGGAGGAATT GGCAACACTA TCACGATAAC CTTTCTCGCT GGCTAGATGA GCGAAATCAA GTTGTAGCTC TATTGGCTGA GGTTAAAAAA ATTGAACAAA ATATAAATAT TTTGCAGCAA AGAAAACAAT GTGTATTTAA TAACTTAGCG CCCCTCCATT TGGCAACAGC TCAAAAAAAT GAAAAAATAA TAATGCAGTT AATTCAGCTA GGTGCTGATT TAGAATTAAA AGATGGCAAT GGAACAACGC CAATTTTTTG GGCTGTTTAC CAAAATGAAT TAAAACTTTT AAAATTGTTT GTTGACAATG GGGCAAATTT ACAAGTATCT GATAATCAAG GTAATACCTT ACTTCATGTT GCTGCACAAT ATGCTGATTT AAATATCATC AATTATTTAA TTGAAATCGG TTTTTATCAT TTGCAGGAAA ATCATCTAGG GCAAACAGCT ATTGCTGTTG CCTTAAAGTA TGAGCGCAAA GCAATTGCCG ATTTTATTAA TAAAAAAGGG GCTGAAGGTT TACAAGCAGC GCTTTTTAGA ATCAACCGAG GTCAACAACA GCGTAAGGAT TCTTCTACAA TTTTGGGCTC TGGTAGCAGT CAGTCGAATC TTTATTATCC AGCTTCCTTA AGCAATCCTA CTCCATTAAA TAAACCAAGC AATACAACCA GCCTTGCTAG TAATACCAAT CTGCATCTAA CTACCTTACC GCCATCACCT GCTTATGATG AACCGCAAAG CGCTGGTCGT TTAGCCCCAC CACTTACCTC TGCAACGCAG GTAACTGAAC AATTTAGTCG GCTTAGGATT TCGTACGAGA TTCCTTATCA GGCTCTCCAT TTTCAACAAG AGCTTGGCCG TGGAGGGTTT GGGATTGTGT ACAAAGGTGC TTATCAAGAC AAGCTAGTGG CAATAAAACA GTTAATGAAT CAGGACTTAT CTAAAGCTCT CATACATAAT TTCAAACAAG AGATTAGTAT GATGGCAAGG TTGGAATCAC CTTATGTTAT TAAATTTATT GGTGCTTGTT TCCAAGCGCC ACACTATTCT CTGGTGATGG ATTATATGCC TAATGGGGAT CTTTACCACT TTCTTCAAAA ACCAGGACAA ATAGATTGGC AGCTACGATA TCAAATTGCT ACTGATATCG GCCATGGTGT AAATTATCTG CACTCACACG GTATTATTCA TGGTGATTTA AAAAGTCTAA ATATTTTATT AGATAAAAAT TATCAAGCCA AAATAACGGA CTTTGGCTTG GCTAAAATTA AGATATCTAG TTCGATTAGT ACCTTAGTGG GAGGTCAGAA AGGAGGGTCG CTCCGTTGGA TGGCACCTGA GCTCTTAACA GCCGAAGAAG AAGAAACTAG TAATACAAAA GCCTCCGATG TTTATAGTTA TGGTATGGTA TTATGGGAAC TTGGCGCAAG GCAAATACCT TATGCTAATA AGAGGGACCC TCAAGTTTTG GCTTTAAAAT TACAAAATAA ACATGAGCCC ATCACTCCGG ACACTCCTCC GTCAATATCT GCACTTATCC AATGGTGTTG GAAAGAAAGA ACCAAAAGAC CTGCAATTAC TGAAGCGGTA GAAACCTTGG AAAAAGAGCA GAGGTTACTT TTAAATAAAT AG
|
Protein sequence | MKRSYTINQQ FIGCLLLVSL LLQSCSGLGN PCMPIEKNKI AHIQTDTSTA TTAYQIDVTE SVPVLAEDSS TSGQALVQLS MYDNSLPDIA KQEIKATIRS EQLSNNIHIG KQIQLINAPV KSVQSSIHSS ISKTTELIRS KQHIRKNQHT AAEKRNKHLT ASLLKYQQYT IKGGYEIQFS QTKGKLQAIV RKCYPTGFSE QVLPVIITPG FSFTEKEVVN EGWQQQYVHI FKDYVYVGQS GLLGGMKLGY RGYVEDELHT SAVPDEYCCP ITKQIMAEPV MAADGYTYEK SAIEQHMNEK GAISPFIRKP LTSTNLIPNQ GLKRAIQNYV EKNKKFYEQQ CIKAIQEVDI NSLLHLEKLG INIDVADENG WTLIHYVIYQ AKIEHIKFCL NRKFNINAAS ASLKGLYFPP DLPQLIAAQL EEINIKAAKY NLSAKITEQT IFQIMRELEN QAKANDYSIE KGQQSWEAFE RDAIKVEQAS YNKAFFSGSA QNAWHMTSNR SSYFSGSESR RRNWQHYHDN LSRWLDERNQ VVALLAEVKK IEQNINILQQ RKQCVFNNLA PLHLATAQKN EKIIMQLIQL GADLELKDGN GTTPIFWAVY QNELKLLKLF VDNGANLQVS DNQGNTLLHV AAQYADLNII NYLIEIGFYH LQENHLGQTA IAVALKYERK AIADFINKKG AEGLQAALFR INRGQQQRKD SSTILGSGSS QSNLYYPASL SNPTPLNKPS NTTSLASNTN LHLTTLPPSP AYDEPQSAGR LAPPLTSATQ VTEQFSRLRI SYEIPYQALH FQQELGRGGF GIVYKGAYQD KLVAIKQLMN QDLSKALIHN FKQEISMMAR LESPYVIKFI GACFQAPHYS LVMDYMPNGD LYHFLQKPGQ IDWQLRYQIA TDIGHGVNYL HSHGIIHGDL KSLNILLDKN YQAKITDFGL AKIKISSSIS TLVGGQKGGS LRWMAPELLT AEEEETSNTK ASDVYSYGMV LWELGARQIP YANKRDPQVL ALKLQNKHEP ITPDTPPSIS ALIQWCWKER TKRPAITEAV ETLEKEQRLL LNK
|
| |