Gene Information |
Locus tag | Aasi_0224 |
Symbol | |
ID | 6376525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 249024 |
End bp | 250238 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642681412 |
Product | hypothetical protein |
Protein accession | YP_001957397 |
Protein GI | 189501680 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
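The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the listed gene sequence ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Aasi_0224).
length_bp = gene_length(249024, 250238)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record the sketch gives 1215 bp and 404 aa, matching the Gene Length and Protein Length fields.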
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.505071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCTA ATAAAAGCAC TACCATCAAT CTTCACATTT GTATTAATCT TCTAAAGTTA TTATTAGTAT TGGTTACAGT AGTTGCTTGT GAATGTGTCC CAAAGGGAGG CAGAGGTGAT ATAATAAATC CAGAAATTCC CAAAGCAAAC GATAAAAGCG AAGAACCTAT CACCAATGAT AAGAACAAAA AGCCTATTCC CAAGGAGATG ATTGATGCTG CTGAAGCAGC AAAGAAAACC AATTTAGTAA TTTTACTTAA AACGTTCCAG GAGCTAGAGG CCGAAGACAT TAATATTGTC AATGGACTAC TTAGTTCGGA CCTGGATATT TTAGGTTTAA AAGAAGCTGT AGAATTTGGG AACTTAGATA TTGTAGATAC AATACTAAAA AGGGGAGTAA ACGTAAACAA AAAGATACAA GAAAACGAAA AAGATGAATT TATACTTCTA AATTATAGAA ATGATCTTAC ACCTTTACAT ATAGCTGCTA AATCTGGCCA TACCGAAATA CTACTCAAAT TAATAGAAAA AGGTGCAGAA CTAAATGCAA AGGATAAATA TGGTGATACT CCTTTACATC TTGCTGCTGA TGCTGGGCAT GCTGATATAG TATTTAAATT GATACAAAAG GGAGCAAACA TAAAATCAGC AACTAACGAT GGGTATACCC CTTTACATCT TGCTATCATG AAAGCACATA CGGAAATAGC GTTAAGTCTG ATAGAGCAAG GAGCAAATCT TGACATATCA AGCATTGAAG GAGATACGGC ACTTAACCTG GCAGCCAGGA AAGGCTATGC CAATATAGTG CTAAAATTAA TAGAAAAAGG GGCTGACGTA AATATAAAGA ACAAAATAGG TTTACATCCT TTATATTATA TTATTCGAGA AGGGCATGGT GATATAGCTT TAACATTAAT AGAAAAAGCA AAAGAAATAG AAGTAAATAC AATAGATAGG CAGGGAAATA CTCTTTTACA TTTGGCTGTA TATGGAGATA TTAGGCTACT ATCAAAATTA GTAGAAAAAG GGGTAAAAAT TGATATTACG AACAATAGCG GAAATACACC TTTACATATT GCTGCTAAGT ATGGTTGTAA AGAAGCGGTA TCAGTATTAG TAAACTGCGG AGCAAAGAAA GATATAGCAA ATAACGAACG CAACACCCCT TTAGATTTAG CTAAAACAGA AGAAATAAGA GCTTTACTAA AATAG
|
Protein sequence | MQPNKSTTIN LHICINLLKL LLVLVTVVAC ECVPKGGRGD IINPEIPKAN DKSEEPITND KNKKPIPKEM IDAAEAAKKT NLVILLKTFQ ELEAEDINIV NGLLSSDLDI LGLKEAVEFG NLDIVDTILK RGVNVNKKIQ ENEKDEFILL NYRNDLTPLH IAAKSGHTEI LLKLIEKGAE LNAKDKYGDT PLHLAADAGH ADIVFKLIQK GANIKSATND GYTPLHLAIM KAHTEIALSL IEQGANLDIS SIEGDTALNL AARKGYANIV LKLIEKGADV NIKNKIGLHP LYYIIREGHG DIALTLIEKA KEIEVNTIDR QGNTLLHLAV YGDIRLLSKL VEKGVKIDIT NNSGNTPLHI AAKYGCKEAV SVLVNCGAKK DIANNERNTP LDLAKTEEIR ALLK