Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1221 |
Symbol | |
ID | 6376561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1559614 |
End bp | 1561032 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642682318 |
Product | hypothetical protein |
Protein accession | YP_001958276 |
Protein GI | 189502559 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000594421 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATCA ATTATAGCGT CGCACAGCAA TTCATAGCAC GTCTTATACT TATAGGTTTA TGCTTACAAA GCTGTGGTGG AGGATTCGAC AATAATCCAC TTATTCCTAC CGGGGAAGAG CAAGTAGCGT CTATACAAAC TACTACACAA GCAATCCTTC CTCGAGCAGA CATCCAGCCT ATGGTAGATA AACAATTGAC AGCACAAGGG GGGCATGTAG TTAATTTTTA TATGGAAGCA GGTGATTTGC GAGCTAATGT AGTAATGAAC GTACCTGAAG GATTTAGTAA AACCTATGAG GGAGTGGAAG TATTATTAGA GCAGGGAGCA GAGTTATCGG ACCTACCTCG ATTAGGCCAG CAAGCACAAC AACGACGTAT TCATCTTCAA CCAGCACATG TAGGAAAGCC AGCTAAAATA GTTATTTACA AAGGAGCAGG GTTAATGGGG GGTGGCAACC CAGATTCTGA TGAACAGAAA GAAGGAAGCG AAGAAGAGGA AGAAGAGGCC CAATCAGATG TACAAATTCA AGTAAGTAAC ACACAACAAC TAACCAGTGT TGCCTCCTTA CATGCAGCTA CTGAAGAGGG AGATATAAAG AGAATTAGGG GCTTAATACA AGCAGGCATA GATGTAAATA CTAAAAACAA TAATAATTGG ACACCTTTAC ATATAGCTGC TCAGCGGGGC CATCTAGAAG CAGCCAACAA CTTATTAGCG GCAGGTGCAA ATATCAATAC TACGGACAAT AATGGATTAA CACCGCTATA CTTGGCTGCT TTACTAGGTC ATTTAGAGCT GGTAAAGCTG CTTATAGAAC ACAGAGCAGA TGTAAATATT GCTAATACAA AGGGTTGCAC TCCTTTGTAT ATGGCAGCTA TGAAAGGAAA TTTAGAAGTA GTTAAAACTT TAGCCTTCTC AGGAGGGGCC AATATAAATA TACAGAACAA TGAGGGTTTT ACTCCTTCTT ATATAGCTGT ACAAAGAGGG CACTTAGAAG TAGTTAAGTA TTTAGTAGGT GCAGGCACTG ATGTGAATAT TCGCGATAAT AACGCTCTTA CTCCGTTATA CATCTCTGTT TTAAAAGGGC ATATAGACAT TGCCAAACAA TTAGTGGCAT TAGGCGCTGA TGTACAGGAT CCTTTATATG GAGCTGTCAA GAAAGGAAAC TTAGAAGTAG TTAAGCAATT AATCCAACTA GGGGCCTATA TTAATGCTAA AGATGATAAT GGTTATACGT CTTTGCATGT GGCTGTTAAA AAAGGCCATG TGGAAGTAGT TAAGTTATTA CTAGAAAACG GAGGGAATTT ACACTGTAAA GATAGTGCCG GCTCTTCATT ACTTCATATA GCTGTTAGGA AAGATCATAT AGAATTGGTA AAGTTTTTAT TAGTGCAAGG AGTTTCTCCA AAATTTTAA
|
Protein sequence | MKINYSVAQQ FIARLILIGL CLQSCGGGFD NNPLIPTGEE QVASIQTTTQ AILPRADIQP MVDKQLTAQG GHVVNFYMEA GDLRANVVMN VPEGFSKTYE GVEVLLEQGA ELSDLPRLGQ QAQQRRIHLQ PAHVGKPAKI VIYKGAGLMG GGNPDSDEQK EGSEEEEEEA QSDVQIQVSN TQQLTSVASL HAATEEGDIK RIRGLIQAGI DVNTKNNNNW TPLHIAAQRG HLEAANNLLA AGANINTTDN NGLTPLYLAA LLGHLELVKL LIEHRADVNI ANTKGCTPLY MAAMKGNLEV VKTLAFSGGA NINIQNNEGF TPSYIAVQRG HLEVVKYLVG AGTDVNIRDN NALTPLYISV LKGHIDIAKQ LVALGADVQD PLYGAVKKGN LEVVKQLIQL GAYINAKDDN GYTSLHVAVK KGHVEVVKLL LENGGNLHCK DSAGSSLLHI AVRKDHIELV KFLLVQGVSP KF
|
| |