Gene Information |
Locus tag | Aasi_0353 |
Symbol | |
ID | 6377298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 414318 |
End bp | 415592 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642681523 |
Product | hypothetical protein |
Protein accession | YP_001957507 |
Protein GI | 189501790 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
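The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the annotated span includes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 414318
end_bp = 415592
gene_length_bp = 1275
protein_length_aa = 424

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the span includes the stop codon, which is not part of the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1275 0 424
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and gives the listed protein length.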
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.326353 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTATAA AAATAACCAA AGAAGCAGTA TTAGATTATC ATGCACAAAA GCCTGCAGGT AAACTAGGTA TACATGCCAC CAAGCCATTA CAAACTCAAT ATGACCTATC AATAGCATAT TCACCAGGAG TGGCCATCCC TTGCCAAGCT ATTGCCGAAG ATAAGCAACA AGTATATAAT TATACTGCTA AAGGAAACTT AGTAGCAGTT ATTTCTAATG GCACAGCCAT ACTAGGGTTA GGCAATCTAG GCCCTGAAGC AGCTAAACCT GTTATGGAAG GTAAGGCTAT TTTACTTAAA AAATTTGCAG GTATTGATGC GTTTGACATT GAAATTGACG CAACAGAGCC TGCGGATGTG ATACATATTA TCAAGGCTTT AGCACCTACC TTTGGCGGTA TTAACTTAGA AGACTTTAAA GCACCTGAAT GTTTTGAAAT TGAAACTGCA TTAAAAGAAC AATTATCTAT ACCAGTCATG CACGACGACC AGCATGGCAC TGCTATTATA GCAGGTGCTG CACTAAAAAA TGCACTCTTG TTGGTAAAAA AAGAGATTGG TAATATTCAA GTAGTTATCA ACGGTGCTGG TGCTGGTGCT ATTGCATGTG CTAAGCTTAT TGTAGCATTG GGTGTAAAAC CTGGTAACTT GGTAATGTGT GATACACAAG GGGTTATTCG CAAAGATAGA GAAGAGCTGG CAGGAGAGAA ATCAAGATTC GCTACTGATA GGTCTGTCCA TACTTTAGTA GAAGCCTTGA AAGGAGCTGA TGTGTTTATG GGACTTTCAA AAGGCAATAT CCTACAGCCA GAACATATTC TTGACATGGC AGAGCGTCCT ATTGTATTTG CTTTAGCCAA TCCAAATCCA GAAATTAATT ATGATTTGGC AGTGAACACA CGGAAAGATA TTATCATGGC TACGGGAAGA TCTGATTATC CTAATCAGAT TAACAATGTG CTAGGGTTTC CTTATATTTT TAGAGGAGCG TTAGATGTAT GGGCTACAGC TATTAATGAG CCTATGAAGC TAGCAGCTGT AGAAGCTTTA GCCGCACTTG CACAACAGCC TGTTCCCAAT CAAGTGAAAA AGGCTTATGG GGTAGAGTCG CTTGAATTTG GATCTACTTA TATATTACCT AAACCAATAG ACCCTCGACT TATAACAACT GTTTCTCCAG CTGTAGCACA GGCTGCCATA TCATCAGGAG TAGCAAAAAA ACATATAGGT GACTGGGAAG CTTATAAAGA ATCTCTCAAA CAATATATTG AATAA
Protein sequence | MSIKITKEAV LDYHAQKPAG KLGIHATKPL QTQYDLSIAY SPGVAIPCQA IAEDKQQVYN YTAKGNLVAV ISNGTAILGL GNLGPEAAKP VMEGKAILLK KFAGIDAFDI EIDATEPADV IHIIKALAPT FGGINLEDFK APECFEIETA LKEQLSIPVM HDDQHGTAII AGAALKNALL LVKKEIGNIQ VVINGAGAGA IACAKLIVAL GVKPGNLVMC DTQGVIRKDR EELAGEKSRF ATDRSVHTLV EALKGADVFM GLSKGNILQP EHILDMAERP IVFALANPNP EINYDLAVNT RKDIIMATGR SDYPNQINNV LGFPYIFRGA LDVWATAINE PMKLAAVEAL AALAQQPVPN QVKKAYGVES LEFGSTYILP KPIDPRLITT VSPAVAQAAI SSGVAKKHIG DWEAYKESLK QYIE
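The sequences above are printed in space-separated groups of ten characters. As a hedged illustration of how the gene sequence relates to the protein sequence and to the GC content field, the sketch below strips the spacing, computes the GC percentage, and translates codons up to the first stop. The helper names `clean`, `gc_percent`, and `translate` are hypothetical and do not come from the record. The codon assignments are those of the standard genetic code, which translation table 11 shares with table 1; the special handling of alternative start codons in table 11 is not implemented here.

```python
# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as table 1; the two tables
# differ in which codons may act as start codons, which this sketch ignores.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace (the record prints groups of ten) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        return 0.0
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        # Codons with ambiguous or unknown bases are reported as "X".
        amino_acid = CODON_TABLE.get(s[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base groups of the gene sequence listed above.
prefix = "ATGAGTATAA AAATAACCAA AGAAGCAGTA"
print(translate(prefix))  # prints: MSIKITKEAV
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.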