Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1223 |
Symbol | |
ID | 6376909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1562264 |
End bp | 1563568 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642682319 |
Product | hypothetical protein |
Protein accession | YP_001958277 |
Protein GI | 189502560 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000959128 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACTTTA ATTCATTCCA GCAACTGATA GCACGTCTTC TACTTATAAG CTTATTCTTA CAAAGCTGTG GTGGAGGATT CGACAATAAC CCACTTATTC CTACCGGGGA AGAGCAAGTA GCATCTATAC AAACTACTAC ACAAGCAATC CTTCCTCGAG CAGATATCCA GCCTTTGACA GGTCAAGTAT TGACAGCAGA AGGTGGCCAT GCTGTTACTT TCTATAAGGA AGCAGGTGAG TTAAAAGCTA ATGTAGCAAT GGACGTACCT GAAGGATTTA GTAAAACCTA TGAGGGAGTG GAAGTATTAT TAGAGCAGGG AGCAGAGTTA TCGGACCTAC CTCGATTAAG TGAGCAAGCA CAACAACGAC GTATTTATCT TCAACCAGCA CAAGGCAACC AGCCAGCTAA AGTAGTTATC TATAAAGGAG TAGGATTGAT GGGAGGAGGG AGTAGTGAAG ACGAAGGAGA GGAAGAAGGA ACATATCAAC TGGTGGTGGA GAGCGGAGAA AAGGAAGCCG AAGAAATTGA GCAAGAAAAA GAAAAGCTAC AAATAATTAG ACATACTAAA AGGGGAGTTG CTGAAGCACA CTATCATTAT AATTTATGGA GAGAGGATTT TTTAAAGCTA CAACCTTTAA CTTCTGTAAG AATACAGCCT GAGAAAGTTC CAATGCAGGA ATTAATGAAG CTTTTTGAAA TTAAAGAAGG AGAGTTTTTA GAGAAACAAG TGAAGGAAGT TTTGGAAGAT GCAGATGGTA TCATTCCAGA ACCAGAACCA GAACCATTTA ATAAAAGACA TAAAATATAT GGACAGTTCA TTATTTGTGA GCAGACCCTA CCAGCAGATG ACGGGGGATA CCCAGGTAAT CTTCCAGATT TTAAAAAACG AACGAGGGTT GGATTTAATG GTGATGGGTT ACTTTATCAA AACAATAGTA TTTTAGATTC CTCTGAACAA GCACTGTGGG CTTTAAATCC AAAAGGTAAG ATGTGCATCT TTTTTAGAGA TAGACATCCT GATATTCCCA GTCAAGTACA TCACACTTTC TTTTTCAAAA CAAGTGGTAT TGGCAAACCT GTTGCATGTA GTGGTATTAT TAGAGTTTGC AAAGGTAAAA TTGTGAGTAT TGATAATGAT AGTGGTAGGT ATCAGCCAAG CGTTACTCAG TTGCTGTTAG CAGCAAAATA TCTATTTAAT AAAGGTATTT TAGATCCTAC TATAAGCGTC AATGATGTAG TAAGAGATAA AAGTTTTACA TTAAAAGAAA TGCTAATTTT TGCACATTCT CTTGACCTAA CCTAA
|
Protein sequence | MNFNSFQQLI ARLLLISLFL QSCGGGFDNN PLIPTGEEQV ASIQTTTQAI LPRADIQPLT GQVLTAEGGH AVTFYKEAGE LKANVAMDVP EGFSKTYEGV EVLLEQGAEL SDLPRLSEQA QQRRIYLQPA QGNQPAKVVI YKGVGLMGGG SSEDEGEEEG TYQLVVESGE KEAEEIEQEK EKLQIIRHTK RGVAEAHYHY NLWREDFLKL QPLTSVRIQP EKVPMQELMK LFEIKEGEFL EKQVKEVLED ADGIIPEPEP EPFNKRHKIY GQFIICEQTL PADDGGYPGN LPDFKKRTRV GFNGDGLLYQ NNSILDSSEQ ALWALNPKGK MCIFFRDRHP DIPSQVHHTF FFKTSGIGKP VACSGIIRVC KGKIVSIDND SGRYQPSVTQ LLLAAKYLFN KGILDPTISV NDVVRDKSFT LKEMLIFAHS LDLT
|
| |