Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1251 |
Symbol | |
ID | 6377361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1598933 |
End bp | 1601890 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642682346 |
Product | hypothetical protein |
Protein accession | YP_001958302 |
Protein GI | 189502585 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAGAA CAAGACATAT ACTTTATTTT ATAGGTAGCT TCTTATTAGT TTCCTGTGAT CGGGATATAC CTATCACAAC TTCAGCTCCA AAAGATATTC GTAAAACTAC AATTTTACAA ACAGAAGAAG CTGGTAGCGA TTTGATTGCC TTAAATGAAA ATATGCTTGT GCAAGCTAAT GAAAAGGAAG TAGCTAAAGC AAGCACACCT GTCGATCAGC TAAAATATAT AAGCCAACTT CCGGAAGAGA CAGTGCTGCA AGACAAACAA GCACAAGGAA CTATAATCCA AGCGATTGCT CAAGGCTGGC AGAACATTAA TCAGTATGTC AAAGAAGCCT ATATACCCTT TATCTTAGAG CATGTTGAAA AGGAAGATCC AGAAGCATTG ATCGCCTTAT TATGGTTGAT GGAGAAATAC ACAAAAAATC CTTTACTTAA ATCCTACAAG CTAAAATCAG AGGCTTATCA GAGGCTACTT ATAAAGCTAG AAGAATCTAC AGATCATACA AGTATAGATG GTATGCATGC CTATTATATA GGTGAAATAT ACAGTAGTAA GGCAATAAAA TACTTATCTC CAGATGCAAA CAAGGCCATA GCATATTACC AAAAAGCAGT CAGGATGGGA AATGCTAATG CTGCTCATGC GCTAGGGTAC ATTCATCATA AAGGAATAGA AGTGGAATTA GCACCTAATG CAGCAAAAGC TATAGAATAT TATGAGAAAG CAATCGGGAT GGGAAATACT AAGGCTGTTC ATGCACTGGG CTTTCTTTAT CATAATGGTA TGGAAGGTCA AATAGCACCT AATGCAGCTA AAGCTATAGA ATATTATGAA AAAGCAATCG GGATGGAAAA TGCTGGAGCT GTTCATGCGC TGGGGTACCT TTATCATAAT GGTATGGAAG GTCAAATAGT ACCTAATGTA GCAAAAGCTA TAGCATATTA TGAAAAAGCT ATTGACATGG GATATGCAGA TGCTGCTCAT AACCTTGGCT TTCTTTACCA TAATGGTATA GGAGATCAAT TAGAGCCTAG TGCAGCTAAA GCAATAGAAT ATTATGAGAA AGCTATTAGT ATGGGAAATA CTGATGCTGC TCATAACCTT GGCATTGTTT ATGAGAGAGG AATAGAAGGC CAATTAGTGC CTAATGCATC TAAGGCTATA GAATATTATG AGAGAGCTAT TAATATGGGA GACGTTACGG CTGTTCATAA TCTTGGCATC CTTTATGCAA AAGGGATGAA TGGTCAGTTA GCACCTAATG TAGCAAAAGC CATAGAAGCT TATGAGAAAA CTATCAAGCT AGGAAATGCT GAGGCTGCTA CCGATTTAGG CATTCTTTAT GCAGAAGGGA TAAAAGGCCA ATTAGCGCCT AATGCAGCAA AAGCCATAAA ATCATATGAG GAAGCTATTA AGCTAGGAGA TTTTAGGGCC GCTACTAATC TTGGCTCTCT TTATCATCAT GGCATGGAGG GACAATTAAC GCCTAATCAA GTAAAAGCTA TAGCATATTA CGAGAAAGCA GTTAGCATGG GAGATGCTGA AGGTGCTTAT ACCCTTGGCG TTCTTTATGA GAAAGGAATG AAAGAACATT TAGCGCCTAA TGCAGTAAAA GCTATAGAAT ATTATGAAAA AGCTATTAAA CTCGGAAGAA CTGATACTGC TAATAATTTA GCCGTTCTTT ATCATAGGGG TATGCCAGGT CAATTAGCAT TCAATGCAGT TAAAGCTATA GAATATTATA AGTTAGGTGT TGAGTTAGGT AATGCTGATG CTGCTACTAA TCTTGGCATC CTTTATCATA ATGGTATGCC AGGTCAGTTA GTATCCAATT CAACTAAAGC TATAGCATAT TATGAAAAAG CAGTTAGTAT GGGAGATGCT AAAGCTGCAT ATGGTCTTGG CATTCTTTAT GATAATGGCA TAGAAGATCA ATTAACGCCT AATACAACAA AAGCGATAGC ATATTATGAA AAAGCAGTTA GTATGGGATA TGGAGGCGCT GCTAATAGCC TTGGAGCTCT TTATGCGAGA GGAATAAAAG GTCAATTAGC GCCTAATAGA GCAAAAGCTA TAGCATATTA TGAAAAAGCA GTTAGTATGG GAAATGCTGA TGCTGCTCGT AACCTTATCA CCCTTTATGC GAAGGGTAAA AGGGGGCAGT TAACTTCTAA TAAACAATTA ACTTTTAAAA CATATTATGA TACTTATCTA GCAAATAATA AGTCTGATTA TGTTAAAGAG GGTTTGCTAA AATTTCTAGT AAGCAATCCA TCAATAAAAG TTGATCCCTC TAATATAGCA GAGTCCCAGC AAGGGTTAGA GACTTTTAGG GAAAATACAG AAACACTTAC CGGGCTTATT CTCCTCAAGC AAGAGGAAAA TAATGCCACT TCTCAAGCTA TGCAATTTAA GGATTTCTAT ATTATTCCGG AGCTTTACCC TTGCTATAGT GCACTTATAG AATATCTAGA TAAAGTTAGA AATATAATAC CTTATCTATC AAAATATGGG GTTATGGTTG ATTGTATTAA AATTAAAAAG AATGGAAAAA AAAGAAAAGT TATAGCAGAT GGTGATCATC TTCATACATA TTTGATAGGT GGTCAATCTT ACATATGTTT AGGAGAGAAT AATGTGAAAG CAGGTAAAAT GTTAATGAGC CTTTTGGAAG AGGAAAAAGA TGTGAATCAA ACTGTGGGTT CTATAAGAAA GATGATGATG CAGGGTTCTT CTACAGAATT AGGGCTACGT TTATATAAGC AAGCATTAAA GAAATTACCC TCAGATTATA CAGGGCCAGT GGAAGAATAT ATTGCTACTA GGACTACAGA GACACTTGGT ATGCTAGATA ATATGCAAGC TTTGTGCATA CAATTAAAAG ATATAGTTAT AGCAACAACT TCTCTTAGAA ATAAAAGCTC TATGGAGCTT TATCATTTCT TACAGTAG
|
Protein sequence | MHRTRHILYF IGSFLLVSCD RDIPITTSAP KDIRKTTILQ TEEAGSDLIA LNENMLVQAN EKEVAKASTP VDQLKYISQL PEETVLQDKQ AQGTIIQAIA QGWQNINQYV KEAYIPFILE HVEKEDPEAL IALLWLMEKY TKNPLLKSYK LKSEAYQRLL IKLEESTDHT SIDGMHAYYI GEIYSSKAIK YLSPDANKAI AYYQKAVRMG NANAAHALGY IHHKGIEVEL APNAAKAIEY YEKAIGMGNT KAVHALGFLY HNGMEGQIAP NAAKAIEYYE KAIGMENAGA VHALGYLYHN GMEGQIVPNV AKAIAYYEKA IDMGYADAAH NLGFLYHNGI GDQLEPSAAK AIEYYEKAIS MGNTDAAHNL GIVYERGIEG QLVPNASKAI EYYERAINMG DVTAVHNLGI LYAKGMNGQL APNVAKAIEA YEKTIKLGNA EAATDLGILY AEGIKGQLAP NAAKAIKSYE EAIKLGDFRA ATNLGSLYHH GMEGQLTPNQ VKAIAYYEKA VSMGDAEGAY TLGVLYEKGM KEHLAPNAVK AIEYYEKAIK LGRTDTANNL AVLYHRGMPG QLAFNAVKAI EYYKLGVELG NADAATNLGI LYHNGMPGQL VSNSTKAIAY YEKAVSMGDA KAAYGLGILY DNGIEDQLTP NTTKAIAYYE KAVSMGYGGA ANSLGALYAR GIKGQLAPNR AKAIAYYEKA VSMGNADAAR NLITLYAKGK RGQLTSNKQL TFKTYYDTYL ANNKSDYVKE GLLKFLVSNP SIKVDPSNIA ESQQGLETFR ENTETLTGLI LLKQEENNAT SQAMQFKDFY IIPELYPCYS ALIEYLDKVR NIIPYLSKYG VMVDCIKIKK NGKKRKVIAD GDHLHTYLIG GQSYICLGEN NVKAGKMLMS LLEEEKDVNQ TVGSIRKMMM QGSSTELGLR LYKQALKKLP SDYTGPVEEY IATRTTETLG MLDNMQALCI QLKDIVIATT SLRNKSSMEL YHFLQ
|
| |