Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1714 |
Symbol | |
ID | 8999446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1048757 |
End bp | 1051432 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003573127 |
Protein GI | 294661251 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.804907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCAAGCT ATGCTTATAT TATATTAGTT AGCTTAAATA GGATAAGGCA AGCAATACAA AATATAAATA AGAACATTTT CGTTCTTTTG AAGAATATAA TCATATACCT CAATACAATA ATCGTTAACA AGCATACATG GTTTATAAGC AATAAATCAG TAAGTTTGCA ACAACATATT ATATCTTATT TATGTCTAAG CCCTTCAAAA ATCAATCCTA TGCTTCGCCT GACCTACATA AGAATTATCA GCTTTCTCCT AGCATTTATA TGCTTCGGTT GCCAAAATTA TCATAATCCT AATTTAGAAG AACAGCTACA ATTAGAGCGG CTTGAAGCAA GGATTCGAGA GTTAGAAGAA AAACTAAAAC TTAACAGAGA AAAAACAGAT AAACATTCAG CATCTGTCTA TAATTTATCT CAACAGATAG ATAAAATAAG TAAGAAACTA GCTAAACTGT TAGCGCATCT TAATATTACC TCTACCCAAG ATTTGACAAC CTTAGAGACT GAGCTATCCA CAATAAAAGC TGACATTGTT CCAATAAAAG ATCAACTGGC AGCATTACTA CAATCGATAG ACTATTATGT AAATCAAACG CAGCTACAAG AAGAATTAGA AAGCTTTGAA GATAAGCTTC AAGAGCTGTT AAAGGAAATA AGACAAGCTG CTAAATCTCA TACAGATACT AGTAGTATAG AACAAAAAAT TGAAGGGTTA CAAAAAGCCA TTGATAAGGT TAAAGAAGAA CTAACAGAAT TTATTAGCGC ACAGGCTCAA CAACAAGCCA AGGCAGCAAA AGACCAAGCT AGTACTGCTA GTGGCGATGC AAAAACAGCT AGCGATGAGG CAGAAAAAGC TCAACAACAA GCCGAGGCAG CAAGAGATCA AGCTAATACT GCTAGTGAAG AAGCAAAAGC AGCTAGAAAT GAAGCAGAGA AAGCTCAACA ACAAGCCGAG ACAGCAAGAG ATCAAGCCAA TACTGCTAGT AAGGAAGCAA AAACAGCTAG AAATGAAGCA ATAAATGCCC AAGAAGCTAT AGAAAAAGCT CAACAACAAG TCAAAAGTAA TGTAGAGGCT GCTAACAAAG CTGTTGAACA AGCTAATACT GCTAGTAAGG AAGCAAAAAC AGCTAGCGAT GAAGCAATAA AGGCCCAAGA AGTTACAGAA AAAGCTCAAC AACAAGCCGA GGCAGCAAGA GATCAAGCCA ATACTGCTAA CGATAAAGTA ATAAAGGCCC AAGAAGCTAC AGAAAAAGCT CAACAACAAG CCAAAGCAGC AAAAGAACAA GCTAGTACTG CTAGTAAGGA AGCAAAAACA GCTAGCGATG AAGCAATAAA GGCCCAAGAA GTTACAGAAA AAGCTCAACA ACAAGCCGAG GCAGCAAGAG ATCAAGCCAA TACTGCTAAC GATAAAGTAA TAAAGGCCCA AGAAGCTACA GAAAAAGCTC AACAACAAGC CAAAGCAGCA AAAGAACAAG CTAGTACTGC TAGTGGCGAT GCAAAAACAG CTAGTGATGA AGCAAAAAAA GCTCAACAAC AAGCCAAAGC AGCAAGAGAT CAAGCCAATA CTGCTAGTGA AGAAGCAAAA GCAGCTAGAA ATGAAGCAGA GAAAGTTCAA CAACAAGCCG AGGCAGCAAG AGATCAAGCT AATACTGCTA GTGAAGAAGC AAAAGCAGCT AGAAATGAAG CAGAGAAAGC TCAACAACAA GCCGAGGCAG CAAGAGATCA ATCCAATACT GCTAGTGGCG ATGCAAAAAC AGCTAGTGAT GAAGCAAAAA AAGCTCAACA ACAAGCCGAG GCAGCAAGAG ATCAAGCTAA TACTGCTAGT GAAGAAACAA AAGCAGCTAG AAATGAAGCA GAGAAAGCTC AACAACAAGC CGAGGCAGCA AGAGATCAAG CTAATACTGC TAGTGAAGAA GCAATAAAGG CCCAAGAAGC TACAGAAAAA GCTACCAAAC AAGCTAAAGA TGATGCAGAG ACTGCTACAA ACGCTGCTAC ACAAGCCAAT ACTGCTAGTG AAGAAGCAAA AACAGCTAGA AATGAAGCAA TAGAGGCTCA ACAAGCTGCA GAAAAAGAAG CTACTAAAGC CATGAAGCAA GTAGAGCAAA TTAAAAAGAA AGCTCAAGAA AAAGCACAAC AAAAACAAGC TAAAAAGTTA GCCAAAGAAG AAACTGCTAG AAAAAAAGCT GAACAAGAAG CTATCGAAGA AGATAAAAAG CAGGCAGATT TGGTAGCAAA AGTTAAAGAG GAAGCTATTA AAGTCGCTAA AGAAGCTGTT AAGAAGCAAG TAGAAGATGC TACAGAGCAA GCTAAAGAGG CTAAAAAACA AGCAGATTTA GCAATAAAAG CTAAAGCAGG AGCTATCGAA GAAGCTGAAA AAGCTGCTAC ACAAGCTAAA GTACATGCCG AGATTGCTAC AAACGCTGCT GCTGAACAAG TTAATAAGTC TATCAATAAA CTTCAAGAAC AAATTAATGA AGCTCTTAAA CAATCCGAAG AAGTTAATAG TGATAAAATA TCCGAAGCCA AGTCTGATAT GCAACAAGAA TTAAACACTA AATTAAATAG CTTGAATACT TTAATAAATC GCGTAAAAAA TGAAATTATT GGACTTATTA ACGAAAAAGG AGATGAGGAT AAATCTTGGC TTAAAAGGTT TTTTAATATT GGTTAA
|
Protein sequence | MASYAYIILV SLNRIRQAIQ NINKNIFVLL KNIIIYLNTI IVNKHTWFIS NKSVSLQQHI ISYLCLSPSK INPMLRLTYI RIISFLLAFI CFGCQNYHNP NLEEQLQLER LEARIRELEE KLKLNREKTD KHSASVYNLS QQIDKISKKL AKLLAHLNIT STQDLTTLET ELSTIKADIV PIKDQLAALL QSIDYYVNQT QLQEELESFE DKLQELLKEI RQAAKSHTDT SSIEQKIEGL QKAIDKVKEE LTEFISAQAQ QQAKAAKDQA STASGDAKTA SDEAEKAQQQ AEAARDQANT ASEEAKAARN EAEKAQQQAE TARDQANTAS KEAKTARNEA INAQEAIEKA QQQVKSNVEA ANKAVEQANT ASKEAKTASD EAIKAQEVTE KAQQQAEAAR DQANTANDKV IKAQEATEKA QQQAKAAKEQ ASTASKEAKT ASDEAIKAQE VTEKAQQQAE AARDQANTAN DKVIKAQEAT EKAQQQAKAA KEQASTASGD AKTASDEAKK AQQQAKAARD QANTASEEAK AARNEAEKVQ QQAEAARDQA NTASEEAKAA RNEAEKAQQQ AEAARDQSNT ASGDAKTASD EAKKAQQQAE AARDQANTAS EETKAARNEA EKAQQQAEAA RDQANTASEE AIKAQEATEK ATKQAKDDAE TATNAATQAN TASEEAKTAR NEAIEAQQAA EKEATKAMKQ VEQIKKKAQE KAQQKQAKKL AKEETARKKA EQEAIEEDKK QADLVAKVKE EAIKVAKEAV KKQVEDATEQ AKEAKKQADL AIKAKAGAIE EAEKAATQAK VHAEIATNAA AEQVNKSINK LQEQINEALK QSEEVNSDKI SEAKSDMQQE LNTKLNSLNT LINRVKNEII GLINEKGDED KSWLKRFFNI G
|
| |