Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0703 |
Symbol | |
ID | 6376922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 895729 |
End bp | 898017 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642681854 |
Product | hypothetical protein |
Protein accession | YP_001957821 |
Protein GI | 189502104 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACT ATTATAGTAT TAAATGTCAA TTCTTAGCCT ATAGCTTCAT GATAAGCTTA TGTTTGCAGA ATTGTACCCA TTCTACTTCT TTGCAATCTA TTGAAGATAG CCAAGCAAAC ACTCAGCAGC TCATACTCAT AGGTAAAACA CTTGTTTCCA AACAAGGACA TAAGGTTAGT CTTATCCAAG AACAAAGAAA ACTACAAGCT AGTGTAATAG AAAACTTGCC ACAGGGGTTT AGTAGAAGCT ACCTAGCACC TGTTATAGTA AATCCTGTTC TCAATTTATC GCTAGCCTCT GTTAACAATA CAGAATGGCA AAAGAGCCAT ACTCATATTA TCAACGATCA AGTGTATATA ACACGCTTAG GCTTATTAGG AGGAGGTAAA AGAGGAGAAG AGACAAAAAA AATACAGTAT CCTTTACATG AAGCGGTTAT CAAATGGGAT AAAAACCAGA TACAACAGTT ATTAAACTCT GATATTGACA TTAACCTTAA AAATGAAGAA GGAGATACGT TTCTACACCT AGCTATTAAA CAAATCAAGA TATTACTTAA TAAGCGCTTA GCAGAATTAG GAATTCATAT TATTGATATA GAAAATATGG ACCGTACTAG TCTTCAATAC TTATCTATAG AGGCTATCAA AAAGGACTAT GTCCAAGAAG TAGCCGATCT TTTATTACCA TTACAAGAGA AATTAGCACT TGATTTAAAT GCTTGTAATA ATAAAAGGAA AACTCCTTTA CATATAGCTA GTGGCCAAGG ACATAAAGAA TTAGTCAAGC TATTGTTACA ATTAGGAGCT GATACGCATA AAAAGAATAA AGATGATAAC ACACCATTGC ATCTGGCCGC TGCATATGGC TATCCATCAA TAGTTAAGTT ATTGATCAAG AAGGGAGCTG ATATTAATGC TAAAAATACA GATGATGATA CACCATTGCA TCTGGCCGCT GCATATGGCT ATCCATCAAT AGTTAAGTTA TTGATCAAGA AGGGAGCTGA TATTAATGCT AAAAATACAG ATGATGATAC ACCATTGCAT CTGGCCGCTG TATATGGCTA TCCATCAATA GTTAAGTTAT TGATCAAGAA GGGAGCTGAT ATTAATGCTA AAGATAAGGA TGATGATACA CCATTGCATC TGGCTGCTGC ATATGGCTAT CCATCAATAG TTAAGTTGCT TATAGAGAAG GGAGCAGATG TAAATGCTAA AGGTGAAGAT GGTCAATCCC CACTCCACCT AGCTGCAGGA AGAGGACATA TCAATGTAAT AGAACTACTA TTAGAAAAAG GGGCAAATAT AAATATTAAG GAAAAAGGAG GCGGCCTACC AGTGCACTTT GCTGCTGTAA ATGGCAACTT GGAAGTACTA AAGCTGTTAT TACAAAAAGG TGCAGATATA AATGCTAAAA CTAAAGAGGG TCCTTCTTTA CTAGGCTTTT CTGCTGCGTT CGGCCATTTG GAAATAGTAG ACTTTCTATT AGAAAAAGGA GCTGAGATAC ATGACGGTTA TTGCACAGGC ATATACGAGG CCGCTGCATG CGGGCACTTG GAAATAGTAA AGTTGTTATT GAAAAGAGGA TTAGATGTGA ATGCTAAAGA TAAAAACGGG TGGACGCTAT TGCACTGGGC TACACAAGAA GGCCAAGTAG AAATGGTAGG GCTGTTATTA GCAAGAGGAG CTGATATACA TGCTCAGAAC ATAGAAGGTA GCTCTGCATT ACATATAACT TCTCAGGGAT GGCATACAGA AATAGTAAAA TTATTGCTAG ATAAAGGAGC TGATGTAAAT GTTAAGAATA AATCAGGAGT CGTTCCACTA CATGCAGCTT CAGAGGGTGG CAATATAGAA ACAATAAAAT TATTATTAGA GAGAGTAGCA GAGGTAAATG CTAATGAAGA AACAGGTTAT ACACCATTGG ATTGTGCTAC ACAAAAAGGA CATACAGAAG TAGCAAAGCT GCTATTAGAA AAAGGAGCAG ATATACATGT TAAAGATGAA GTAAGTCAGT CAGCATTACA CTGGGCTGTA CTCAAAGGAC GTGTAGGAGT AGTAAAGCTG TTATTAGAAC AAGGGGCAGA TATACAGGCT AAAAATATAG ATGGAGAAAC TTCATTTCAT TGGGCATGTC AAAAAGGACA TCTAGAAGTA GCTAAACTAT TAATACAAAA TGGAGCGGAT ATAAATGCTA AGGATAAATA TGGTAAGACT CCTATAGATA TTGCTAGGCA GAAAAAATAT AAAGCATTAG AGGAAATGCT ACTAGGAACA ATAAGCTAG
|
Protein sequence | MKNYYSIKCQ FLAYSFMISL CLQNCTHSTS LQSIEDSQAN TQQLILIGKT LVSKQGHKVS LIQEQRKLQA SVIENLPQGF SRSYLAPVIV NPVLNLSLAS VNNTEWQKSH THIINDQVYI TRLGLLGGGK RGEETKKIQY PLHEAVIKWD KNQIQQLLNS DIDINLKNEE GDTFLHLAIK QIKILLNKRL AELGIHIIDI ENMDRTSLQY LSIEAIKKDY VQEVADLLLP LQEKLALDLN ACNNKRKTPL HIASGQGHKE LVKLLLQLGA DTHKKNKDDN TPLHLAAAYG YPSIVKLLIK KGADINAKNT DDDTPLHLAA AYGYPSIVKL LIKKGADINA KNTDDDTPLH LAAVYGYPSI VKLLIKKGAD INAKDKDDDT PLHLAAAYGY PSIVKLLIEK GADVNAKGED GQSPLHLAAG RGHINVIELL LEKGANINIK EKGGGLPVHF AAVNGNLEVL KLLLQKGADI NAKTKEGPSL LGFSAAFGHL EIVDFLLEKG AEIHDGYCTG IYEAAACGHL EIVKLLLKRG LDVNAKDKNG WTLLHWATQE GQVEMVGLLL ARGADIHAQN IEGSSALHIT SQGWHTEIVK LLLDKGADVN VKNKSGVVPL HAASEGGNIE TIKLLLERVA EVNANEETGY TPLDCATQKG HTEVAKLLLE KGADIHVKDE VSQSALHWAV LKGRVGVVKL LLEQGADIQA KNIDGETSFH WACQKGHLEV AKLLIQNGAD INAKDKYGKT PIDIARQKKY KALEEMLLGT IS
|
| |