Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1435 |
Symbol | |
ID | 6377501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1842930 |
End bp | 1847687 |
Gene Length | 4758 bp |
Protein Length | 1585 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642682506 |
Product | hypothetical protein |
Protein accession | YP_001958455 |
Protein GI | 189502738 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.015567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAA GTTTTTCGCT GTTCCAGCAA TACATATCCT ATGTTTTACT CATAAGTTTA TTCTTCCAGA GTTGTGGGGG AGGATTCGAC AATAATCCAC TCATATCTAT CAAAGAAGAG CAAATAGCAT CTATACAAAC TAGTACACAA CAAATAATCC CTCCAATAAA TATCCAGCCC CTATTTGATC AAGCATTAAT AGCCCAAGGG GGGCATAGTG TTACTTTTCA TGAAGAAGCA GGCAAATTAA AAGCAGACGT AGTAGTTAAT GCACCTCAAG GGTTTAGCAA AAGCTATGTT GGATTAGAGG TAGCTGTTGA GCAAGGTGCA GAGCTATCGG AGCTACCTAG ATTAGGCCAG CAAGCACAAA AGCGTCGCAT TCACCTTCAA CCAGCACATG CAGGAAAGCC AGCAAAAGTA ATTGTTTACA AAGGATCAGG GCTAGCGGGA GGGATGTTGG AAGGAGATGA ACAAGAAGAT GAACTAGATA ATGAATCCAT TCCTAACGAA TGTTTCTGTC CTATCACCCA GGAGATTATG GAAGATCCGG TTATTGCTCA GGATGGCCAT ACCTATGAGC GGATAGCAAT AAAACGCTGG CTTGATATGG GAAAACGAAC CAGCCCTAAA ACTGGGGCCA GGTTGCTTAG TACCGAATTA ACACCTAACT ATACCATGCG TAGTTTAATT CAGGATATAA AAGCACAGGT GCCTGTTTTA GCCAGGCATA AGCTGGATAT GCACAATATT GAAGCAGCTA TTAAATTAAG AGAAGAAGAG ATAGAAGAAA CATTGGCACA GAAGGGATAT ATAGTAGAAA AGGAAAGCCA GGCGAGGTTA AGTTTAGAAG CAGAGTTAGA AGAAAAAGAA ACAGAGTTAG AAGAAAAAAT AGCCTTACTC GGTGTAATAG AGCAGCGTCT AAAAGGATTA GAGGAACAAG CCAATGCCTT TAAATTGCAG GAAAAATTGC TGAACAAGAA ATTGATGCAA AAAGGACAGC TAATAGAAAA AGGAAGCCAG GAAAAGTTAC ACTTAGCAGC AGCCTTAGAA CAGAAAGACA AAGAATTAGC GGAACAAAAA GCCTTACTCG ATGTAATGGA GCAGCGAGTA GCAGGATTAG AGGAGCAAGT TCATTCTTTT TCAGAACGAG ATAACCAAAT GCGTGCAATC ATGCTACAAA TGCAGCAATG TATGGGCTAC CCATTAGGTG GTCAGCTTGT TCCATCTTCT AGCAGCTCAA GCAGTACGCT TGTTTTGAAT GGTAACCAAT CGTTACAAGA AAATAATGGA CTAGTTAAAC CAAGCCTATC TGTAGTAAAG AATGCAGGAA ATAGAGAAGT AAGAGTAGAA AGAAATATAA AAAAAGATAA AGGAAAAGAG AAATTAAAGG ATGAAGATAA TCAATCAGAT GGTATAGCTT TATTAGGTCA AGCTTCGGCA ATAGTTGAAG ACAAATATGT GCAGCTTCAG CAAGCAGGGG TAAAATTAGC TGACAGGCTA GTAAAAGAAA GAAAGTACCC ATTACATAAA GCTTGCAGGA TTGGTAATTT AGAGGCAGTT AAATACTTAA TAGAAAAAGG AGTAGATATA CACGCTAAGA ATAAACATGG TAATACTCCA CTTTGTTATG CATGCGACAA GGGTCATCTA GAAGTAGTTA AGTATTTAGT AGAGAAAGGA GCAGATATAA ATGCTACAGA TGAAGATGGT GAGACTCTAC TTCATTGTGT ATGCAAAAAT GATAATATAG AATTAGTTAA GTATTTAGTA GAAAAGGGAG TAGATATAAA TGTTATAGAT GGATATGGTG TAACTCCACT GCATTATGCA TGCCGAGATG GTAATCTAGA GGTAGTTAAG TATTTAGTAG AGAAAGGAGC TGATATACAG GCTAAGAATA AAGATGGTGA GACTCCGTTT CATTGGGCTC ACGACAATGA TCATCTAGAA GTAGTTAAAT ATTTATTAGA GAAAGGGGCA AATATACAGG CTAAGAGTAG AGAGAGTGAG AGTCTACTTT ACTGGGCATG CCGAGAAGGT GATCTAGAAG TAATTAAGTA TTTAGTAGAG AAAGGAGTTG ATATACAGGC TACGAATGAA GATGGTGAGA CTCTACTTCA TTGTGCATAC AGCAATAACC ATTTAGAATT AGTTAAGTAT TTAGTAGAAA AAGGAGCAGA TATAAATATT ACAGATGGAG ATGGTGCAAC TCTACTTCAT TGTATATGCA AAAATGATAA TATAGAATTA GTTAAGTATT TAGTAGAGAA AGGGGCAGAT ATAAATATTA CAGATGGAGA TGGTTGGACT CCACTTCATT ATGCTTGCGA GAATGGTGAG CTAGAAATAG TTAAGTATTT AGTAGAAAAA GGAGCAGATA TAAATGTTAT AGATGGATAT GGTGTAACTT CACTGCATTA TGCATGCCGA GAAGGTAATC TAGAGGTAGT TAAGTATTTA GTAGAAAAAG GAGCAGATAT AAATGCTACA GATGAAGATG GTGAGACTCT ACTTCATTAT GCTTGTAACA AAGGTAATTT AGAAGTAGTT AAATTATTAG TAGATAAAGG GGCGGATATA AATATAAAAA GTAACGATCA ATGTACTGCT TTACATTTTG CTACTAGATA TGATCACTTA GAAATAGTAA AGTACTTACT AGATAAAGGG GCAGATATAC AGGCTAAGAA TAAAGAGGTT GAGACTCTAC TTATTTATGC ATGTAAAAAA GGTGATCTAG AAGTAGTTAA GAACTTAGTA GATAAAGGGT CAGATATAAA TGTAAAAAAT AAAAATCAAT GGACTGCTTT ACATTTTGCT ACTAGATATG GTCACTTAGA AATAGTAAAG TACTTACTAG ATAAAGGGGC AGATATAAAT GTAAAAAATA ACGATCAATG GACTGCTTTA CATTTTGCTA CTAGATATAA TCACTTAGAA ATAGTAAAGT ACTTACTAGA TAAAGGGGCA GATATAAATG TAAAAAATAA CGATCAATGG ACTGCTTTAC ATTTTGCTAC TAGATATAAT CACTTAGAAA TAGTAAAGTT ATTACTAGAG AAAGGGGCAG ATATAAATGC TAAGAATAAA TATGGTAATA CTACGCTTCA TAAGGCATGC GAGAATGGTC ATCTAGAAGT AGTTAAGTAC TTACTAGATA AAGGGGCAGA TATAAATGTA AAAAATAACG ATCAATGGAC TGCTTTACAT TTTGCTACCA GATATAATCA CTTAAAAATA GTAAAGTTAT TACTAGATAA AGGGGCAGAT ATAAATGCTA AGAATAAAGA GGGTAATACT ACGCTTCATA AGGCATGCGA AAATGATCAC TTAGAAATAG TAAAGTTATT ACTAGATAAA GGGGCAGATA TAAATGTAAA AAATAACGAT CAATGGACTG CTTTACATTT TGCTACTAGA TATAATCACT TAGAAATAGT AAAGTACTTA CTAGATAAAG GGGCAGATAT AAATGTAAAA AATAATGATC AATGGACTGC TTTACATTTT GCTACTAGAT ATGATCACTT AAAAATAGTA AAGTACTTAC TAGATAAAGG GGCAGATATA AATGTAAAAG ATAATGATCA ATGGACTGCT TTACATTTTG CTACCAGATA TGATCACTTA AAAATAGTAA AGTTATTACT AGAGAAAGGG GCAGATATAC ATGCTAAGAA TAAAGAGAGT GAGACTCTAC TTATTTATGC ATGTAAAAAA GGTGATCTAG AACTAGTAAA GTACTTACTA GATAAAGGGG CAGATATAAA TGTAAAAAAT AACGATCAAT GGACTGCTTT ACATTTTGTT ACTAGATATA ATCACTTAGA AATAGTAAAG TACTTACTAG ATAAAGGGGC AGATATAAAT GCTAAGAATA AATATGGTAA TACTACGCTT CATAAGGCAT GCGAAAATGA TCACTTAGAA ATAGTAAAGT TATTACTAGA TAAAGGGGCA GATATAAATG TAAAAAATAA CGATCAATGG ACTGCTTTAC ATTTTGCTAC TAGATATAAT CACTTAGAAA TAGTAAAGTA CTTACTAGAT AAAGGGGCAG ATATAAATGT AAAAAATAAC GATCAATGGA TTGCTTTACA TTTTGCTACT AGATATAATC ACTTAGAAAT AGTAAAGTAC TTACTAGATA AAGGGGCAGA TATAAATGTA AAAAATAACG ATCAATGGAT TGCTTTACAT TTTGCTACCA GATATAATCA CTTAAAAATA GTAAAGTTAT TACTAGATAA AGGGGCAGAT ATAAATGTAA AAAATAACGA TCAATGGACT GCTTTACATT TTGCTACCAG ATATGATCAC TTAGAAATAG TAAAGTACTT ACTAGATAAA GGGGCAGATA TAAATGTAAA AAATAAAAAT CAATGGACTG CTTTACATTT TGCTACCAGA TATAATCACT TAAAAATAGT AAAGTTATTA CTAGATAAAG GGGCAGATAT ACATGCTAAG AATAAATATG GTAATACTCC ACTTCATAAG GCATGCGAGA ATGGTCATCT AGAAGTAATT AAGTATTTAG TAGAGAAAGG AGCTGATATA AATGCTAAGA ATAAAAATGG TAACACTCCA CTTCATAAGG CATGCGAGAA TGGTCATCTA GAAGTAGTTA AGTACTTACT AGATAAAGGG GCAGATATAC AGGCTAAGAA TAAAAATGGT AATACTCCTA TAGATATTGC TAAGCAGAAA AAATATGGAG CATTAGTTAA TTTACTAACT GAAAAGTTAG ATGCTTAA
|
Protein sequence | MKRSFSLFQQ YISYVLLISL FFQSCGGGFD NNPLISIKEE QIASIQTSTQ QIIPPINIQP LFDQALIAQG GHSVTFHEEA GKLKADVVVN APQGFSKSYV GLEVAVEQGA ELSELPRLGQ QAQKRRIHLQ PAHAGKPAKV IVYKGSGLAG GMLEGDEQED ELDNESIPNE CFCPITQEIM EDPVIAQDGH TYERIAIKRW LDMGKRTSPK TGARLLSTEL TPNYTMRSLI QDIKAQVPVL ARHKLDMHNI EAAIKLREEE IEETLAQKGY IVEKESQARL SLEAELEEKE TELEEKIALL GVIEQRLKGL EEQANAFKLQ EKLLNKKLMQ KGQLIEKGSQ EKLHLAAALE QKDKELAEQK ALLDVMEQRV AGLEEQVHSF SERDNQMRAI MLQMQQCMGY PLGGQLVPSS SSSSSTLVLN GNQSLQENNG LVKPSLSVVK NAGNREVRVE RNIKKDKGKE KLKDEDNQSD GIALLGQASA IVEDKYVQLQ QAGVKLADRL VKERKYPLHK ACRIGNLEAV KYLIEKGVDI HAKNKHGNTP LCYACDKGHL EVVKYLVEKG ADINATDEDG ETLLHCVCKN DNIELVKYLV EKGVDINVID GYGVTPLHYA CRDGNLEVVK YLVEKGADIQ AKNKDGETPF HWAHDNDHLE VVKYLLEKGA NIQAKSRESE SLLYWACREG DLEVIKYLVE KGVDIQATNE DGETLLHCAY SNNHLELVKY LVEKGADINI TDGDGATLLH CICKNDNIEL VKYLVEKGAD INITDGDGWT PLHYACENGE LEIVKYLVEK GADINVIDGY GVTSLHYACR EGNLEVVKYL VEKGADINAT DEDGETLLHY ACNKGNLEVV KLLVDKGADI NIKSNDQCTA LHFATRYDHL EIVKYLLDKG ADIQAKNKEV ETLLIYACKK GDLEVVKNLV DKGSDINVKN KNQWTALHFA TRYGHLEIVK YLLDKGADIN VKNNDQWTAL HFATRYNHLE IVKYLLDKGA DINVKNNDQW TALHFATRYN HLEIVKLLLE KGADINAKNK YGNTTLHKAC ENGHLEVVKY LLDKGADINV KNNDQWTALH FATRYNHLKI VKLLLDKGAD INAKNKEGNT TLHKACENDH LEIVKLLLDK GADINVKNND QWTALHFATR YNHLEIVKYL LDKGADINVK NNDQWTALHF ATRYDHLKIV KYLLDKGADI NVKDNDQWTA LHFATRYDHL KIVKLLLEKG ADIHAKNKES ETLLIYACKK GDLELVKYLL DKGADINVKN NDQWTALHFV TRYNHLEIVK YLLDKGADIN AKNKYGNTTL HKACENDHLE IVKLLLDKGA DINVKNNDQW TALHFATRYN HLEIVKYLLD KGADINVKNN DQWIALHFAT RYNHLEIVKY LLDKGADINV KNNDQWIALH FATRYNHLKI VKLLLDKGAD INVKNNDQWT ALHFATRYDH LEIVKYLLDK GADINVKNKN QWTALHFATR YNHLKIVKLL LDKGADIHAK NKYGNTPLHK ACENGHLEVI KYLVEKGADI NAKNKNGNTP LHKACENGHL EVVKYLLDKG ADIQAKNKNG NTPIDIAKQK KYGALVNLLT EKLDA
|
| |