Gene Aasi_1288 details


Gene Information

Locus tag: Aasi_1288
Symbol:
ID: 6377432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1645106
End bp: 1647301
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 37%
IMG OID: 642682376
Product: hypothetical protein
Protein accession: YP_001958331
Protein GI: 189502614
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat; [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
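
The length fields above agree with the coordinates. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are assumed 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1645106, 1647301)
print(length, protein_length_aa(length))  # 2196 731
```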


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0920599
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAA CTTATACGCT GTTTCAGCAA TATATAGCAC GTCTTCTACT TATAAGTTTA 
TGCTTACAAA GCTGTGGGGG AGGATTCACC AATAACCCAC TTATTCCTAC CCGAAAAGAG
CAAATAGCAT CTACACAAAC TGATACACAA TCAATCCTTC CTCAAGCACA CATCCAACCG
CTAGTTGATA AAACATTGAC TACACAGGGA GGCCATACTA TTACGCTTTA TAAGGAAGCT
GGGGTGTTAA AAGCGGAGGT TGCAATGAAT GCCCCGCAAG GGTTTAGTAA AAGCTATGAT
GGATTAGGCG TATATATTGA GCAAGGATCA GAGTTATCGG CCCTACCTCG ATTAGACGAG
CAAGCACAGC AGCGCCGTAT CCACTTTCAA TTAGCACAAG GAGATAATCC AGCTCACATA
ATTATATATA AGGGAGCGGG TTTGATGGGG GGTGGTAATC CTGATCCTAC AGAAGAAAAA
GAAGAAGAAA GTGATGAAGA GAAGGAAACA TATCAACAGG ATCTACAAGC TCAGGAAGGT
AACGCTCAAC AGTTAACAAC TACCACTTCC TTACATACAG CTATTGAAAA AGGAGATATA
ATGGAGGTTA AGAAATTAAT AAATTCAAAA GTTGATGTAA ATGCTAGAAA TATTAAAGGA
TTATCCCCAC TGTATATAGC TGCTAGGCAA GGTTATTTAG AAATAATTGA GCTATTACTA
AACGCGGGGG CTGCTCCAAA TGATAAAGAT GAATACGGTT ATACTCCTTT ACACCTGGCT
ATGGAATATA ATCATATGGA AGTAGCTAAA CTATTAATAG AAAAGGGAGC TGATATAAAT
GCTACAGATA ATACTGGCAA TACTTTTTTA TATATGTCTA TTTTGGGATT TCAATTAGAA
ATGGCCAAGC AATTAATAGA ATTAGGAGCA GATATGAATG CTAACCTGTA TGCAGCTGTT
CAGAAAAACG ATCTAGAAGT AGCTAAGCAA TTAATAGTGT TAGGAGCAGA TGTGCATGCC
AAAGATAAAG ATGAAAATTC GCTCCTACAT TGGACTGTTA GAGATGAACA TATAAAAAAG
AAAAATCAAA TTAAAATAAT AAAACTGTTA CTAAAGTATG GAGCAGATAT AAATGTTAAG
AATAAATATC AAAATACCCC ATTGCATTGG GCTGTTAGAA ATGGACATAT AGAAATAGCA
AAGTTGTTAC TAGAAAAAGG AGTGGATGTA AATGCTCAGG GTGAATATAA CAATTATCCA
ATACATATGG CTGTTGGGGA GAATGTTGGG AAGGAAGAGA AACAAACAGA GATAGTAAAA
CTGTTACTAA AGTACGGAGC AGATATAAAT GTTAAAAATA AATATCAAAA TACTCCACTG
CATTGGGCTG CTAGAAATGG ACATATAGAA ATAGCAAAGT TGTTACTAGA AAAAGGAGTG
GATGTAAATG TTCAGGGTGA ATATAACAAC TATCCAATAC ATATGGCCGC TGTGAAAGGA
CATACAGAAA TAACAAGACT TCTAATAGAG AAAGGAGCAT ATGTGAATGT TAAAGGCAGT
AATGGTTGTG CTCCTTTATA CATAGCTAAT CGCAATGGTA GGACAGAAGT AGTAAAGTCT
TTAGAAACAA TAGGTATAAT GTATCTAAAT GATGAGAGCG TGGAAAGAGA TGATCAAAAG
GCTGTTGAAA GCCTTAAAAA AGAAGCTGAG CAAGGTAATG CTGTTGCACA ACGCAATCTG
GGATTTATGT ATCAGAATGG AAGAGAAGGA TTACCACAAG ATAATAGATT AGCAATAGAG
TGGTTTATAA AATCTGCTGA ACAAGGCTAT GTATATGGCC AAACTAATCT AGCGTGGATG
TATTATAATA GCAAGGGGAC AGCTCGAAAT TATCATGAAG CATTTAAATG GTATCAAAAG
GCCGCCGATC AAGGACATCC AAATGCTCAA TGTAGATTAG GCTGGATGTA CCAAAATGGA
AAGGGAGTTA GAAAAGATCA TACTAAGGCA TTCGAATGGT ATGAAAAAGC AGCTGAGCAA
GGCCATGAGA AAGCACAATT TGATTTAGGA GAAATATACC AGTATGGTTG GGGAGTAGCG
GAGAATTATA ATAAAGCCCT TGAATGGTAC AGAAAAGCAG CTGAGAATGG AGATCAAGCT
GCAAGAAAGA GAATTAGTTG GCTAACCAAA AAATAA
 
Protein sequence
MKRTYTLFQQ YIARLLLISL CLQSCGGGFT NNPLIPTRKE QIASTQTDTQ SILPQAHIQP 
LVDKTLTTQG GHTITLYKEA GVLKAEVAMN APQGFSKSYD GLGVYIEQGS ELSALPRLDE
QAQQRRIHFQ LAQGDNPAHI IIYKGAGLMG GGNPDPTEEK EEESDEEKET YQQDLQAQEG
NAQQLTTTTS LHTAIEKGDI MEVKKLINSK VDVNARNIKG LSPLYIAARQ GYLEIIELLL
NAGAAPNDKD EYGYTPLHLA MEYNHMEVAK LLIEKGADIN ATDNTGNTFL YMSILGFQLE
MAKQLIELGA DMNANLYAAV QKNDLEVAKQ LIVLGADVHA KDKDENSLLH WTVRDEHIKK
KNQIKIIKLL LKYGADINVK NKYQNTPLHW AVRNGHIEIA KLLLEKGVDV NAQGEYNNYP
IHMAVGENVG KEEKQTEIVK LLLKYGADIN VKNKYQNTPL HWAARNGHIE IAKLLLEKGV
DVNVQGEYNN YPIHMAAVKG HTEITRLLIE KGAYVNVKGS NGCAPLYIAN RNGRTEVVKS
LETIGIMYLN DESVERDDQK AVESLKKEAE QGNAVAQRNL GFMYQNGREG LPQDNRLAIE
WFIKSAEQGY VYGQTNLAWM YYNSKGTARN YHEAFKWYQK AADQGHPNAQ CRLGWMYQNG
KGVRKDHTKA FEWYEKAAEQ GHEKAQFDLG EIYQYGWGVA ENYNKALEWY RKAAENGDQA
ARKRISWLTK K
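
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a hand-rolled illustration, not the database's own tooling. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the start codons it permits, and this gene starts with ATG). The names `CODON_TABLE`, `clean` and `translate` are hypothetical. It handles only unambiguous A, C, G and T bases.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGAAAAGAA CTTATACGCT GTTTCAGCAA TATATAGCAC GTCTTCTACT TATAAGTTTA"
last_line = "GCAAGAAAGA GAATTAGTTG GCTAACCAAA AAATAA"
print(translate(first_line))  # MKRTYTLFQQYIARLLLISL
print(translate(last_line))   # ARKRISWLTKK
```

Both outputs match the first and last residues of the protein sequence listed above. Each full line of the gene sequence holds 60 bases, a multiple of 3, so every line starts in frame and can be translated on its own.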