Gene Aasi_0224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0224 
Symbol 
ID6376525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp249024 
End bp250238 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content34% 
IMG OID642681412 
Producthypothetical protein 
Protein accessionYP_001957397 
Protein GI189501680 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.505071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCTA ATAAAAGCAC TACCATCAAT CTTCACATTT GTATTAATCT TCTAAAGTTA 
TTATTAGTAT TGGTTACAGT AGTTGCTTGT GAATGTGTCC CAAAGGGAGG CAGAGGTGAT
ATAATAAATC CAGAAATTCC CAAAGCAAAC GATAAAAGCG AAGAACCTAT CACCAATGAT
AAGAACAAAA AGCCTATTCC CAAGGAGATG ATTGATGCTG CTGAAGCAGC AAAGAAAACC
AATTTAGTAA TTTTACTTAA AACGTTCCAG GAGCTAGAGG CCGAAGACAT TAATATTGTC
AATGGACTAC TTAGTTCGGA CCTGGATATT TTAGGTTTAA AAGAAGCTGT AGAATTTGGG
AACTTAGATA TTGTAGATAC AATACTAAAA AGGGGAGTAA ACGTAAACAA AAAGATACAA
GAAAACGAAA AAGATGAATT TATACTTCTA AATTATAGAA ATGATCTTAC ACCTTTACAT
ATAGCTGCTA AATCTGGCCA TACCGAAATA CTACTCAAAT TAATAGAAAA AGGTGCAGAA
CTAAATGCAA AGGATAAATA TGGTGATACT CCTTTACATC TTGCTGCTGA TGCTGGGCAT
GCTGATATAG TATTTAAATT GATACAAAAG GGAGCAAACA TAAAATCAGC AACTAACGAT
GGGTATACCC CTTTACATCT TGCTATCATG AAAGCACATA CGGAAATAGC GTTAAGTCTG
ATAGAGCAAG GAGCAAATCT TGACATATCA AGCATTGAAG GAGATACGGC ACTTAACCTG
GCAGCCAGGA AAGGCTATGC CAATATAGTG CTAAAATTAA TAGAAAAAGG GGCTGACGTA
AATATAAAGA ACAAAATAGG TTTACATCCT TTATATTATA TTATTCGAGA AGGGCATGGT
GATATAGCTT TAACATTAAT AGAAAAAGCA AAAGAAATAG AAGTAAATAC AATAGATAGG
CAGGGAAATA CTCTTTTACA TTTGGCTGTA TATGGAGATA TTAGGCTACT ATCAAAATTA
GTAGAAAAAG GGGTAAAAAT TGATATTACG AACAATAGCG GAAATACACC TTTACATATT
GCTGCTAAGT ATGGTTGTAA AGAAGCGGTA TCAGTATTAG TAAACTGCGG AGCAAAGAAA
GATATAGCAA ATAACGAACG CAACACCCCT TTAGATTTAG CTAAAACAGA AGAAATAAGA
GCTTTACTAA AATAG
 
Protein sequence
MQPNKSTTIN LHICINLLKL LLVLVTVVAC ECVPKGGRGD IINPEIPKAN DKSEEPITND 
KNKKPIPKEM IDAAEAAKKT NLVILLKTFQ ELEAEDINIV NGLLSSDLDI LGLKEAVEFG
NLDIVDTILK RGVNVNKKIQ ENEKDEFILL NYRNDLTPLH IAAKSGHTEI LLKLIEKGAE
LNAKDKYGDT PLHLAADAGH ADIVFKLIQK GANIKSATND GYTPLHLAIM KAHTEIALSL
IEQGANLDIS SIEGDTALNL AARKGYANIV LKLIEKGADV NIKNKIGLHP LYYIIREGHG
DIALTLIEKA KEIEVNTIDR QGNTLLHLAV YGDIRLLSKL VEKGVKIDIT NNSGNTPLHI
AAKYGCKEAV SVLVNCGAKK DIANNERNTP LDLAKTEEIR ALLK