Gene Aasi_0432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0432 
Symbol 
ID6377352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp504418 
End bp507426 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content38% 
IMG OID642681597 
Producthypothetical protein 
Protein accessionYP_001957576 
Protein GI189501859 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0267452 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAC ATTATAGTGT AGCTCGTCAA TTTATGGCGT GTATTCTATT TGTAAGCCTA 
TGCTTACAAA GCTGTACAAA TATTTATGTC CCATCAAACC TTCCAATAAA AGAGGGCGAA
AAATACGTTA GTGAGAGGGT TGTCCGGCAG CTAATAGGCA AACAATCGAT TACCAAAGAA
GGGCATATCG TTACTTTTTA TCAGCAAGGA AACCAACTGC GTGCGGAAGT TGGAGAAAAT
TTACCACAGG GATTCGATAA AACGTATACT GTGCCTGTGT ATATAGAGGA AGAAATGAAG
CTAGCACAGG TAGCTAGTTT AAACGAAGAA GCTCAAAAGA AAAACATTCA TGTGGTTTTT
CCTAAAAACC AACAGGAAGG GTATGTATAT GTAGGCCACA CAGGTTTAAT GGGAGGAGGT
AAGGATAAAA AAAGCAGGAA GCAAGTATTG CCAGGAGAAG AAGAGGTAGA AGAAGAAACA
GAGGAAAAAT CCCCCAGAAG TACTGAAACA GAAGGTTATC AGGTAGCAAT CCAACCAGCC
TTTTCGCCTG AGCACCAACA AAAGTCCACA CCTCGTTTGA TCTTAAGTTT AGATGGAGGA
GGTATTCGAG GGCTACTAGA AGCAGATGCG TTAAATTACA TAGAAAAAGT ATTAGCAGAA
AGAATTATAA ATCATTTTGG TGATCGATCT GCTCCAAAAC CTGATGTGCG CTTAGGTGAA
TATTTTGACT TAATTGCAGG CACTTCCACA GGTGGTATTA TTGCTTTAGC TATGCGTATT
TTAGACCTTG CTACCAATCG GCCACGTTAT AATATGGAAA TAGTATCAGG AATATATAAA
GATAAAGGAG GCAAAATCTT TTATGGTAAT AACAAACTTT GGAAACTATT GTGCCAAGCA
AAATCTAATA TATATAATCC TAAGCCTTTA GAGGATATTT TAACAGAGTA CTTTGGCAAT
GCAACTTTAC AAGATTTATG TGATCCTGTT TTAATTACTA CGTATGATAC AGATAAACCT
GGTATTTATC TTTTTAAAAG CTCTGATACA AAAAATGGTG CAAGCAAAAA CTTTTATGTA
AAGGATGTTG CTAGGGCTAC TTCAGCAGCT CCTACTTATT TCCCTCCAGC ACAGATTAGT
TCTATAAGTG GAGAAAAATA TTGTTTTATA GATGGAGGAG TTGCTGCGAA TAATCCCGCT
CTCTACGCTT ATACATATGC TAAGGATAAC TTATACCAAA ATTCTCGTTT CCATCTTATT
TCTTTAAATA CAGGAACATC CCCAAAACCC AGCTTAGCAC GTACAGCAAG TAAAGGTGGT
GTTCTATTGG TACCTAAACT AATAGAGGTA GCTATGAATA GCAACAGTGA TGCTGTTGAG
TCGTATACAG CATCTTTGAT TACTGAGAGA CCAGGAGACA CTTATACACG TTTAGAATTT
GAAATTGATC ATCAGACCAC CAAAGCACTT GATAATGCTA GTAATAGCAA TTTAGAAAAG
TTGGTAAAAT ATGCTTGCAA AACAGTGGAG AAAGAAAAGG ATGAAACCTT GAAGACTATA
GTGGAGGCTA TAGTGGATAG GTTAAAAAAG TGCAATTATT ATGTTTTTCA CTCGCTTGTT
AAGGAAGCCC GTGAGCAACT ACAAAATGGT GAGGGTAGGG CTGATTTATC CAAGACATAC
CTACAATCTT TGATGCTGCC GTATGTATGT GAGCGTGCTA CATGGGAAAT TGCACATGCT
TTAAGCGTAC CCCACATGCC CAGTTTAACC TATTTAGATT TAAGTGGTAA TGAGCTTATA
TCTAAAGGTA ATAGTTTAAC ATATTTAGAA AAGCTTAATA GTCTTATTTA CCTAGATCTT
AGTAATACAG GTCTAACCAT AGATGGATTA GCAAAGCTAA AAGGTGCTAA GTTACATCTA
GATATACTAA AAGTAAGAAA CAACCCAAGA TTAAACTGGA TAAAGGCTGG TAGGATTGCA
AATGAAATTG AGAACTATAA AATTTCTTAT CTGGATAGTG ATGTTATGAG AAATTTGGCA
AGTCACTATC AAACTCAAGG CCGAGACACT AGAGCGGCTC TAATGATTGA CCTAGCTGAT
GATAAACATA CTACTGGTCC TGCTGAATTT CATTTAGGTA GGATGTATGA GAACGGATGG
GATTTAGCTA AGAACTGGGA AAAAGCAATA TTATGGTATC AAAGAGCTGG TAACCAAAAC
CATACAGAAG CACAATACAG GTTAGGTAGG ATATATGAAA ATGGCAGGGT AGCAAAAAAG
GATGAGCAGA CGGCTGCTCA ATGGTATGAG AAAGCTGCTA TACAAGGAAA TAGAGTGGCA
CAATATGCAT TATGCTCCAT GTATGAAAGA GCTGTTAGAC AAGGGTGCCC AAAGGTACAA
TATAGCTTGG GAAAAATGTA TTATAATGGC TGGGGAGTAG ACAAAAATTA TCAGGAAGCA
GTGGAATGGT ATCAAAAGGC AGCTAACCAA GGATATGCAG AAGCACAATA TCAATTAGGA
TATATGTATG AATATCCCAA AGGGCTATTG CAAAATTACA AGGAAGCAGC CAAGTGGTAC
CAAGCAGCAG CTAAGCAAGG TATAATAACT GCTCAGGTTA AGTTAGCAGA TATGTCTTAT
TATGGACTAG GTGTTGATAA GGATGAACAA GAAGCATTCA GATGGTTTCA AAAGGCAGCT
AATCAAGGAC ATGCAGCGGC ACAACTTGTT TTGGGGGTAA TGTATGTCAA TGGACGAGGT
GTTACCAAGG ATGATGTTAA AGCTGTAGAA TGGATTGAAA AGGCAGTTAA TCAAGGAGAT
GCAGAAGCAC AACTTGTTTT GGGGATAATG TATGCCAATG GACGAGGTGT TAATAAGGAT
GAAGAACAAG CAGTAGCATG GTATCAAAAA GCTGCCGATC AGGGAAGTGC AGTTGCACAA
TATATGCTGG AGCAGAGGTA TGAGAATGGA CGAGGTGTTA CCAAGGATGA TGTTAAAGCT
GTAGAATAG
 
Protein sequence
MKQHYSVARQ FMACILFVSL CLQSCTNIYV PSNLPIKEGE KYVSERVVRQ LIGKQSITKE 
GHIVTFYQQG NQLRAEVGEN LPQGFDKTYT VPVYIEEEMK LAQVASLNEE AQKKNIHVVF
PKNQQEGYVY VGHTGLMGGG KDKKSRKQVL PGEEEVEEET EEKSPRSTET EGYQVAIQPA
FSPEHQQKST PRLILSLDGG GIRGLLEADA LNYIEKVLAE RIINHFGDRS APKPDVRLGE
YFDLIAGTST GGIIALAMRI LDLATNRPRY NMEIVSGIYK DKGGKIFYGN NKLWKLLCQA
KSNIYNPKPL EDILTEYFGN ATLQDLCDPV LITTYDTDKP GIYLFKSSDT KNGASKNFYV
KDVARATSAA PTYFPPAQIS SISGEKYCFI DGGVAANNPA LYAYTYAKDN LYQNSRFHLI
SLNTGTSPKP SLARTASKGG VLLVPKLIEV AMNSNSDAVE SYTASLITER PGDTYTRLEF
EIDHQTTKAL DNASNSNLEK LVKYACKTVE KEKDETLKTI VEAIVDRLKK CNYYVFHSLV
KEAREQLQNG EGRADLSKTY LQSLMLPYVC ERATWEIAHA LSVPHMPSLT YLDLSGNELI
SKGNSLTYLE KLNSLIYLDL SNTGLTIDGL AKLKGAKLHL DILKVRNNPR LNWIKAGRIA
NEIENYKISY LDSDVMRNLA SHYQTQGRDT RAALMIDLAD DKHTTGPAEF HLGRMYENGW
DLAKNWEKAI LWYQRAGNQN HTEAQYRLGR IYENGRVAKK DEQTAAQWYE KAAIQGNRVA
QYALCSMYER AVRQGCPKVQ YSLGKMYYNG WGVDKNYQEA VEWYQKAANQ GYAEAQYQLG
YMYEYPKGLL QNYKEAAKWY QAAAKQGIIT AQVKLADMSY YGLGVDKDEQ EAFRWFQKAA
NQGHAAAQLV LGVMYVNGRG VTKDDVKAVE WIEKAVNQGD AEAQLVLGIM YANGRGVNKD
EEQAVAWYQK AADQGSAVAQ YMLEQRYENG RGVTKDDVKA VE