Gene Aasi_0851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0851 
Symbol 
ID6377039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1078070 
End bp1080955 
Gene Length2886 bp 
Protein Length961 aa 
Translation table11 
GC content41% 
IMG OID642681990 
Producthypothetical protein 
Protein accessionYP_001957951 
Protein GI189502234 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00404215 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAGAA ATTCTTCTCC ATTTCAAGTT TATATAGCTT GCATCTTACT TATCAGCTTT 
TTCTTACAAA ACTGCTGCGG AGAGTTTAAT AATAATCCAC TTATTTCGAC TCAAGAAAAG
CAAACAGCAT CTATCCAAAC GCATTCGCAA TTAATTCTTA CTCCAAGAGC AGACATCGGG
TCTTTGGCAG GCCAAGGACT AAAAGCCCAG GGGGGCCATG CGGTTACTTT TTATCAAGAA
GCCGGTGAGT TACGAGCTAA TGTCGAGATG AATGTGCCTG AACGGTTTAG TAAAACCTAC
GAAGGGTTAT TAGTAACCCT TGAGCAAGGA GCAGAAGTAA CGAGATTACC ACAGCTAAGT
GAACAGGCGC AGGAGCGTCG CATCCAGCTT CAATTAGCCA AGGGCAACCA GCCAGCTAAA
GTAGTGATCT ATAAAGGAGC AGGGCTCGTG GGAGGAATGC AGGAAGGAGA GGAGAAAGCA
GATGAATCTA TACCTAACGA ATGCCTTTGC CCCATTACTC ATGAGCTGAT GGAAGAGCCT
GTTATTGCAC GGGACGGGCA TACGTATGAG CGGGCGGCCA TTGAGCGATG GTTTTCAATG
GGCAAACGCA CGAGCCCTGT CACAGGAGCT AAAATTGGTA GCACTAAGCT GATACCTAAT
TATGCCATCC GCAGCCTGAT TAAAAGCTTA AAGACACACA ATTCTAGTTT AGCTAAGCCT
ATTCTGGGTA TGCAAACTGC GAGACAGCAA TCTCTATCAG TACCATCTGG AGCTTGCTTT
TTTGTGAGTG ATAGCGTTAT TGATGTACTT TTCTCTCAAT CCATAAGAGG AGATGTGGTA
GCATTAAGTA AGCTTTTAAA TTTGAGTGAA AATTACCATG TCCATGCGCA GTATAAAGTA
GGGGTTATGT GTGCAGAGGG GCGAGGTATA GCAAAAAATG CGGCCAAAGC AGTAGAATGG
TATGAGAAAG CAGCTAAGCA AGGACATGCG GTTGCACAGT CTAATCTAGG ATGGATGTAT
GCAGATGGGC GTGGTGTAGC TCAAAATTAT GCTAAGGCAA TAAAATGGTT TCAAAAAGCT
GCCAACCAAG GACATGCAAG TGCGCAATAT AAATTAGGAT GGATGTATGC AGAAGGGCTA
GGCGTAGTAA AAGATGCAAG GAAAGCAATA GAATGGTATG AGAGAGCAGC TAAGCAAGGA
GATGCAAGTG CACAATCTAA TCTAGGCGTA AGTTATGCTA ATGGATGGGG CGTAGCAAAA
GATGCAAGGA AAGCAATAAA GTGGTTTCAA AAAGCGGCGG ATCAAGGACA TGCAACCTCA
CAATATAATC TAGCATGGAT GTATGCAGAT GGGCAAGGAG TAGTAAAAGA TACAAGGAAA
GCAGTAGAAT GGTTTCAAAA AGCGGCTAAT CAAGGATATG TAAAAGCACA ATATAATCTA
GGATGGATGT ATGCAGAAGG ACGAGGAGTA GACAAAGATG CAAGGAAAGC AATAGAGTGG
TATAAAAAAG CTGCCAAGCA AGGACACGCG GATGCACAAT TAAAACTAGG AGCGAGATAT
TTCAAGGGTG AAGGTATAGC TAAAGATTAT GCTAAAGCAA AAGAATGGTA TGAAAAAACA
GCCGATCAAG GACATGCCCA TGCTCAATAT AATTTAGGAT ATATGTATGA AAAGGGATTA
GGAGTAGCTA AGGATTATGT TAAAGCAATA GCATGGTATA AGCAAGCAGC CAATCAAGGA
CATGCAAAGA GTCAATATGC CTTAGGAGTG ATATATATAG AAGGTCAAGG AGTAGCCAAA
GATGTAAGAA AAGCAATAGA ATGGTATGAA AAAGCAGCCA ATCAAGGGCA TGCAGATGTA
CAATTAAAAT TAGCAGCGAG ATATTTCAAG GGTGAAGGTA TAGCTAAAGA TTATGCTAAA
GCAATAGAAT GGTTTCAAAA AACAGCCAAT CAAGGACATG CCAATGCGCA ATATAATTTA
GGATATGTGC ATGAAAAGGG ATTAGGAGTA GCGAAGGATT ATGTTAAAGC AATAGAATGG
TATGAGAAAG CAGCCAATCA AGAACATGCA AAGAGTCAAT ATGCCTTAGG AGTGATATAT
GAGAGTGGAG AAGGAGTAGA AAAAGATGAA AAAAAAGCAA TAGAGTGGTA TGAGAAAGCT
GCCAATCAAG GACATGCAAG GGCACAATTT AGCCTAGGGG TTATGTATGG AGAAGGAGAG
GGAGTAGAAA AAGATGAGAG GAAAGCAGTA GAATGGTATG AAAAAGCAGC CAATCAAGGA
CATGCAAGAG CACAATTTAA ACTAGGATGG ATGTATGGAG AAGGACGAGG AGTAAGTCAA
GATTATGCTA AAGCAATAGA GTGGTCTGAA AAAGCAGCCA ATCAAGGACA TGCAAGAGCA
CAATATAATT TAGGATGGAT ATATGAAAAT TGGAAGGGAG TAGCCAAAGA TTATGCTAAA
GCAGTAGAAT GGTTTCAAAA AGCTGCTAAT CAAGGATATG CAAGAGCACA ATATAATTTA
GCTCGGATGT ATGACCATGG ACAAGGTGTG GTCCAAAACT ACCAAGAGGC AGTAAAATGG
TATGAAAAAA GTGTTGGACA AGGAAATAAC TATGCCAAGG CCTACCTAGG TCGTTTGTAT
TATCATGGTT TTGGAGCTGA GAAAAATCTT TTACAAGCAA GTAAATTAAT CGAGGAGGCT
ATTATCCATA TGAAAAGCAA GGCTGAGGAG GGTTGTATAG AAGCCCAATA TATAGTAGGA
TGGATGTATC AATATGGCCT AGGGGTAATG CAAGACCACG TGGAGGCAGC AGTGTGGTAT
AAGAAATCAG CTAACACTTA TCCAGCAGCA CAAAAAGCAT TAGATGAGTT GAACCATGCT
GGCTAA
 
Protein sequence
MKRNSSPFQV YIACILLISF FLQNCCGEFN NNPLISTQEK QTASIQTHSQ LILTPRADIG 
SLAGQGLKAQ GGHAVTFYQE AGELRANVEM NVPERFSKTY EGLLVTLEQG AEVTRLPQLS
EQAQERRIQL QLAKGNQPAK VVIYKGAGLV GGMQEGEEKA DESIPNECLC PITHELMEEP
VIARDGHTYE RAAIERWFSM GKRTSPVTGA KIGSTKLIPN YAIRSLIKSL KTHNSSLAKP
ILGMQTARQQ SLSVPSGACF FVSDSVIDVL FSQSIRGDVV ALSKLLNLSE NYHVHAQYKV
GVMCAEGRGI AKNAAKAVEW YEKAAKQGHA VAQSNLGWMY ADGRGVAQNY AKAIKWFQKA
ANQGHASAQY KLGWMYAEGL GVVKDARKAI EWYERAAKQG DASAQSNLGV SYANGWGVAK
DARKAIKWFQ KAADQGHATS QYNLAWMYAD GQGVVKDTRK AVEWFQKAAN QGYVKAQYNL
GWMYAEGRGV DKDARKAIEW YKKAAKQGHA DAQLKLGARY FKGEGIAKDY AKAKEWYEKT
ADQGHAHAQY NLGYMYEKGL GVAKDYVKAI AWYKQAANQG HAKSQYALGV IYIEGQGVAK
DVRKAIEWYE KAANQGHADV QLKLAARYFK GEGIAKDYAK AIEWFQKTAN QGHANAQYNL
GYVHEKGLGV AKDYVKAIEW YEKAANQEHA KSQYALGVIY ESGEGVEKDE KKAIEWYEKA
ANQGHARAQF SLGVMYGEGE GVEKDERKAV EWYEKAANQG HARAQFKLGW MYGEGRGVSQ
DYAKAIEWSE KAANQGHARA QYNLGWIYEN WKGVAKDYAK AVEWFQKAAN QGYARAQYNL
ARMYDHGQGV VQNYQEAVKW YEKSVGQGNN YAKAYLGRLY YHGFGAEKNL LQASKLIEEA
IIHMKSKAEE GCIEAQYIVG WMYQYGLGVM QDHVEAAVWY KKSANTYPAA QKALDELNHA
G