Gene Aasi_1218 details


Gene Information

Locus tag: Aasi_1218
Symbol:
ID: 6376753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1556591
End bp: 1558249
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 39%
IMG OID: 642682316
Product: hypothetical protein
Protein accession: YP_001958274
Protein GI: 189502557
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
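
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only and assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein; neither assumption is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1556591
end_bp = 1558249

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result matches the listed Gene Length (1659 bp) and Protein Length (552 aa), with no leftover bases.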


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000594421
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAGAAG GAGAAGAGGA AGCAGCAGAA GATAAGCTAT CAGATGAGAA TATACCCGAC 
GAATGTTTTT GCCCCATCAC CCAGGAGATT ATGGAAGATC CGGTCATTGC TCAGGATAGC
CATAGCTATG AACGATCAGC CATACAACGC TGGTTTGATG TGGGAAAGCG GGTCAGCCCT
ATGACTGGAA AGAGGCTGCT TAGTACCGAG CTCATAGCTA ATTATACCAT GCGTAGTTTA
ATTCAGGATA TAAAAGCACA GGTACCTGTT TTAACCAGAC ATAAGCTGGA TATACGTAAT
ATTGAAGCAG CTATTAAACT CAGAGAAGAA GAGATAGAAG AAAAATTGAT ACAAAAGGGG
CATTTAGTAG AAAAAGAAAG CCAGGAACGG TTAAGCTTAG AAGAAAAGCT ACAACAAAAA
GAGATAGAGT TGCAACAACA TAAAAGAAAA TTGGAAGAAA AAACAGCTCA GCTTCATATT
ATGGAAAAAC GAATTGGGCT ATTAAAAGAG CAAGTTAACT CTTTTATAGA AAGAGATAAA
CAGATGCGTA CAACGATGCA AGAGTGTATA CTACAAATGC AGCAGTACAT GGTGCAGCCT
GGTCCACCTG TTAGTAGTTC AAGCAGCAGT GTTTCTGCCT TTCAACAAAA AGTAAAAGAA
GGAGTTCTAG AAGGCAATCA TAATAATAAG GCAGAACAAG ATTATCCAGC AGTTTCTGAA
CAAAAACTTC GATATTTTTT AGACAGTAAG AGGTTGAAGG GAAAGGTTCT AGATGAGCAA
GAAGCTATAA AACATTGTAA AGAAGGAGCT ACTATTGGGC ATATGTATGC ACAATATGTG
CTAGGTAATA AGTATAGGAG CGGGCGACAG GGATTAAAAA GAAATTATGC TAAAGCTAAA
AGATGGTATG AAAAAGCAGC TGAACAAGGA TATGCAGAGG CACAATATAA GCTAGGAGCT
ATGTATGATA ATGGAGAAGG GGTAACAATA GACTTTATTG AAGCTAAAAA GTGTTATGAA
AAAGCAGCTT GCCAAGGTGT GGCGGTTGCT CAAGCTAGGC TAGCAAGCTT ATACTATTAT
GGACGAGGGG TTCAATTAAA TAGGGCTGAA GCAGAAAGAC TATGCTTACA AATAAGAGAG
AAAATAGCCA TAGATGCTCA AAAAGGTGAT GCAGATTGCC AGCTTAGTTT GGGTTGGATG
TATTATCATG GTTGTGGTAT AAGGAGGAAT TACTCAAGAG CTATGGCATG GTATCTGAAA
TCTGCTAACC AAGGATGTGC AGCTGCCCAG AATAATTTAG GCGTTATGTA TGCGTATGAT
TGGTTCGGAG CGATAAAAAA AGACTATACA AAAGCTAGGG AATGGTATCA GAAAGCAGCT
GAACAAGGAT ATGCACATGC ACAATCTAAC CTGGGGGGGC TATATTATTC TGGGCAAGGG
GTAGAGAAAG ATGATAGAAA AGCATGTGAA TGGTATCAGA AAGCAGCTGA ACAAGGATAT
GCACATGCAC AATATAGCTT AGGCATAATG TATAGGAATG GATTTGGGGT AGGAAAGGAT
AATATAAAAG CTATAGAATG GTTTCGAAAA GCCGCTGAAA AGGGCTATGA GGATGCACAA
ATAATACTTA ATTCGGTAGT AATCCATTTT TCATCATAA
 
Protein sequence
MEEGEEEAAE DKLSDENIPD ECFCPITQEI MEDPVIAQDS HSYERSAIQR WFDVGKRVSP 
MTGKRLLSTE LIANYTMRSL IQDIKAQVPV LTRHKLDIRN IEAAIKLREE EIEEKLIQKG
HLVEKESQER LSLEEKLQQK EIELQQHKRK LEEKTAQLHI MEKRIGLLKE QVNSFIERDK
QMRTTMQECI LQMQQYMVQP GPPVSSSSSS VSAFQQKVKE GVLEGNHNNK AEQDYPAVSE
QKLRYFLDSK RLKGKVLDEQ EAIKHCKEGA TIGHMYAQYV LGNKYRSGRQ GLKRNYAKAK
RWYEKAAEQG YAEAQYKLGA MYDNGEGVTI DFIEAKKCYE KAACQGVAVA QARLASLYYY
GRGVQLNRAE AERLCLQIRE KIAIDAQKGD ADCQLSLGWM YYHGCGIRRN YSRAMAWYLK
SANQGCAAAQ NNLGVMYAYD WFGAIKKDYT KAREWYQKAA EQGYAHAQSN LGGLYYSGQG
VEKDDRKACE WYQKAAEQGY AHAQYSLGIM YRNGFGVGKD NIKAIEWFRK AAEKGYEDAQ
IILNSVVIHF SS
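
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal sketch, not the method the database used: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which assigns the same amino acids as translation table 11 (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons from the first base; stops appear as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and last 9 bases of the gene sequence above.
first_block = "ATGGAAGAAG GAGAAGAGGA AGCAGCAGAA"
last_codons = "TCATCATAA"

print(translate(first_block))  # MEEGEEEAAE
print(translate(last_codons))  # SS*
```

The two fragments reproduce the first ten residues and the final two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.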