Gene Aasi_0544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0544 
Symbol 
ID6376686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp692469 
End bp695873 
Gene Length3405 bp 
Protein Length1134 aa 
Translation table11 
GC content35% 
IMG OID642681698 
Producthypothetical protein 
Protein accessionYP_001957674 
Protein GI189501957 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.142127 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAG AGCGTCCGTT AATTAATCGG TTAGGCACCC ATATTTTAGT TTCAGCAATC 
TTGCTATTTC TAGTTATAGG ATTTTTAGAA GGCTGTTCAG AGTCTTATAA CCATATTATT
CCTAATTCAA AAAGATCAAA AGATTCCAAG TCTAAATCCG TTACCCAACC GCCTTCTTCT
AATAATACAG AAGCATCAAA AGGCGCCTAT TTACAAGCAG ACAAAAAGTA TGATTCTTAT
TTTAATAAGC TACCTGCTGA AATACTACAA GAAGGCCAAC TACATCCGAA TAACACTAGT
ACTAACCAGT ATAAAGTTAC TTTTTTTGAC TCGCCAACTG ATTCCTTTAT AGAGAATGAG
GATGATACCT CAAATATTGT AGAACCTTCT TTTGAAAAGG ATCAACAGTT AGTATCTAAA
ACCCCTATAA ATCAAAATAA AGGTTTCTTG GCTAAAGATC TGCTTTCTAA TCAGCATAGA
GAAGCTAAAA AACAAGCTAT GCAAGAAAAA GAGGTACAGC TGATAAATAA GATCTTTGGA
GTTCAAGGAG GGTATATAGT TAGATTTTAT AAAGAGAGAA ATATATGGAA AGCTGAAGTA
ACAGATGATG CATCTACGAA GTTTCACTTA ATAGCTAATT TAGGAGAAGA TGAAGAATTA
ATAGAAAATA TAAACAGATT ACTTACTATA GATAACTACC AACCCGACAA ACCCCCTTCA
AAGCTTATCC AAGTTGTTTT TCCGACCGCT TCCCAAGCAG GTGTTGTATA TATAGGAGGT
TTACTAGGAG GAAGTAGAAA AAAAATAAAA AGAATAACCG TTTCCTCTTC TTCCTCTTCT
GGTGAAGGTG AAAAAGGACA GGAAGAAAAT AAGCAAGAAC AAGCTAGAAA AGAGCTAGAG
AATAAGCTAA AAAACATACT GAGCCAACTT GATAAACTTG ACTCTAAAAA AGATAAGGCT
AAATATAATA ATGCTAAGGA GTTCTTATTA AGTTTCCCAT GGGAACAATA TAAACAATAT
TTAGGCAATC AAGAGACTAC AGAACAAGGT TCAGAACTTC TTGAATGGGC AACTATATAT
TATAAGAAAA TTGCCAAAAC TAAAAACTCA ACAGCGGCAA AGGCTCTTAT AAAATTAAGA
GCTCTAGGTT TATATGTGCC TAACAATTTA GATAAACACG CAATTGAGTG GTATCAACAA
GCAGTTGAAA GTGCTATTCA AGGCACAGAG TTAAATTTTG GAGAATTTCA AGCAAGCGCA
AAAAAGGAAT TCAAACTATC TATAGAAGAC TTAGAAAGAA GAAAGAAGAA AAGAACAACA
TCAGAACCTA GTTGGCAAGG CCCTATGCGG ATATCTGATC TAATTGTAAT GGATTTAGAT
GTTAATAAGG CACCTGGGGA GGTGGCATGG GGAGGGAATT ACGATTCATT TAAAGAAGCT
AATGCAGACA TGATTTTTGG TCCCTTGTTT GAGTATGCGA ATACACGCAA ATATGAGGAT
GCCATTATGA CTATTGATCC TTCTGGAAAA GGTACTGATG AAACGGCATA TTGTGTGGCC
AAACGATGTG GAGATTACTA TTATATTATG GATGTTGGAG GAATGGCCGG GAGTTATGCT
AGGGAGAATT TGGAATCTGA AGATGAAGAA AACCCCAAGC ACAGCTTAGG AAATAGTCCA
GAGGTTTTAA AAAAATTAAT ACTAATTGCC CAAGAATATG GTGTTAGCAG AATAATTGTT
GAAAATAACA ATGATAAATC TTTTGTGAAG CTCTTAGAGA AAGAGATGCT AGAATTAGAA
TCTTCTAAGG ATGTAAAATT CAGGGTGATA AAATTTGATT CAGTCCATCA AGGAAAAAAC
AAAGAAAAAA GAATCTTAGA AACGCTTCGT CCTTTGTTAA TCAATCATCA AATTGTTATT
AATCGAGAGG CACTTAAAAA TGATTTTGAA TCTAAGCCCC AAGATAATCT GTACTATAAA
TTCTTTTATC AATTAATGTC AATAGTAAAA AATAAAGCTC ACTCAACAAG TTATTTTGAT
GGACGCAAAC CTTTACATGA TGATCGGGTA GATGCTGTAG CAGATGCTAT ACGTTATTTG
AAAGATAGAA AAGATAACCT AGATAATACC AAAGAGGACT TAAAAAAACT AAAAGACCGT
TCAAGAAAAG GAAATATTGA TGCACAATTT GAATTAGCCA GAAGGTATAA AGATGGAATA
GGAGTATCAG AAAATTTTGA GGCAGCAATA GGATTGTATG AACTGATTGC TGGGAATAAA
AACATTGAAA AGAAAAAAGA TAAGTATACA GAGGCTCTCT TTAATTTGGC TCAACTGTAT
CAAATTAAAT GGAAGGAACA GCAAATACAA ATAGAGCAAG AAAAGCAAAA AAAAGGAAAA
GAAAAGAATA AAAGTGAGGA GTACTCTTTA GAGCAAATAA GTGAATTCTA TAGAAGAGCA
GCTGATCAAG GACATGCAGG AGCACGTTAT GAATTAGGTA GACTCTATGA AAAAGGCTTA
GTAAAACAAA AAGCACATAA GGGAAAAGGA GAAAATAAAA CTAATGCTTC TATTGCAATC
AATATGTTTG AGATAGCGGT GGACAAAGGT CATGCTAAAG CAGCTTATAG ACTAGGTAAG
ATATATCAAA ACGGTTTGTT AGAATTAGAA AAAAACCCTA TTAAAGCAAT GAAATATTAT
ACAACTGCTG CTAATATGGG AAATATGAAA GCTAAGCTTT ATTTAGCAAA TATGTATTAT
GAGGGTAAGG GTATAGAGCA AGATTATAAA AAAGCTAAAA AATATTATAA AGGCGCAGCT
GAATTTGGTT ACTTGGAAGC ACAGATCAGA CTAGCCGATA TGTATTATGA GGATCAGGAA
GGGCAAAACT ATATAAAAGC TTTTAAATGG TATCAAGAAG CTGCTATTCA AGGAAGTCAA
GAAGCACAAT ATAGGCTAGG TAAGATGTAT GAGAATGCCT GGGGCATAAA AAGAAAGGAC
TTAGAACAAG CACTTCGCTG GTACAAAGCG GCAGCTGAGC AAGAGCATAT AGATGCTCAG
TTTGAAGTAG GTAGACTTTA TGAAAATATG GATGATTACA TTGAGGCATC TGAATGGTAT
GAAAAGGTGG CTAGCCAAGG TCATGCAGAA GCTTGTTTTA AATTAGGCAA CCTCTTACTG
GGTGATAAAT TAGGAGAAAT GGATGAAACT AGGGCAATTG AATTGTTTGA AACAGCTGCT
GACCAGGGTC ATGCAAATGC AAAACTGCGT CTAGCTTTAA TGTACTCACT TGGTAAAGGG
GTAGAAAAAG ATGAAGCTAA AGCATTAGAG TATTATGAGT CCGATGATGT ATTACCAACT
GAGCTCTTAG AGAATATTGT TAAAAGAGAA CGGGCCGATA GTTAA
 
Protein sequence
MKQERPLINR LGTHILVSAI LLFLVIGFLE GCSESYNHII PNSKRSKDSK SKSVTQPPSS 
NNTEASKGAY LQADKKYDSY FNKLPAEILQ EGQLHPNNTS TNQYKVTFFD SPTDSFIENE
DDTSNIVEPS FEKDQQLVSK TPINQNKGFL AKDLLSNQHR EAKKQAMQEK EVQLINKIFG
VQGGYIVRFY KERNIWKAEV TDDASTKFHL IANLGEDEEL IENINRLLTI DNYQPDKPPS
KLIQVVFPTA SQAGVVYIGG LLGGSRKKIK RITVSSSSSS GEGEKGQEEN KQEQARKELE
NKLKNILSQL DKLDSKKDKA KYNNAKEFLL SFPWEQYKQY LGNQETTEQG SELLEWATIY
YKKIAKTKNS TAAKALIKLR ALGLYVPNNL DKHAIEWYQQ AVESAIQGTE LNFGEFQASA
KKEFKLSIED LERRKKKRTT SEPSWQGPMR ISDLIVMDLD VNKAPGEVAW GGNYDSFKEA
NADMIFGPLF EYANTRKYED AIMTIDPSGK GTDETAYCVA KRCGDYYYIM DVGGMAGSYA
RENLESEDEE NPKHSLGNSP EVLKKLILIA QEYGVSRIIV ENNNDKSFVK LLEKEMLELE
SSKDVKFRVI KFDSVHQGKN KEKRILETLR PLLINHQIVI NREALKNDFE SKPQDNLYYK
FFYQLMSIVK NKAHSTSYFD GRKPLHDDRV DAVADAIRYL KDRKDNLDNT KEDLKKLKDR
SRKGNIDAQF ELARRYKDGI GVSENFEAAI GLYELIAGNK NIEKKKDKYT EALFNLAQLY
QIKWKEQQIQ IEQEKQKKGK EKNKSEEYSL EQISEFYRRA ADQGHAGARY ELGRLYEKGL
VKQKAHKGKG ENKTNASIAI NMFEIAVDKG HAKAAYRLGK IYQNGLLELE KNPIKAMKYY
TTAANMGNMK AKLYLANMYY EGKGIEQDYK KAKKYYKGAA EFGYLEAQIR LADMYYEDQE
GQNYIKAFKW YQEAAIQGSQ EAQYRLGKMY ENAWGIKRKD LEQALRWYKA AAEQEHIDAQ
FEVGRLYENM DDYIEASEWY EKVASQGHAE ACFKLGNLLL GDKLGEMDET RAIELFETAA
DQGHANAKLR LALMYSLGKG VEKDEAKALE YYESDDVLPT ELLENIVKRE RADS