Gene Aasi_1153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1153 
Symbol 
ID6377465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1473062 
End bp1475497 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content36% 
IMG OID642682259 
Producthypothetical protein 
Protein accessionYP_001958218 
Protein GI189502501 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACAAA AACTAATCAA ACATATTAAT GTAATAACGC TTGTCCTAGT GTTAGCTGCC 
TATACGCGAT GCCATCATAA GGAAAATACT GCTACCACAC GGATTGTTCC TACACCTATT
CCTATTACTC AGCAAATGGT ACAATTAGCT ACAGATAAAA ATCAAATTCT GTTAGCCGAT
ATACTTCAAA GTTTACAGCA TGACCGAATG GCTATTGTTA TGGATTCTCA TGGTGCTGAT
GTAAGACATT TTTATGCTTC TGTGTTATAC CAAGCTTTGA TAATAAAAAA CTTACCAATC
GTACATGCCC TCATATCAAA TGGCATTGAT ATTAATGTGG CTGATTGTAG TAAATACACA
GCATTACATT GGTCGATTGT ATGGAAAGAT TTAGTCCTTT GCCAATTTTT ATTAAGCCAA
GGTCAACTAG ATATAAATTG CGCAAATGAA GATGGTAATA CACCATTACA TTTAGCGATT
TTGGAAGACT GTATTGATAT TGCCAAGTCC ATTACATCAC ATCAAAGAGT TAATATTAAT
GCTGTTAATA ATGCTGGTTT TACTGCATTA CAACTAGCTA CTTTACGTAA TAATCTGCAG
ATGGCTGAAT TACTATTAGA AAAATCAGCT ACGGATGTTA ATATGCAGAA TGTTGTAAAT
GGTCGTACTG CTTTACATTT AGCATTCGAT TGGTACAGTA TACCTATGGT AGATATATTA
CTAGATAGGC CAGATATTAA TGTGAATCTT AAAGATAATA ATGATTGTAC TCCTCTTCAT
TTGTCAACGC TTAATGGTTA CTATGACGTA CTTATAAAAC TTTTAGATAA AGAAGCTGAA
GTTAATGTGC CAGACCATAA AGGTGATACA CCAGCACACG TAGCTGCTAG CGGAGGGTAT
GTTAAGATAC TTAAAGAGTT GAAAAATAGG GGTGCTCGCT TAGATCTACC TAACAAGCGT
GGTTATACTC CCCTTCATTT AGCTGCATTG AATAAACATT ATAAGATAGT AAAATGTATG
CTACAGGTAG CGCCAAAACT GAATATAACC ATAGATGTAA ACGTACGTGA CAATGAAGGA
AATACTCCTT TGCATTTGGC GACAAAAAAA GGAGATATGG ACATAGTTAT GGAATTAAGA
ACAAGAGGTA CTGATATAAA CTTATGTAAT AAACAGGGGC ATACACCCTT TCATTTGGCA
ATACTTAATG AAAATTATGA AGTAGCTAGA GTGCTTTTAC CAGAATTAAA CATAACAGCA
AATGCACAAG ATAAAGAGGG TAATACACCG TTACATATAG CTGTTAGTAA AGGATATCCC
AGTATAGTTG CCGATCTAAT CCTTATGGGA GCGAGAATAG ACATTCCGAA TAAAAATGGA
CATATTCCAC TACATTTGTC AGTATTTAAT GGTCATTATG AAGTTTTTAA AGAACTTATA
AGGGCAGGAT CTTTAAAGTT TGCAAACTTT AAAGATAATA AAGGTAATAC ACCATTGCAT
TTAGCTGCCA GTGGAGGGTT CTGGAAAATA GTTCTGGAAT TGATAGAGGC AGGTGTTAAC
ACAACTTTTG TAAATAAAAA TGGTTATACC TTTTTGCATC TGGCATTACT CAATGGCCAT
TATCAACTAG TTAAGAAATT TTTCCAGGCA AGAGATAAAA AAATACATAT AGATACGCAA
GATAATACTG GCAATACGTT ATTGCATTTA GCTGCTAGAA GAGGGTATAT GAAAGTGATT
TTGCAGTTAG GTGGCATAGG TGCTAACCTA GAGCTGCTCA ATAAAGATGG CCGTACACCA
CTACATTTGG CAGTACTTAA GGATCATCAT CAGATAGTAA AAACGTTCTT GCACTCAGCA
CCCGAATTAA ATATTGATTT ACAAGACTTT AAAGGCAATA CACCATTGCA TTTAGCGGCT
AGTAAAGGTT ATGAGGACAT AGTTGTTGAG TTAATAGGTA AAGGTGCTAA TTTGAATCTA
GTCAATAATT ATGGACATAC GCCCCTTCAC TTGGCAGTTT TAAAAGGGCA TCATCAAGTA
GTTAAGATGC TTTTGCTGGC AGAGGCTGAT ACAAATGTTC GGGATGAAGT GGGTAATACG
CCATTACATT GGGCAGCTGA TGCAGGGTAT GCTTGTATAA TCTCTGCATT AAGAGTTAAA
GGTGCTAAAC TCAACCTTGG TAACGATGAT GGTCAAACAC CTCTCCATTT AGCTGTGGTT
AGTGGTCATG ATTCAGCAGT TGAAGAAATT TTGCGAACAG GAGCCGATGT AGATGCACAG
GATGATGAAG GTAATACACC GTTGCATTTA GCAGTTATTA ATGGATATTG GCACATAGCT
TCAAAGTTAA GGGCTAACGG TGCTAAACTT ACTCTTAAGA ATAAAAGCCG TAAAATGCCT
CTACAAGTGG CAAAAGAGTA TAGTAAATTG TTATAG
 
Protein sequence
MLQKLIKHIN VITLVLVLAA YTRCHHKENT ATTRIVPTPI PITQQMVQLA TDKNQILLAD 
ILQSLQHDRM AIVMDSHGAD VRHFYASVLY QALIIKNLPI VHALISNGID INVADCSKYT
ALHWSIVWKD LVLCQFLLSQ GQLDINCANE DGNTPLHLAI LEDCIDIAKS ITSHQRVNIN
AVNNAGFTAL QLATLRNNLQ MAELLLEKSA TDVNMQNVVN GRTALHLAFD WYSIPMVDIL
LDRPDINVNL KDNNDCTPLH LSTLNGYYDV LIKLLDKEAE VNVPDHKGDT PAHVAASGGY
VKILKELKNR GARLDLPNKR GYTPLHLAAL NKHYKIVKCM LQVAPKLNIT IDVNVRDNEG
NTPLHLATKK GDMDIVMELR TRGTDINLCN KQGHTPFHLA ILNENYEVAR VLLPELNITA
NAQDKEGNTP LHIAVSKGYP SIVADLILMG ARIDIPNKNG HIPLHLSVFN GHYEVFKELI
RAGSLKFANF KDNKGNTPLH LAASGGFWKI VLELIEAGVN TTFVNKNGYT FLHLALLNGH
YQLVKKFFQA RDKKIHIDTQ DNTGNTLLHL AARRGYMKVI LQLGGIGANL ELLNKDGRTP
LHLAVLKDHH QIVKTFLHSA PELNIDLQDF KGNTPLHLAA SKGYEDIVVE LIGKGANLNL
VNNYGHTPLH LAVLKGHHQV VKMLLLAEAD TNVRDEVGNT PLHWAADAGY ACIISALRVK
GAKLNLGNDD GQTPLHLAVV SGHDSAVEEI LRTGADVDAQ DDEGNTPLHL AVINGYWHIA
SKLRANGAKL TLKNKSRKMP LQVAKEYSKL L