Gene Aasi_1229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1229 
Symbol 
ID6377259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1568697 
End bp1572446 
Gene Length3750 bp 
Protein Length1249 aa 
Translation table11 
GC content40% 
IMG OID642682325 
Producthypothetical protein 
Protein accessionYP_001958283 
Protein GI189502566 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0154751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA ATTATACCCT CACACAGCAA CTGATAGCTC GTCTTCTACT TATAGGATTA 
TGCTTACAAA GCTGTGGTGA AGGATTCGAC AACCATCCAC TTATTTCTAC ACAAGAAGAG
CAAATAGACC AGGTACAGGT TACTGCTAAG CAGTTAGCAG ATACAACCTT TATTCCAAAG
GAAGATCATC AAATTGATTT AGATCAGGAA GCAAACGAGC TACAACCTGC TGCTAGCCAA
GGAGACTCTA GTTTACTTAA TAAATATCAA AAGGTAGGAG GACAAGGAAA TAATATAGTT
GTCATACAGG AGCAAGACAG AAAAGATAAA CCAATTAAGC GTAATGAAAA GGTAGGTGTA
GTACACTATA GTAATAACAA GCTAACGATA AGAGATAGTC GGCCAGCAAC AAAGCAACCG
TACACTAGCC CAGCAAACAA AACTACCCAA AGTATTGCAG GCCAGCAGCA GCTCGTCGTT
AATAATAGCA TGAAAGCATT ACTAACCAAC CAGGTGTGTA TCACAAAAGA AGGCTACCAA
TTAAAATTTG AACAAAAAGG GGCAGGTACA TTAGAGGCTA TCGTAGAAAA TAGATTTCCT
ACTGGCTTTA CTAAGCAACG CTTACCTGTT GTAATAGATG GAGGATTAAG TTGCACTGAG
GCAGCCGTTG GCAATATAGC TTGGCAGAAG CAGTTCATCC ATGTTAGCGA TCAATATGTG
TATGTAGGCC AGAACGGTTT ACTAGGAGGA GGAAAAGATA AGGGTCTAGT ATCCGAAGAA
GAACAAGAGG AAGCCAATGA GAAAGAGCCG CTTGGAACAG AAATAAACCT ATCTAAAGAA
GCCATCAAGA GCTTATGGCA GATTGGGCAA GAGGTAGGAG AAGAAAAATA TCGTGATAGA
GCCTTAGGGC TTTTAGCCTT GGAAGCGCTC AACAAAGGAG GCGTATATAC AACGCGTTAT
TTAAAGAAAT TGAATCAACT AGGTACAATC CCTATTATAG TAGGTAACCA ATCAGCCTTA
CCAGCAGCAT GGCAAGGTAA AACACCGGAT GAATTACAAG CGGTCTTACA CACCTTGTAC
CTACAAGCAT TAGAAGGGAA AGAAAGGCAT TTATCCACCC TTGCCAAAGC TGAGCAGGCA
GCTGAGCAGC TAGCTATTTT GGAGCTTAAG TATCACTTTC AAAAACTTTT AGAACCTACT
AGCCTAGAAA AAGAAGGGCT TGTTAAAGAT ATACAAAAAG GGATTGATAG ATTAGGCATA
CTCCCTATGA GTGAGCTAAT ATCCTTACGA AAATCCTACC TCAGTTACTT AGTGCGAGAA
GCCTTAACAA TGGATTTAGC AACCAATACG GCCCATGTTA AAAAGTTTAT AGAAAAGGTA
GGAGAGTTAG CACAAAAAGG AAAGAGTAAG GCTAAGCATA GTCAGACAGA GCGAGAAGCT
TTAAATGCTT TCTCGGGCAA GCTATACCGC CAGTTAGGGG GCCTCTCTAA AAAGCTAAGC
CAGAACTCCC ATGAAGCAGT AAAGAGTATA GAATGGCAGC GTAGTTATTA TCTAAAAGCG
ACCAAGCTAA AAGATGCGGT TGCCTATTAT CAGCTAGGGT TGCTATACGG AGATAGTAAT
AGTACATATT ATGATATGCA TGAAGCCAAA GAAGCCTTAG AGGAATCAGC CCGACTCGGG
TATGTACTGG CTTTCTATGC CTTAAGCGAG CATTATGAAA AAGAAGGGGA CACAGAATTA
GCCCTGTATT GGCAAGAGCA AGCAGCCAAG AAAGGACATC CCCAGGCACT TTACAAGTTA
TCTCGAGCCT CTGAGCAAGA AGCTCGTTTA AACTACTTAG AACAAGCAGG TTTTTCAGGA
GATAGCCAGG CCCAATATGC ATTAGGTAAC TATTACTGGG GTCAAGGGGA TTATGCTACG
GCTATTGAAT GGTACAATAA AGCAGCCTAC CAAGGTGTGC AAGCTTCCTA TGCCTATTTA
GGAATTGCGG CCCGTAAAGG GTTAGGTTGT CCTAAAGATA TCCAAGCGGC TTTGGGTTAT
TATCTTCGGT CTGAGAAAGC AGGAGAAGAT AATGCCATTG CCCGGTATGG AAGGGCTAGA
CTTTATGAAA AAGGAGAAGG CGTAAAGGCA GACCTAGGTA CTGCATTAAG TCTCTATACC
CAGGCTAGTG AACTAGGCCA TGCAAAGGCC TCCTATCATG CAGGCAAGTT GCATTTAAGT
GGGCAAGTAG AAGGATGCAA TATTCAAGCC GGGTATGCGC TTTTAGAAAA AGCAGGTAAA
GCAGGGATAT ACCAAGCATG CTTGCTTTTA GGTAAGCTGT ATGAACATGG TTGGGGCAGG
CTACCAGATC CAACAGCAGC CTTACATTGG TATAGCCAGG CCGCAATAAA TTCATCCAAG
GCTAAACGTA TGAGCGACTT AGCTCAATTA GCATTGCTAC ATAAGCTAGA TAATGATATT
GGAATCGAAG AAGAAAAAAA ATTATCGTTG CTAGACCAAG CTTCATTAGT AGAAAAAAAT
AAGTATATGC GACTTGAAGA AGCATATCAG GAAGCTCAGA AAGCTAAAGT TGCTGGAGAG
GAAATGCTAC AATTTTTAAG GGACCGGAAT CGAAAATATA AACTTGATCT AGTAGAAAAG
AGTAGAGCAC TCGAACAACT CAAAGGTCAT AACGAAGCAG AAAAAGGAAA GTTATACCAG
CAATTACAAG AGAATGAGGA ACAAGTTAAG AAATTAACAG AAAAACTTTG TTTGTATGAA
CAAGAGAATA GCTTAGCTAA AAAAAATGCT GCTCTTTTGC ACCATCAACT AACCGATAAA
GAGCAAGCCT ACCAAGCTCT TAACAAACAA GAAGTAAATT TAAGAAGACA ACTAGCAGAA
AAAGAAGAAG CGTACAAGAA GCTAATGCAG GAGAAATATC TCCAACAAAA AACTAATAAA
GAATTAGCCC AGTTATTAAT CAATAAAGAG AAATATCTAC TTCATGCGGC TGTTGAAAAT
GGTCAACTAG CAGTAGTCAA GATGTTCCTA AAAAAAGGAG CTAATATACA GGCTAAGGAT
GTAGAGGGTA AATCACCTTT GCATTTAGCT GCTCGTGCAG GCCACCTAGA AATAGCCAAG
CTGTTACTAG AAAAAGGAGC TGATACAGAG GCTAGAAATA GCTATGGTAA TTCGCCCTTG
CACTCTGCTA CTAAGAATGG TCAACTAGAA ATAGCCAAGC TGTTACTAGA ATCCGGAGCA
GATATAGAGG CTAAGGGTGA ATATGATATA TCGCCCTTGG GTTATGCTGT TCATTATAAT
CACCCAGAAG TAGCCAAGCT ATTAATAGAA CACGGCGCAT ATTTCGATAT TAAGGGTAAA
AATCGTATTT TCAATGGTGT TAACATGCTG TTTTGGGTTG CTAGATGCGG TTATTTAGAA
ATAGCCAAGC TGTTACTAGA ACACGGAGCA GATGTAAATG TTAAGGATGA ACGTGGTAAT
TCTCTCTTAA CGAGTTTGTG TGCTTCCCAA AAACCCCATA TAGATACAGC TAAATTTTTA
ATAGAGAAAG GAGCTGATGT AAATGCTAAG GATGGATTGG GTAATACTCC TTTATATAAA
GCTGTAGAGC AAGGCTACTT AGAATTAGCC AGACTATTAA TAGACAAAGG AGCTGACCTG
CTGGCTACAA ATAACCAGGG CTTAACACCT CTGCAGGTAG TTACCCAAAA AAACCATACC
GCATTAGTTG AATTATTAAC CAAAAAGTAA
 
Protein sequence
MKNNYTLTQQ LIARLLLIGL CLQSCGEGFD NHPLISTQEE QIDQVQVTAK QLADTTFIPK 
EDHQIDLDQE ANELQPAASQ GDSSLLNKYQ KVGGQGNNIV VIQEQDRKDK PIKRNEKVGV
VHYSNNKLTI RDSRPATKQP YTSPANKTTQ SIAGQQQLVV NNSMKALLTN QVCITKEGYQ
LKFEQKGAGT LEAIVENRFP TGFTKQRLPV VIDGGLSCTE AAVGNIAWQK QFIHVSDQYV
YVGQNGLLGG GKDKGLVSEE EQEEANEKEP LGTEINLSKE AIKSLWQIGQ EVGEEKYRDR
ALGLLALEAL NKGGVYTTRY LKKLNQLGTI PIIVGNQSAL PAAWQGKTPD ELQAVLHTLY
LQALEGKERH LSTLAKAEQA AEQLAILELK YHFQKLLEPT SLEKEGLVKD IQKGIDRLGI
LPMSELISLR KSYLSYLVRE ALTMDLATNT AHVKKFIEKV GELAQKGKSK AKHSQTEREA
LNAFSGKLYR QLGGLSKKLS QNSHEAVKSI EWQRSYYLKA TKLKDAVAYY QLGLLYGDSN
STYYDMHEAK EALEESARLG YVLAFYALSE HYEKEGDTEL ALYWQEQAAK KGHPQALYKL
SRASEQEARL NYLEQAGFSG DSQAQYALGN YYWGQGDYAT AIEWYNKAAY QGVQASYAYL
GIAARKGLGC PKDIQAALGY YLRSEKAGED NAIARYGRAR LYEKGEGVKA DLGTALSLYT
QASELGHAKA SYHAGKLHLS GQVEGCNIQA GYALLEKAGK AGIYQACLLL GKLYEHGWGR
LPDPTAALHW YSQAAINSSK AKRMSDLAQL ALLHKLDNDI GIEEEKKLSL LDQASLVEKN
KYMRLEEAYQ EAQKAKVAGE EMLQFLRDRN RKYKLDLVEK SRALEQLKGH NEAEKGKLYQ
QLQENEEQVK KLTEKLCLYE QENSLAKKNA ALLHHQLTDK EQAYQALNKQ EVNLRRQLAE
KEEAYKKLMQ EKYLQQKTNK ELAQLLINKE KYLLHAAVEN GQLAVVKMFL KKGANIQAKD
VEGKSPLHLA ARAGHLEIAK LLLEKGADTE ARNSYGNSPL HSATKNGQLE IAKLLLESGA
DIEAKGEYDI SPLGYAVHYN HPEVAKLLIE HGAYFDIKGK NRIFNGVNML FWVARCGYLE
IAKLLLEHGA DVNVKDERGN SLLTSLCASQ KPHIDTAKFL IEKGADVNAK DGLGNTPLYK
AVEQGYLELA RLLIDKGADL LATNNQGLTP LQVVTQKNHT ALVELLTKK