Gene Aasi_0944 details


Gene Information

Locus tag: Aasi_0944
Symbol:
ID: 6376973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1212958
End bp: 1215357
Gene Length: 2400 bp
Protein Length: 799 aa
Translation table: 11
GC content: 35%
IMG OID: 642682072
Product: hypothetical protein
Protein accession: YP_001958033
Protein GI: 189502316
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
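
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the function name is illustrative and not part of any database tool.

```python
# Illustrative consistency check for the values in this record.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Compare the coordinate span with the reported gene and protein lengths."""
    span = end_bp - start_bp + 1      # inclusive span in base pairs
    codons = span // 3                # number of complete codons in the span
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "divisible_by_three": span % 3 == 0,
        # One codon is assumed to be the stop codon, which encodes no residue.
        "protein_matches": codons - 1 == protein_length_aa,
    }

result = check_gene_lengths(1212958, 1215357, 2400, 799)
print(result["span_bp"])  # 2400
```

With these values the span is 2400 bp, which gives 800 codons: 799 residues plus one stop codon.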


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAC CAAATTTTTT TACTAAAGGA TTGGCAGCTA TTCTATTGAC AATCCACATG 
ATCTCTGTAA CACATGCAGA AATCCGTTTC ACTGAAGTAC AAGGAATTTA TCAATCTACA
TCCTCTGAAC ATAGAGCCAT TTCACCTTTT TGGGGACTAG TAGCCGTTCA CGGCAAACTG
TTAGAACGCA TCCGTTTTTT TGGGTTTTAC CAGAAACAAA CAGTGGGAGA AAGGTTAAAA
GATGTAGCTT CGGATAAAAT TGCAGAATTA CTACTTGAGT TATTTCCTAG TTTAGACGGA
CTTCAATTTG TTCCTAACAT ACAAGAAAGG ACACTTCTAG GTAAACTAAG TGAAAGCAAG
AATTTACCTC TTATTAAAGA AATCATTGGA ATACTATTAG CTGATCAAGA CCCGCGCGTT
CAGGAAACAA ACCTTCTTCC AATATTAAAA CCACTTAGCA ACAGCAGTAG TAAAAAATAT
GCAAAGCTTT TGGCAGATGC ATATGCAGAA CAAGAATTGT ATGCGAATCC TAAAGAGAAT
CCTACTAAAC AACGTTACCC TGAGAATATT GTTATTATAT CTCTTTTAAG CTTTTTTATT
AAAGCTGCTG AGAATAAGTC TCTGTTGGGC CTGGACCCAT TAGTATCCAA TGATATTTAC
GACAAAGTCT TATACGACAA GCTTAAAAAT AATTTCCCAC AAATTGTCGC AGATTTAGAG
AATCCAGCTA ATGTAGAATT AGTATTTTTT TTGGCAAAGG GATTTGAGGC TTATGAAAAT
TTAGTAGCAG AGCCAGTTAC CTATTCTAAA AACATTACCA TTCAAGGAGG CACAAAACCC
TTTTCAGATT GTGGCGAATA TTCATTAAGG AACCTTTTTA TGCTGTTATT ATCGGTAGAA
AATGGAGGGA TGATAGCTGT AAGAAAATTA GAGGAGCTGG AAGAAAAAGT TTTTAAAAAC
AGAATACAGA CTATCCATGA TCCCGATGAT TTTAAAAAAT ACCTGCCTTA CCAAAATTTT
AAAGATTTTA TACTTGCACA TTCAGACGTT ACATACAACT CTATAGAGCT TCATAGTGCC
TGGGCTGAAA TTGTGTCCAA TCTAAATGAG AACTCAACTG CACAAGGGAT CAACCGTGTT
CAGTATGGAA ATAAGAAGGT AGGGGCAGAT GGAGAAAGAT ATGAGATTAA ATCAAATTTC
CCGAATGCAG AAGCTATAAG CATTTTTAAT ATGTTCAATG TAATTGCCCG TATTATACCA
GATAAAACTT TAAACGAAGG TTGGAATGAA CGTGATATAA AGGAACGGTA TAAGCAAGCT
GAGGAAAAGC TCACACACCT TTGTTATTTA TTTAGCAACG ACAATATAGC AGTAGATTGG
CGAAATGTAT TGACAGGTGA TAAAAAAATA GATTCTAACT TTATGTCAAT CATTTTTACC
ATCAATAATC AAGATGCGCT TAAATGGATT TTTCAAAATG GGCATTTTGA TATAGAACGT
ATTAGTACAG TTAAGAGTGA TTGGCGTACT GATTTTGCAT ACTTAAAGTA TCCAAACGAA
TGGCTGGCTT CCCTATTTAC CAACACATTA AATAATAAAG AAAAAGAAGA AAATTATGAA
AAACTGTTAC CTTTAACAGT CATTTATAAT GCTTCACTAA AAACACTCGA AGGTGTTAAG
GAAACCATTG AACTAGTGTT GAATAAAAAG TTGGAAAATT TTTATCCACT CATTAACCGA
TGGGTACAAC AATCTATTTT TACACATTCT GAAAATCCTC ATTTTTTAAT TGACATGATA
GAGTTTATAG CAAACATACC TACCACCAAA ATTGATGCTG CATTTCAGCA GCAAATTATA
CGAGACGGCA TGGTTGTAGC AAAAGATAAA GGACTATTGG AAAAAGACAT TGCATTGCTA
GCATTTAAAA CTAAGAAATA CAAAGCAGCT GGAGAGCTTA TCCAAGCAGG TGCTAACACT
ACGGAAAAAG GAGAAAGAGG GAACACGATT TCTCACTTAG CTGTGGATGA GATAAACCTA
GAAATTATAG ACAGCCTTAT TGAAGCTGGG GCAGATATTG ACATCAATAA TGATGATGGC
CGTACACCAC TAAATTTGTT TATATCTAAA CCAAATGCTG AATCGCCAGA GAATCTACCA
ATTGTAGAAA AGCTTATCCG TGCTACCGGC AATATTAACA TGCAAAGTCA CCAGGGTAAT
ACAGTCTTAC ACTTGGCCGT ATCTCAAGGC AAAATGAAGA TTTTTGATAT GCTTATTAAC
CAAGCAAAAC CTGATGTTAA TATAACTAAT AAGAGTGGGC AAACTCCCTT AGCTTTAGCA
AAAGCCAGGA ATAATGAAAC GGCTGCTGAA ATATTGCGTA AACACGGGGC AACTGAATAA
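
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases; the helper name gc_percent is hypothetical. No result for the full gene is claimed here.

```python
# Minimal GC-content helper; the name gc_percent is illustrative.

def gc_percent(dna):
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()   # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)

# First two ten-base blocks of the gene sequence, as a small example.
print(gc_percent("ATGAAAAGAC CAAATTTTTT"))  # 20.0
```

Passing the complete gene sequence to the same function gives a value that can be compared with the GC content field of the record.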
 
Protein sequence
MKRPNFFTKG LAAILLTIHM ISVTHAEIRF TEVQGIYQST SSEHRAISPF WGLVAVHGKL 
LERIRFFGFY QKQTVGERLK DVASDKIAEL LLELFPSLDG LQFVPNIQER TLLGKLSESK
NLPLIKEIIG ILLADQDPRV QETNLLPILK PLSNSSSKKY AKLLADAYAE QELYANPKEN
PTKQRYPENI VIISLLSFFI KAAENKSLLG LDPLVSNDIY DKVLYDKLKN NFPQIVADLE
NPANVELVFF LAKGFEAYEN LVAEPVTYSK NITIQGGTKP FSDCGEYSLR NLFMLLLSVE
NGGMIAVRKL EELEEKVFKN RIQTIHDPDD FKKYLPYQNF KDFILAHSDV TYNSIELHSA
WAEIVSNLNE NSTAQGINRV QYGNKKVGAD GERYEIKSNF PNAEAISIFN MFNVIARIIP
DKTLNEGWNE RDIKERYKQA EEKLTHLCYL FSNDNIAVDW RNVLTGDKKI DSNFMSIIFT
INNQDALKWI FQNGHFDIER ISTVKSDWRT DFAYLKYPNE WLASLFTNTL NNKEKEENYE
KLLPLTVIYN ASLKTLEGVK ETIELVLNKK LENFYPLINR WVQQSIFTHS ENPHFLIDMI
EFIANIPTTK IDAAFQQQII RDGMVVAKDK GLLEKDIALL AFKTKKYKAA GELIQAGANT
TEKGERGNTI SHLAVDEINL EIIDSLIEAG ADIDINNDDG RTPLNLFISK PNAESPENLP
IVEKLIRATG NINMQSHQGN TVLHLAVSQG KMKIFDMLIN QAKPDVNITN KSGQTPLALA
KARNNETAAE ILRKHGATE
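
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the compact NCBI string layout (bases ordered T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons. As a simplification, start codons get no special handling, which does not matter here because the gene begins with ATG. The function name translate is illustrative.

```python
# Codon table in the compact NCBI layout; amino acid assignments are the
# same for translation table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":               # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

# First three ten-base blocks of the gene sequence.
print(translate("ATGAAAAGAC CAAATTTTTT TACTAAAGGA"))  # MKRPNFFTKG
```

The ten residues produced match the start of the protein sequence listed above, and the final codon of the gene, TAA, is a stop codon.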