Gene Aasi_1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1648 
Symbol 
ID6376592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp848959 
End bp851301 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content33% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003573085 
Protein GI294661209 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCATA GATATTATTT ATATATACTG ATCAGCAGTA GTTTACTATT ACAAAGTTGT 
TTGGGTACAC GTTACTTGCA AAAGAATGAA TATTTGCTGG CAGATCAATA CGTTATAGGC
AATAAAAAAA TTAAAGTAGG AGAATTAGAT CATTACTATT TACAACATAC CAATCGTAAA
TTGTTAGGTA TCCCTTTCTG GTTATGGATT TATGAGTTAG GTAGGTGTAA TTTTAATAAG
AAAGCGCTTA AGAGAAAGTA TAAGGTGGTA AAATTGAGGT ATGAAAAAAA GCTAGCAGAT
GCGGTTAGTA AACCAGAAGA ACTAAAACAA TTAGAAGAAA CCAAAAAGAG AAAGCTTGAA
GATATTGAAT TTTTGCTTAA GAATGGAAAT TGGTTGATGA GGGTGGGAGA GCGGCCTATT
ATATATAGCC CACAGAAACG TATATCTACT GAGAATAATC TCTTACAATA CCTTCATACT
AAAGGTTTTT TTAATGCAAG TATAAATAGT TTTACAAAAA GAAAAGGTAA AAGGGCTTAT
ATTATTTATC ATATACAGGA AAACCAACCC TTTGTAATTA AAGATATTAG TCTTCATTCA
GCAGACCCTG CTATAGAAAA ACTATTACAG CCTTATGAAG AACAAAGTTT ATTTAAAAAG
GGACAAAATT ATGATCAGGA TATTATAATT GCTGAAAGAA CAAGGATCTA TGATTTACTA
TCAGATAATG GCTATTGGAG CTTTAATAAA CAGTATGTGG CTTTTAATGT GGATACTACC
AGCAACAACC AATCTGTAGC CATAGAAATA GTTGTTCTGT TGCCTACAGA TGGTAAACCA
CATCCAGTAT ATAAGTTAGC TGATATCCAG TTATCTATAT CCTCTACTGA TACTACTGAC
CATAGTTCAG AAGTAGATAC TTATAGGGGT ATGAGTTTTA AAAACACAAA ACAACATTTT
GCCCCTGCTA CAATAGTTGG TAAGATTCCT TTGCAGTTAG GGCAAACTTA TAATAAAAGT
GATATCGTTG AGACTCATAA AAGGCTGGCA AGCTTAGGTA TATTTAAAAA TACACATATA
GGACATGAAA TCATAGATAG TACACATTTA AAGACTAACA TTTGTACAAG CTTATTCGAT
AAATTTCAGC TTGAACAAGA ATTAGGTACA GAATTTACAA GAGTATCTTT TGTGCCTTTT
TATCAATTAT CGTTTAAGAG TAGAAATTTA TTAAGAAAAC TAGAAACACT TACTTTAAAG
TCTCAACTTG GTATAGAAGT TGATTCCTTT TCTACTTCAG ATCAACAAAA AGCCCACTAT
AATGAGGGAA ACTTTCATAC AGCCGTGGAG CTTACTTTGC CACAGCTTTT GTTTCCGCTT
CCAGTTCCGA CAAGGACTAA TTTAAATGCC TATAAACCTA ATACTAAGGT ACATTTGGGT
TATACCTTTA CTAAGCAGGT TAACTATACT AAGCAGAGTA TAAAAACCTT ATTATCGTAT
TTATGGTATC CTAATTCAGC AACTACAGTT GAACTTATAC CAGTTAGTGT TGGCTTAGCA
GATTTTAAAT TGAACCCTGC ATTTAACAAA CAACTAAATG AGAATAAAAA GAGAAAGTAT
AAGCCTGGCC TAATTACTTA CGGTAGTGTA AAGCTTACTT TTAAAAAGGA CGATGAGTGC
AAAAAATACT ACTCTTGCTT AGAAACAATG CTAGAAAGTG GAGGTGCCTT ACAAAATTTA
ATAGATTTTA AAGGATTATT TGGTAATTAT CTAGAGTACT ATAAGTATTT AAAAGCTGAC
TTGGCTTATA GAGAACATAT ACCTCTATAT CCTGGTACTA TTTTTGCTTA TCAATTATAT
ACGGGGATTT TATATCCTTA TAGCGACCAT CAATTAGCAC CAGAGGATAA ATATTATTTT
ATAGGTGGGC CTAATAGCAT AAGGGCTTGG AATTCAAGAG GTCTAGGTCC AGGTTCTTGC
CGCGCTAATG GCACCCAACA AGAATTTATA GAAGATAGAG GCGGCGAGTT TATTTTGCAA
GCTAACCTAG AACTTAGACA AAAGCTCATA GGGTTTATAG AATCAGCATT TTTTATAGAT
GTGGGAAATG TATGGATGTT AAGTAAAAGC GATAGTCCAG GTGATAATTT TGAATTAACA
AGATTTTATA AAGAAATTGC TATAGGTACA GGCGTAGGCT TACGTTTAAA TTTTAATTTT
CTAGTATTGC GTTTCGATTT AGGATTTAAA GTATATGATC CATCACTACG CTTAGAAGAT
CGTATTTTTC CAGAAAGCAT GCTGCAGCCT CGACTTAATA TTGGATTAGG TTATCCTTTT
TAA
 
Protein sequence
MKHRYYLYIL ISSSLLLQSC LGTRYLQKNE YLLADQYVIG NKKIKVGELD HYYLQHTNRK 
LLGIPFWLWI YELGRCNFNK KALKRKYKVV KLRYEKKLAD AVSKPEELKQ LEETKKRKLE
DIEFLLKNGN WLMRVGERPI IYSPQKRIST ENNLLQYLHT KGFFNASINS FTKRKGKRAY
IIYHIQENQP FVIKDISLHS ADPAIEKLLQ PYEEQSLFKK GQNYDQDIII AERTRIYDLL
SDNGYWSFNK QYVAFNVDTT SNNQSVAIEI VVLLPTDGKP HPVYKLADIQ LSISSTDTTD
HSSEVDTYRG MSFKNTKQHF APATIVGKIP LQLGQTYNKS DIVETHKRLA SLGIFKNTHI
GHEIIDSTHL KTNICTSLFD KFQLEQELGT EFTRVSFVPF YQLSFKSRNL LRKLETLTLK
SQLGIEVDSF STSDQQKAHY NEGNFHTAVE LTLPQLLFPL PVPTRTNLNA YKPNTKVHLG
YTFTKQVNYT KQSIKTLLSY LWYPNSATTV ELIPVSVGLA DFKLNPAFNK QLNENKKRKY
KPGLITYGSV KLTFKKDDEC KKYYSCLETM LESGGALQNL IDFKGLFGNY LEYYKYLKAD
LAYREHIPLY PGTIFAYQLY TGILYPYSDH QLAPEDKYYF IGGPNSIRAW NSRGLGPGSC
RANGTQQEFI EDRGGEFILQ ANLELRQKLI GFIESAFFID VGNVWMLSKS DSPGDNFELT
RFYKEIAIGT GVGLRLNFNF LVLRFDLGFK VYDPSLRLED RIFPESMLQP RLNIGLGYPF