Gene Aasi_1443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1443 
Symbol 
ID6377493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1864314 
End bp1867475 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content36% 
IMG OID642682512 
Producthypothetical protein 
Protein accessionYP_001958461 
Protein GI189502744 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.639099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAT CTTATACTAT AAATCAGCAA TTTATAGGTT GCCTTTTACT TGTAAGCTTG 
CTTTTACAAA GCTGTAGTGG TTTAGGTAAC CCATGTATGC CTATTGAGAA AAACAAGATA
GCGCATATAC AAACTGATAC TTCTACAGCA ACAACTGCAT ATCAGATTGA TGTAACAGAA
TCTGTACCTG TATTGGCAGA GGATTCTTCC ACTAGTGGCC AAGCTTTAGT CCAATTATCT
ATGTACGATA ATAGTTTACC TGATATAGCA AAACAAGAAA TTAAAGCAAC TATAAGAAGT
GAACAACTAT CTAATAATAT ACATATAGGC AAACAGATAC AGTTAATAAA TGCTCCAGTA
AAATCAGTGC AATCATCTAT CCATTCATCT ATTAGCAAGA CTACTGAATT AATCAGGTCT
AAACAGCATA TAAGAAAAAA CCAACATACT GCAGCAGAAA AAAGAAATAA ACATTTGACA
GCTAGCTTAT TAAAATACCA GCAATACACA ATAAAAGGAG GGTATGAGAT ACAGTTCTCG
CAGACTAAAG GAAAATTACA AGCCATAGTA AGAAAGTGTT ATCCTACAGG ATTTAGCGAG
CAAGTATTAC CTGTTATTAT AACACCAGGA TTTAGCTTTA CAGAAAAAGA GGTAGTTAAT
GAAGGCTGGC AACAACAATA TGTACATATT TTCAAGGATT ATGTATATGT AGGCCAAAGT
GGTCTGCTGG GTGGCATGAA ACTAGGGTAT CGAGGATATG TAGAAGATGA GCTTCATACG
AGTGCTGTGC CTGATGAGTA TTGTTGCCCA ATCACCAAAC AAATTATGGC TGAGCCTGTT
ATGGCAGCAG ATGGTTATAC TTATGAGAAA AGTGCTATTG AGCAACATAT GAATGAGAAA
GGAGCCATTA GCCCTTTTAT CCGAAAACCG TTAACCAGTA CGAACTTAAT ACCTAACCAG
GGGCTAAAAA GAGCTATCCA AAATTATGTA GAAAAGAATA AGAAATTTTA TGAACAACAG
TGTATTAAAG CAATACAAGA AGTTGATATT AATTCATTGC TACATTTAGA AAAGTTGGGT
ATTAATATTG ATGTGGCTGA TGAAAATGGC TGGACATTGA TACACTATGT TATTTATCAA
GCAAAAATAG AACACATAAA GTTTTGCTTA AATAGGAAAT TTAACATTAA TGCTGCTAGT
GCAAGTTTAA AGGGATTATA TTTTCCACCG GATTTACCGC AATTAATTGC CGCTCAGTTA
GAAGAAATCA ATATAAAAGC TGCCAAATAT AATCTTTCTG CAAAGATTAC TGAGCAAACG
ATATTCCAAA TAATGAGAGA ACTAGAAAAC CAAGCTAAAG CAAATGATTA CTCAATTGAA
AAAGGCCAGC AAAGCTGGGA GGCCTTTGAA AGGGATGCCA TAAAGGTTGA ACAAGCATCC
TATAACAAAG CTTTTTTTTC CGGATCGGCC CAAAATGCTT GGCATATGAC TTCTAATAGA
AGTTCCTATT TTTCAGGTTC AGAATCGCGT CGGAGGAATT GGCAACACTA TCACGATAAC
CTTTCTCGCT GGCTAGATGA GCGAAATCAA GTTGTAGCTC TATTGGCTGA GGTTAAAAAA
ATTGAACAAA ATATAAATAT TTTGCAGCAA AGAAAACAAT GTGTATTTAA TAACTTAGCG
CCCCTCCATT TGGCAACAGC TCAAAAAAAT GAAAAAATAA TAATGCAGTT AATTCAGCTA
GGTGCTGATT TAGAATTAAA AGATGGCAAT GGAACAACGC CAATTTTTTG GGCTGTTTAC
CAAAATGAAT TAAAACTTTT AAAATTGTTT GTTGACAATG GGGCAAATTT ACAAGTATCT
GATAATCAAG GTAATACCTT ACTTCATGTT GCTGCACAAT ATGCTGATTT AAATATCATC
AATTATTTAA TTGAAATCGG TTTTTATCAT TTGCAGGAAA ATCATCTAGG GCAAACAGCT
ATTGCTGTTG CCTTAAAGTA TGAGCGCAAA GCAATTGCCG ATTTTATTAA TAAAAAAGGG
GCTGAAGGTT TACAAGCAGC GCTTTTTAGA ATCAACCGAG GTCAACAACA GCGTAAGGAT
TCTTCTACAA TTTTGGGCTC TGGTAGCAGT CAGTCGAATC TTTATTATCC AGCTTCCTTA
AGCAATCCTA CTCCATTAAA TAAACCAAGC AATACAACCA GCCTTGCTAG TAATACCAAT
CTGCATCTAA CTACCTTACC GCCATCACCT GCTTATGATG AACCGCAAAG CGCTGGTCGT
TTAGCCCCAC CACTTACCTC TGCAACGCAG GTAACTGAAC AATTTAGTCG GCTTAGGATT
TCGTACGAGA TTCCTTATCA GGCTCTCCAT TTTCAACAAG AGCTTGGCCG TGGAGGGTTT
GGGATTGTGT ACAAAGGTGC TTATCAAGAC AAGCTAGTGG CAATAAAACA GTTAATGAAT
CAGGACTTAT CTAAAGCTCT CATACATAAT TTCAAACAAG AGATTAGTAT GATGGCAAGG
TTGGAATCAC CTTATGTTAT TAAATTTATT GGTGCTTGTT TCCAAGCGCC ACACTATTCT
CTGGTGATGG ATTATATGCC TAATGGGGAT CTTTACCACT TTCTTCAAAA ACCAGGACAA
ATAGATTGGC AGCTACGATA TCAAATTGCT ACTGATATCG GCCATGGTGT AAATTATCTG
CACTCACACG GTATTATTCA TGGTGATTTA AAAAGTCTAA ATATTTTATT AGATAAAAAT
TATCAAGCCA AAATAACGGA CTTTGGCTTG GCTAAAATTA AGATATCTAG TTCGATTAGT
ACCTTAGTGG GAGGTCAGAA AGGAGGGTCG CTCCGTTGGA TGGCACCTGA GCTCTTAACA
GCCGAAGAAG AAGAAACTAG TAATACAAAA GCCTCCGATG TTTATAGTTA TGGTATGGTA
TTATGGGAAC TTGGCGCAAG GCAAATACCT TATGCTAATA AGAGGGACCC TCAAGTTTTG
GCTTTAAAAT TACAAAATAA ACATGAGCCC ATCACTCCGG ACACTCCTCC GTCAATATCT
GCACTTATCC AATGGTGTTG GAAAGAAAGA ACCAAAAGAC CTGCAATTAC TGAAGCGGTA
GAAACCTTGG AAAAAGAGCA GAGGTTACTT TTAAATAAAT AG
 
Protein sequence
MKRSYTINQQ FIGCLLLVSL LLQSCSGLGN PCMPIEKNKI AHIQTDTSTA TTAYQIDVTE 
SVPVLAEDSS TSGQALVQLS MYDNSLPDIA KQEIKATIRS EQLSNNIHIG KQIQLINAPV
KSVQSSIHSS ISKTTELIRS KQHIRKNQHT AAEKRNKHLT ASLLKYQQYT IKGGYEIQFS
QTKGKLQAIV RKCYPTGFSE QVLPVIITPG FSFTEKEVVN EGWQQQYVHI FKDYVYVGQS
GLLGGMKLGY RGYVEDELHT SAVPDEYCCP ITKQIMAEPV MAADGYTYEK SAIEQHMNEK
GAISPFIRKP LTSTNLIPNQ GLKRAIQNYV EKNKKFYEQQ CIKAIQEVDI NSLLHLEKLG
INIDVADENG WTLIHYVIYQ AKIEHIKFCL NRKFNINAAS ASLKGLYFPP DLPQLIAAQL
EEINIKAAKY NLSAKITEQT IFQIMRELEN QAKANDYSIE KGQQSWEAFE RDAIKVEQAS
YNKAFFSGSA QNAWHMTSNR SSYFSGSESR RRNWQHYHDN LSRWLDERNQ VVALLAEVKK
IEQNINILQQ RKQCVFNNLA PLHLATAQKN EKIIMQLIQL GADLELKDGN GTTPIFWAVY
QNELKLLKLF VDNGANLQVS DNQGNTLLHV AAQYADLNII NYLIEIGFYH LQENHLGQTA
IAVALKYERK AIADFINKKG AEGLQAALFR INRGQQQRKD SSTILGSGSS QSNLYYPASL
SNPTPLNKPS NTTSLASNTN LHLTTLPPSP AYDEPQSAGR LAPPLTSATQ VTEQFSRLRI
SYEIPYQALH FQQELGRGGF GIVYKGAYQD KLVAIKQLMN QDLSKALIHN FKQEISMMAR
LESPYVIKFI GACFQAPHYS LVMDYMPNGD LYHFLQKPGQ IDWQLRYQIA TDIGHGVNYL
HSHGIIHGDL KSLNILLDKN YQAKITDFGL AKIKISSSIS TLVGGQKGGS LRWMAPELLT
AEEEETSNTK ASDVYSYGMV LWELGARQIP YANKRDPQVL ALKLQNKHEP ITPDTPPSIS
ALIQWCWKER TKRPAITEAV ETLEKEQRLL LNK