Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1107 |
Symbol | |
ID | 6377201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1422596 |
End bp | 1425403 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642682219 |
Product | hypothetical protein |
Protein accession | YP_001958179 |
Protein GI | 189502462 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.489454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTAAAG CCATATTAAT AAAAACCCGT CAAGGGGCTA TCTATAGATA TACCAAGAAT ATTGCCTTAG GTATTGCACT AGTTATAGCA AGCACCAATT TCCATACATG TGTGTCAAAT GGAGTCCACC AAGTAGGAAG CAATTTAAAG ATGGAAGTAA GCTCAGGGGT GGACCGACAT ATGGATGTTC ATTTTGTGCT GAGTGATGAT GTAGAAGAGG CTAGCCTTGC TGGTTATGCA TTAAATGTAA ATCTAATTGT CATAGAAGGA GAGCAGGATA GTAATAATTT GCTAGAATAT CGTAATAGAT CAGGTAAAGG TAAGAGGGCA TCAAGTATCA ATAAACCGTT AAACTATTTT ATAGAGGAGG AAAAGTTAAA CTTAGAAAGC GAGGAGTTAA TGATACCTTT CAAACTTGTA CCGGGAGTAG GTGTAACAAA AGTAAAAGTT CAATTTTTAC TTTTGAATGA GGGTGGCAAT ATTCTCAGTA CATCGTATGT AGATTGGAAT AGTTCTACGG AAGAATCAGA AATAGTAACA GATAATAGGT TAATGCTAAT AGATATTCCG AAGGTAAATG ATGAAGAAAA TATAGCTATA TCAGTAGGTT CTGTTGCTTT AGAACAAATG GGTATGAGTA CAGCAGAAGA AAGTCCGGAT GGTAGTAGAA AAAGAAAGCG TAACCTAGAA AACAGTAGTA GTGCAATGAG TTTAAATGAT GCTATAAAGA GAAAAGAGAA AAGAGCAAAG AGATTAGCTA AAGGCAAAGA AAGCGTAGAA ACTGAAGAAG AAGCTGAAGT AGAAGTTCTA CATGCTTTTA TTGAAGAGAT GGTTATACCT GATGTATACA GCCGTATGTC AGAGATAGAA TTAGTAAAGT GTGCTAATGA TAATAGCCCG TATGCTCAAG AAGCGATTGT TAGGTTGAAT TTACGTTCAG GTATACATGG GTGCCTGCAA TATGATCTTG AACCGGACAA ATGGTCAAAT ATAGAAGAAA AAGCATTACA AGATCAACGA TATGTTCTTT TGCTAATATA TAATTATTAT CGGTATACTT ATTATACTAA GTACGAATGT ATGGATTTCT TTAAATTACC TTTTATAGAA AAGATTATAG AAAAAGTAAG GTTGGCTGCA GAATTAGGAG ATGCACTAGC TCAAACTAAC TTAGGCTATA TATTGTTATT TAAAGCTAAT GATGAAGATA AATGGTCTAC ATCTGTAAAA AGAGAGGTAG CTGAAACATT TCATCAGTCC TTTTGCTGGT TAAAAAAAGC AGCGGATCAA GGAAATGCTC ATGCTCAGCT TGCATTAGGG AGAGATTTCT ATGCACCAGA CTGCATATAT GAAGACTACG AAAGAATGGC TGCCTATTAT GCTCAAGCAG CTAAACAAGG ACATCAAGAA TCACAATTCT TATTAGGTAA TTTATATCAT TATGGACCAG AATCACAGAT AGATGATTAT AAATGGTGTG GATTTTACGA AACCTTAATA CAAGCTAAAG CAGGAGACGT AGAGGCTCAA TCAGAATTAG AGAACACATA TCATTCTTAT ATTAACAGAG ATGAGCTGGA TATTGAAAAA GCTATTTACT GGTATAATGA GGCAGCCAAA CAAGGAAATC CGGAATCCCA ATATCAATTG GCCATCATGT ATTTTGCAGG CCGGGGGGTA GGTTTTAGTA ATAAGCGAGC TATGGAATTA CTTAAGGAAT CTGCAGAAAA AAATTATGAT CTCGCTCTAT CAAAACTAAT AGATATATAT TATAAACGGG GAATAGAAGA AGATATAATA TATGATTCTA AGGAAGCGCT TAAATGGCTA ATCAAGGCAG CAATAGGAAG TTGTGATGAT GCTGGGGAAA TTTTAGGAAA AATGTACTTT AAAGGAGTTC AGATGGCTCA AGATTTTTCT AACGCATTTT TCTGGTTTTA TAAAGAAAGG TATTTTTCAG AGAAATACTC CCTATTTGTT ACCACTCAAT CAGAAGAAGA AGAGATGTCA GAAGGTTCTA ATGATATGGA AGGTGGTGAT AACGAACATG AGGCGTTAGA AGCTATTGAA GCAAACTTAC TTGGAAACTA CCAAGCAATG TTAATACAAA AGGAAAGAGA CCATTTAGGC ACCTGCGCTA CCTTTCAAAT TGGATTATAT AAAAAGCTTG AGGAGATAAT GTTCAAATTT ATACAGTGGA ATTACAAACT TAAAACGGCT AATGGGCTAA TGATAAATAG CTTAGATTTT AAGGATCCAG AATTTAAACA AGCCATTGAT GCACGTCAAA AGTTAACAGG TATTATACCT TATATAAAGG GGCATATATA TCAGGGAAAA AGTTATATAA GTTTTGATCG GCCCATAGTT CAGTTAGCAG ATGAAATTAT GGAAGAAGCA TTAAATCAAC ATTTATATAG AGAGGCTGAA AACATTCTTA ATCATCTTAA ACGTATATAT GAAAAATTGC GTACAAAAAC ACATAAGCGA TCAATTGATA TAGAAACCAG ATACCATTTA CTAGATTCAG ATTTGTCAGA GTTAGATAAA CAGAAGCTCT CACAAAACTT AGAAATAGAA AATGCATTAG AAGAGATATA TACAAATAAA TTAAAATTGC TTGAAGACCA GATCGTAGAA TTGAGGACTT ATTATCAGGC ACTATTTAGG CAGATTGAAA AAGGGTCAGG TATTCGTAAT AAAAATTTTA GAAAAGAGCG CAATATGATT TTAAGAATAG AAGAACCGTA TGCATACCCT CGTACCTTAT TTAGCTTTAA AGTTGTTGAA AAAGATTTTA GTATCTAA
|
Protein sequence | MCKAILIKTR QGAIYRYTKN IALGIALVIA STNFHTCVSN GVHQVGSNLK MEVSSGVDRH MDVHFVLSDD VEEASLAGYA LNVNLIVIEG EQDSNNLLEY RNRSGKGKRA SSINKPLNYF IEEEKLNLES EELMIPFKLV PGVGVTKVKV QFLLLNEGGN ILSTSYVDWN SSTEESEIVT DNRLMLIDIP KVNDEENIAI SVGSVALEQM GMSTAEESPD GSRKRKRNLE NSSSAMSLND AIKRKEKRAK RLAKGKESVE TEEEAEVEVL HAFIEEMVIP DVYSRMSEIE LVKCANDNSP YAQEAIVRLN LRSGIHGCLQ YDLEPDKWSN IEEKALQDQR YVLLLIYNYY RYTYYTKYEC MDFFKLPFIE KIIEKVRLAA ELGDALAQTN LGYILLFKAN DEDKWSTSVK REVAETFHQS FCWLKKAADQ GNAHAQLALG RDFYAPDCIY EDYERMAAYY AQAAKQGHQE SQFLLGNLYH YGPESQIDDY KWCGFYETLI QAKAGDVEAQ SELENTYHSY INRDELDIEK AIYWYNEAAK QGNPESQYQL AIMYFAGRGV GFSNKRAMEL LKESAEKNYD LALSKLIDIY YKRGIEEDII YDSKEALKWL IKAAIGSCDD AGEILGKMYF KGVQMAQDFS NAFFWFYKER YFSEKYSLFV TTQSEEEEMS EGSNDMEGGD NEHEALEAIE ANLLGNYQAM LIQKERDHLG TCATFQIGLY KKLEEIMFKF IQWNYKLKTA NGLMINSLDF KDPEFKQAID ARQKLTGIIP YIKGHIYQGK SYISFDRPIV QLADEIMEEA LNQHLYREAE NILNHLKRIY EKLRTKTHKR SIDIETRYHL LDSDLSELDK QKLSQNLEIE NALEEIYTNK LKLLEDQIVE LRTYYQALFR QIEKGSGIRN KNFRKERNMI LRIEEPYAYP RTLFSFKVVE KDFSI
|
| |