Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0461 |
Symbol | |
ID | 6377727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 558212 |
End bp | 561436 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642681621 |
Product | hypothetical protein |
Protein accession | YP_001957600 |
Protein GI | 189501883 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | [TIGR02116] toxin-antitoxin system, toxin component, Txe/YoeB family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA TAAAGAATCA TTTTAAGCAA ACGGTTGCTG GTATTGTATT AGTTAGTTTC TTACTAGCAA GTTGTACAGT AGACCACCCA GAAATCAATA CAATAACGCC AACAGACAAT AAGATATTTG AGGTGAAAGG AGGAACACGA TCAACAAATC ATCAGCTACA ACTAAGTAAC AAAGAAATTT TATGGTATGG TAATTCGGTA GCTGTAGAAC AGGAACTAAA GGCCCATTTA CAAAGCATAG CTGAAACTTC CTTACAAGCA GATATAGCCA AATATGATAA CTCTAGTGGT ACGTTGCAAC AAGAATCGTT ATCTATAGAA TATCATAAAT TAGGGCAGTT AGTAAACAGG CCATTAGGTT GGTCATCATA TGAAGGACAT CAGTTACGCA AATTATACTA TTTGCCTAAT GGTAATTTGT ATGCTAAGGT TTATGATAGG GGAAATGCTA GCCTTGTGCA AAGGTTGCCT GTTTATGTAA TGCGAGGAAT TGACTTATTT ATGCTAGCTG CATCAAAGGA AAGAGAACAA CAGGTACACA TAACACTTAA CCGTGTAATA GATAAAAAAT TGGCAACATA CGGACGTGTA TGGATAGGGC AAGCAGGGAT GTTAGGAGGT GGCAAAGGAA AAAAGAAGAA TAATAAAAAG TTAGCACCTT CACCTATGCA ACAGTTAGAA GATGTAACAG GGGCTACCCT GGGAGCACAA AAAGAAAATA AAAGGCAAGA ACTTGAAAAA GGTGAGCAAG GTACGTATGA ACAACATAGG CCATTATCCA TGTTGCCTCA GCCAAAGGAG CTATACCATT TAATTCCTGC AAACCCTTAT TCTAAAATAA ATAGCTTGGA AAAAGTAAAA GAAGATTTTA AAAAAGGATT CACGTTAAAG CTGCAGCCAA ATTTTTATGA CAATGCACAT AACGTAGCAG AAGCTATACA ACTTTTTGTG GATGCTGCTA AGTGTGGAAA TAGCCTGGCC ATGTATAATT TGGGAGAAAT TTGCGAGCAA CAGGGAAATA TAGAATTAGC GAAACGGTGG TATGTTTTAT CCTTTCAAAA TACCTGGGTT GAGGGTACTG CAAAAGCTTA TGCTTCTGCG TCACATACAA AAATAACAAA TCTTATCAAG CAAGGGCATG TAGAGCTGAA AAAAGTCTTT TTACATCATG TTGATAATGT TACTACTTTA GTTGAAAAAA TCATCCGGAT CAAGGTAATT AGTTCATTTT ATAAAAGTCG CCTGGCTGAA CCTTACCTTA GCAAAGAAGA AAATGGGCTA AAAGTAAAGC GCTTAAGAAA ATTGACAGAA GAGTTAGAAC TAGCTTTAGA TAGTTTACCT CAAGAGAAAA TATATTTTAA TATGGTCCGT CGGCTTTCAG CCCTAGGAGC CCGATATGTA CAAGAGCAAA ACTATAAGGC TGCTCAAGCC TGTTTTCTTA AATGCCAAAA GTTGCCAGAC GCACTTTATA ATTTAGGTTT GTTTTGTCAA AATGGATATA CTAGCGCTGA TGGTAAACCT GACTATCAAG CAGCCAAACA GTTTTATAAG CGATCTGGAA CTGCAGAATC TTTTAGTAAC CTAGGGGCTT TTTATATGGA TGGTTTGCTG AATGGAAACC CTGACCTTCA AAAAGCTATA AAATATTCTG AAATGTCTGG CACACCGCAA GCTCTATGCA ATATGGGAGT TATTTTCTCA AGACGGTATG TAGAAGTTCT TAGACAAGCT CCTGATTATA AACGGGCTAA AGTGTGCTTT GAACGTTCTG GCACGGGACA TGCTCATGCG CACTTGGGCG ATTTTTATTG CTATGGTTAT ATTACTAAAC CTAATTATAA ATTAGCAAAA TATCATTATG ACATGGCTGT AGAAAAAGGA TATTCAGATG CTATAGGAGG CATGGGAGAT CTTTATGCAC TAGGATATTG TAGTTTAAAT GGCAAGCCTA ACTACCAGTT ATTTGCAAAT TCCATGTTAG AAGCTATTAG TACGCCGACA GCCTCTATAC CTATTAAAAA ACACTGCTCA TATAACCTTG GAGTAGCTTA TATGAACGGG TGGATAGGCA TCAAAAAAAA GAAGCCAGAT TATAGGCAAG CTAAGTACTA TTGGGAGCAA TCCGAACTAC CTCAAGCATT TTACCATATT GCCACATTAT ATGAGAAAGC ACATATCAAA GCTGGCGAAG GAAATACTAA ATATCAAAAA GCATTGGAGT ACTATAAGCG GTCAAGTCTT CCTAAAGCGA AGTTAGAGAT TATCCGTTTG TATGATTCAA ACTTAGTTAT TACACAGTCT GAAGAAGAAA GGTTAGCAGC CCTCCAAAGT GCAATACAAG AAGTTCATGA TTTGTTACCT ACCTTGTCTG ATAAGGATGC TTGTTATATA AGGGGTGTTT TAGCTTATTA TTGTAAAGAT TGGCAAGATG CATATACCTA CTTAAACCAA GCTATTCTTT TAGGAACAGA AGAGGAAGAG GTAAAAGAGT TGGCCGAAAC AGTAACAAGC TACTTAGAAA AAGAATCAGC ATTATTAAGC ATACAAAAAG AAGAGCAAAT TTTTGAAAAA GATAGAAAAG ATGTTACTAT AGAAAATATA CAGCCCTTAG AGAGCCACGT AGCAGCAACT GATGTCTTGC AGTTAGGAGA TACAGAAACT AGTACATATA TTATAGAAAA GGCAAGCACT AATACCACTT TTACTGAGGT TGAAACAGTA CTTGAACAAC AAAATGAAGG TTGTTACGTT GAATTAATAA TGCCTGGTTT AAGTGTTCAA GAACAGGCAC AGCAAAATAT TCAACTATGC CAAAGAGTTA CAAAAAGGGA AAGGAAAGAG CAAAAGCGAG TGCGTAGGGT AACTCGTATA AAGTTAGATT TTTTACGAAT GAATGATCAA CAATCAGAGA TTATACAAGA AAACTCATTG CCTATTGTAT TTAGATTTTT AGATCATAAG CAAGAGAAAG AATTTTTAGC ATTCAAAGAA AAAGAAGAAC ATAAAAAGAG TGTTGAAAAG GTTTTAGAAG ATATAAAAAG CCATAGTTGG GAGGCTGTAG GGCTTGGCAG ACCAGAAGTT TTAAAACATG CTTACAAAGG TTATAGGGGC TGTATCTCTC GTCACCTAAA CCATAAAGAT AGGCTTGTTT ATAAGGTAAT AGGAAAAGGA GAAATTTTAA TACTTTCTTG GCAAGGCCAC TATGAAGATA AATAA
|
Protein sequence | MEKIKNHFKQ TVAGIVLVSF LLASCTVDHP EINTITPTDN KIFEVKGGTR STNHQLQLSN KEILWYGNSV AVEQELKAHL QSIAETSLQA DIAKYDNSSG TLQQESLSIE YHKLGQLVNR PLGWSSYEGH QLRKLYYLPN GNLYAKVYDR GNASLVQRLP VYVMRGIDLF MLAASKEREQ QVHITLNRVI DKKLATYGRV WIGQAGMLGG GKGKKKNNKK LAPSPMQQLE DVTGATLGAQ KENKRQELEK GEQGTYEQHR PLSMLPQPKE LYHLIPANPY SKINSLEKVK EDFKKGFTLK LQPNFYDNAH NVAEAIQLFV DAAKCGNSLA MYNLGEICEQ QGNIELAKRW YVLSFQNTWV EGTAKAYASA SHTKITNLIK QGHVELKKVF LHHVDNVTTL VEKIIRIKVI SSFYKSRLAE PYLSKEENGL KVKRLRKLTE ELELALDSLP QEKIYFNMVR RLSALGARYV QEQNYKAAQA CFLKCQKLPD ALYNLGLFCQ NGYTSADGKP DYQAAKQFYK RSGTAESFSN LGAFYMDGLL NGNPDLQKAI KYSEMSGTPQ ALCNMGVIFS RRYVEVLRQA PDYKRAKVCF ERSGTGHAHA HLGDFYCYGY ITKPNYKLAK YHYDMAVEKG YSDAIGGMGD LYALGYCSLN GKPNYQLFAN SMLEAISTPT ASIPIKKHCS YNLGVAYMNG WIGIKKKKPD YRQAKYYWEQ SELPQAFYHI ATLYEKAHIK AGEGNTKYQK ALEYYKRSSL PKAKLEIIRL YDSNLVITQS EEERLAALQS AIQEVHDLLP TLSDKDACYI RGVLAYYCKD WQDAYTYLNQ AILLGTEEEE VKELAETVTS YLEKESALLS IQKEEQIFEK DRKDVTIENI QPLESHVAAT DVLQLGDTET STYIIEKAST NTTFTEVETV LEQQNEGCYV ELIMPGLSVQ EQAQQNIQLC QRVTKRERKE QKRVRRVTRI KLDFLRMNDQ QSEIIQENSL PIVFRFLDHK QEKEFLAFKE KEEHKKSVEK VLEDIKSHSW EAVGLGRPEV LKHAYKGYRG CISRHLNHKD RLVYKVIGKG EILILSWQGH YEDK
|
| |