Gene Aasi_0455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0455 
Symbol 
ID6377754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp545921 
End bp549145 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content36% 
IMG OID642681615 
Producthypothetical protein 
Protein accessionYP_001957594 
Protein GI189501877 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID[TIGR02116] toxin-antitoxin system, toxin component, Txe/YoeB family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA TAAAGAATCA TTTTAAGCAA ACGGTTGCTG GTATTGTATT AGTTAGTTTC 
TTACTAGCAA GTTGTACAGT AGACCACCCA GAAATCAATA CAATAACGCC AACAGACAAT
AAGATATTTG AGGTGAAAGG AGGAACACGA TCAACAAATC ATCAGCTACA ACTAAGTAAC
AAAGAAATTT TATGGTATGG TAATTCGGTA GCTGTAGAAC AGGAACTAAA GGCCCATTTA
CAAAGCATAG CTGAAACTTC CTTACAAGCA GATATAGCCA AATATGATAA CTCTAGTGGT
ACGTTGCAAC AAGAATCGTT ATCTATAGAA TATCATAAAT TAGGGCAGTT AGTAAACAGG
CCATTAGGTT GGTCATCATA TGAAGGACAT CAGTTACGCA AATTATACTA TTTGCCTAAT
GGTAATTTGT ATGCTAAGGT TTATGATAGG GGAAATGCTA GCCTTGTGCA AAGGTTGCCT
GTTTATGTAA TGCGAGGAAT TGACTTATTT ATGCTAGCTG CATCAAAGGA AAGAGAACAA
CAGGTACACA TAACACTTAA CCGTGTAATA GATAAAAAAT TGGCAACATA CGGACGTGTA
TGGATAGGGC AAGCAGGGAT GTTAGGAGGT GGCAAAGGAA AAAAGAAGAA TAATAAAAAG
TTAGCACCTT CACCTATGCA ACAGTTAGAA GATGTAACAG GGGCTACCCT GGGAGCACAA
AAAGAAAATA AAAGGCAAGA ACTTGAAAAA GGTGAGCAAG GTACGTATGA ACAACATAGG
CCATTATCCA TGTTGCCTCA GCCAAAGGAG CTATACCATT TAATTCCTGC AAACCCTTAT
TCTAAAATAA ATAGCTTGGA AAAAGTAAAA GAAGATTTTA AAAAAGGATT CACGTTAAAG
CTGCAGCCAA ATTTTTATGA CAATGCACAT AACGTAGCAG AAGCTATACA ACTTTTTGTG
GATGCTGCTA AGTGTGGAAA TAGCCTGGCC ATGTATAATT TGGGAGAAAT TTGCGAGCAA
CAGGGAAATA TAGAATTAGC GAAACGGTGG TATGTTTTAT CCTTTCAAAA TACCTGGGTT
GAGGGTACTG CAAAAGCTTA TGCTTCTGCG TCACATACAA AAATAACAAA TCTTATCAAG
CAAGGGCATG TAGAGCTGAA AAAAGTCTTT TTACATCATG TTGATAATGT TACTACTTTA
GTTGAAAAAA TCATCCGGAT CAAGGTAATT AGTTCATTTT ATAAAAGTCG CCTGGCTGAA
CCTTACCTTA GCAAAGAAGA AAATGGGCTA AAAGTAAAGC GCTTAAGAAA ATTGACAGAA
GAGTTAGAAC TAGCTTTAGA TAGTTTACCT CAAGAGAAAA TATATTTTAA TATGGTCCGT
CGGCTTTCAG CCCTAGGAGC CCGATATGTA CAAGAGCAAA ACTATAAGGC TGCTCAAGCC
TGTTTTCTTA AATGCCAAAA GTTGCCAGAC GCACTTTATA ATTTAGGTTT GTTTTGTCAA
AATGGATATA CTAGCGCTGA TGGTAAACCT GACTATCAAG CAGCCAAACA GTTTTATAAG
CGATCTGGAA CTGCAGAATC TTTTAGTAAC CTAGGGGCTT TTTATATGGA TGGTTTGCTG
AATGGAAACC CTGACCTTCA AAAAGCTATA AAATATTCTG AAATGTCTGG CACACCGCAA
GCTCTATGCA ATATGGGAGT TATTTTCTCA AGACGGTATG TAGAAGTTCT TAGACAAGCT
CCTGATTATA AACGGGCTAA AGTGTGCTTT GAACGTTCTG GCACGGGACA TGCTCATGCG
CACTTGGGCG ATTTTTATTG CTATGGTTAT ATTACTAAAC CTAATTATAA ATTAGCAAAA
TATCATTATG ACATGGCTGT AGAAAAAGGA TATTCAGATG CTATAGGAGG CATGGGAGAT
CTTTATGCAC TAGGATATTG TAGTTTAAAT GGCAAGCCTA ACTACCAGTT ATTTGCAAAT
TCCATGTTAG AAGCTATTAG TACGCCGACA GCCTCTATAC CTATTAAAAA ACACTGCTCA
TATAACCTTG GAGTAGCTTA TATGAACGGG TGGATAGGCA TCAAAAAAAA GAAGCCAGAT
TATAGGCAAG CTAAGTACTA TTGGGAGCAA TCCGAACTAC CTCAAGCATT TTACCATATT
GCCACATTAT ATGAGAAAGC ACATATCAAA GCTGGCGAAG GAAATACTAA ATATCAAAAA
GCATTGGAGT ACTATAAGCG GTCAAGTCTT CCTAAAGCGA AGTTAGAGAT TATCCGTTTG
TATGATTCAA ACTTAGTTAT TACACAGTCT GAAGAAGAAA GGTTAGCAGC CCTCCAAAGT
GCAATACAAG AAGTTCATGA TTTGTTACCT ACCTTGTCTG ATAAGGATGC TTGTTATATA
AGGGGTGTTT TAGCTTATTA TTGTAAAGAT TGGCAAGATG CATATACCTA CTTAAACCAA
GCTATTCTTT TAGGAACAGA AGAGGAAGAG GTAAAAGAGT TGGCCGAAAC AGTAACAAGC
TACTTAGAAA AAGAATCAGC ATTATTAAGC ATACAAAAAG AAGAGCAAAT TTTTGAAAAA
GATAGAAAAG ATGTTACTAT AGAAAATATA CAGCCCTTAG AGAGCCACGT AGCAGCAACT
GATGTCTTGC AGTTAGGAGA TACAGAAACT AGTACATATA TTATAGAAAA GGCAAGCACT
AATACCACTT TTACTGAGGT TGAAACAGTA CTTGAACAAC AAAATGAAGG TTGTTACGTT
GAATTAATAA TGCCTGGTTT AAGTGTTCAA GAACAGGCAC AGCAAAATAT TCAACTATGC
CAAAGAGTTA CAAAAAGGGA AAGGAAAGAG CAAAAGCGAG TGCGTAGGGT AACTCGTATA
AAGTTAGATT TTTTACGAAT GAATGATCAA CAATCAGAGA TTATACAAGA AAACTCATTG
CCTATTGTAT TTAGATTTTT AGATCATAAG CAAGAGAAAG AATTTTTAGC ATTCAAAGAA
AAAGAAGAAC ATAAAAAGAG TGTTGAAAAG GTTTTAGAAG ATATAAAAAG CCATAGTTGG
GAGGCTGTAG GGCTTGGCAG ACCAGAAGTT TTAAAACATG CTTACAAAGG TTATAGGGGC
TGTATCTCTC GTCACCTAAA CCATAAAGAT AGGCTTGTTT ATAAGGTAAT AGGAAAAGGA
GAAATTTTAA TACTTTCTTG GCAAGGCCAC TATGAAGATA AATAA
 
Protein sequence
MEKIKNHFKQ TVAGIVLVSF LLASCTVDHP EINTITPTDN KIFEVKGGTR STNHQLQLSN 
KEILWYGNSV AVEQELKAHL QSIAETSLQA DIAKYDNSSG TLQQESLSIE YHKLGQLVNR
PLGWSSYEGH QLRKLYYLPN GNLYAKVYDR GNASLVQRLP VYVMRGIDLF MLAASKEREQ
QVHITLNRVI DKKLATYGRV WIGQAGMLGG GKGKKKNNKK LAPSPMQQLE DVTGATLGAQ
KENKRQELEK GEQGTYEQHR PLSMLPQPKE LYHLIPANPY SKINSLEKVK EDFKKGFTLK
LQPNFYDNAH NVAEAIQLFV DAAKCGNSLA MYNLGEICEQ QGNIELAKRW YVLSFQNTWV
EGTAKAYASA SHTKITNLIK QGHVELKKVF LHHVDNVTTL VEKIIRIKVI SSFYKSRLAE
PYLSKEENGL KVKRLRKLTE ELELALDSLP QEKIYFNMVR RLSALGARYV QEQNYKAAQA
CFLKCQKLPD ALYNLGLFCQ NGYTSADGKP DYQAAKQFYK RSGTAESFSN LGAFYMDGLL
NGNPDLQKAI KYSEMSGTPQ ALCNMGVIFS RRYVEVLRQA PDYKRAKVCF ERSGTGHAHA
HLGDFYCYGY ITKPNYKLAK YHYDMAVEKG YSDAIGGMGD LYALGYCSLN GKPNYQLFAN
SMLEAISTPT ASIPIKKHCS YNLGVAYMNG WIGIKKKKPD YRQAKYYWEQ SELPQAFYHI
ATLYEKAHIK AGEGNTKYQK ALEYYKRSSL PKAKLEIIRL YDSNLVITQS EEERLAALQS
AIQEVHDLLP TLSDKDACYI RGVLAYYCKD WQDAYTYLNQ AILLGTEEEE VKELAETVTS
YLEKESALLS IQKEEQIFEK DRKDVTIENI QPLESHVAAT DVLQLGDTET STYIIEKAST
NTTFTEVETV LEQQNEGCYV ELIMPGLSVQ EQAQQNIQLC QRVTKRERKE QKRVRRVTRI
KLDFLRMNDQ QSEIIQENSL PIVFRFLDHK QEKEFLAFKE KEEHKKSVEK VLEDIKSHSW
EAVGLGRPEV LKHAYKGYRG CISRHLNHKD RLVYKVIGKG EILILSWQGH YEDK