Gene Aasi_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0226 
Symbol 
ID6376268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp251825 
End bp254632 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content38% 
IMG OID642681414 
Producthypothetical protein 
Protein accessionYP_001957399 
Protein GI189501682 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.544357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAACC ATACGCCTTA CCATGAAGTT GACCAGCTTG ACCCCAGACA TTTCATCATT 
GTAAAAGGAG CAAGGGTAAA TAATCTAAAA AATGTAAATG TAGCTATTCC ACGTAACCAA
CTAGTAGTGA TTACAGGATT GTCTGGTTCT GGCAAGTCTT CCTTGGCTTT TGATACACTG
TTTGCCGAGG GCCAGCGTAT GTACATAGAA AGCTTAAGTT CTTACGCACG TCAGTTTTTA
GGGAAATTAG AAAAACCAGA TGTAGATTAT ATTCGTGGCA TTTCCCCTGC GATTGCTATT
GAGCAAAAAA CCAATAATAA AAATCCACGC TCTACAGTAG GTACTATCAC AGAGATATAT
GAATATTTAA AACTTCTCTA TGCTAGAATT GGCACCACTT ACTCACCCAT TAGCGGACAA
AAAGTTACTA GGAATACCGT ATCTGATGTA GTTGATTACA TTTATAGCCA TCCTGCCGGA
ACAAAAGCTA TCATCTCCTA CCCTTTACAA CTAGATGCAA ATGCTAACCC TATAGAGCGT
TTAAAAGTAG AACTAGCAAA AGGATTTACT AGAATCATAT ATGCAGGAAA AACAACTTTT
ATTGAAGATA TTCTCAGTAG CGAGCAAGCC TTAACATCTC ATGATATATC TATATTAATA
GATCGGTTTG TAGTGGACCA AGAAGATCGC GACAATCAGT TTAGAATAGC CGATTCTATC
CAAACTGCTT TTTTTGAAGG CAAAGGAACT TGCTATGTGG AAATTGTAGG TAAAGAACAG
AAAGTGTTCT CTGATCGTTT TGAACAAGAC GGAATGGCCT TTGAAATACC TTCTGTAAAC
CTTTTTAGTT TTAACAACTC TTATGGTGCC TGCAAAACGT GCGATGGATT TGGGAAAATA
TTGGGTATTG ATATTGATAA AGTTATACCT AATAGAACGC TTTCTGTTTA CGAAGGAGCT
ATTTATCCCT GGAAAAGCCA ATCCATGCAT AAATGGTTAC AGCCACTTAT AGAAAACGAG
CGGTATAAAG ATTTCCCTAT CCATAGGCCT TACAAAGATT TAACTGCTGA AGAGCAAACA
TTTTTATGGA AAGGTGATGG TAATTTTAAA GGCATCGATG GTTTCTTTCA ATATTTGCAA
GAGAAAATAC ACAAAATCCA GTATAGAATT CTTTTATCTC GTTATAGAGG AAAAACGACC
TGTCCTGATT GCCAAGGAAC TAGAATTAGA AAAGATGCCA GTTATGTTAA AATAGAAGGA
AAAGCCATTA CTGAATTACT ACTAATGCCT ATAGCAGAGG TGGCTCGTTT TTTTAACAGC
CTATCATTGC CGGTACATGC ACAACAAATT GCCGACCGAC TTCTGGTTGA GATTAGAAAT
AGGCTATCTT ATATGGAACA GGTCGGATTA AGTTACCTTA CTTTAAACCG GCAATCTGCT
ACCCTATCCG GTGGGGAGTA CCAAAGGATT AAGCTAGCTA CAGCCTTGGG TAGTACACTA
GTAGATACGC TCTATATTTT AGATGAGCCT ACTATTGGAC TGCATCCTAG AGATACACAA
AAACTAATTG ATATTTTGGT TGCCTTAAAA AATCTAGGTA ATACAGTTAT TGTAGTGGAA
CATGAAGAAG AGCTAATGCA AGTGGCTGAT CAGCTAATTG ACATAGGGCC TGAAGCAGGT
TCTAAAGGTG GAGAATTGGT ATTTCAAGGA GATTGGGATG CCTTAAAAAA TTTCAACCAA
AGCCACACTG CTCGTTACTT AAACGGCATA GAAACCATTC CTGTACCTAC CCAGAGACGC
AAGCCTCAAT ATAACATTAC TTTTAAAGGC ATTAGAGAGA ATAATTTAAA GGATGTAGAT
GTAACCATTC CATTAGGTGT ATTGACCGTT ATAACAGGCG TGAGTGGCTC AGGAAAATCT
ACCTTAGTAA AAAGAATTAT ATACCCAGCT TTAGCCAATA AGCTAGAAAT ATATAAAGAA
AAACCAGGTA GTTTTGATAG TTTAGAAGGC GATTATGATC TAATTAAGTA TATAGAGTTT
GTAGATCAAA ACCCTATTGG AAAATCCTCT CGATCTAATC CGGTTACTTA TGTAAAAGCC
TATGAGCACA TCCGACAGCT ATTTGCAGAC CAGCCATTAG CAAAGCAGCG AAGTTATCAA
CCCTCCTATT TTTCATTTAA TGTAGAAGGA GGCAGATGTG AAGCATGCCA AGGAGAAGGT
AAAATAACTA TAGAAATGCA ATTTATGGCC GATATAACAC TGACATGTGA AAGCTGTCAT
GGCAAACGAT TTAAGCAAGA AGCATTAGAA ATAAAATATA AGGATCATGA TATTGCATCT
GTGCTAGACA TGACTGTCGA TGATGCTTCT TATTTCTTTA GAGACCAAAC CAGCATTTAC
AGAAAACTAA AACCTTTGCG TGATGTAGGC TTAGGATATG TTACACTAGG GCAGCCTTCT
AGCTCCCTAA GTGGTGGAGA AGCACAGCGC CTCAAGCTAG CAACCTACTT AGATAAGAGC
CACCAGCAAG AACATACTTT GTTTATCTTT GATGAACCTA CCACAGGACT CCATGTGCAT
GATATTAGTA AGTTGCTCAC AGCAATCAAT AACCTAATTA GTGTGGGAAA TAGCGTGCTG
GTTATAGAAC ATACAACAGA ACTTATTAAA TGTGCCGACT GGATTATAGA TTTAGGCCCA
GAGGGCGGAG ACCAAGGAGG AGAAATTGTG TTTAGTGGTA CCCCAGAAGA TATGATTCAG
CTTAGCAATA ACCATACAGC CGAATATCTT AGAAAGAAAT TGATATAG
 
Protein sequence
MNNHTPYHEV DQLDPRHFII VKGARVNNLK NVNVAIPRNQ LVVITGLSGS GKSSLAFDTL 
FAEGQRMYIE SLSSYARQFL GKLEKPDVDY IRGISPAIAI EQKTNNKNPR STVGTITEIY
EYLKLLYARI GTTYSPISGQ KVTRNTVSDV VDYIYSHPAG TKAIISYPLQ LDANANPIER
LKVELAKGFT RIIYAGKTTF IEDILSSEQA LTSHDISILI DRFVVDQEDR DNQFRIADSI
QTAFFEGKGT CYVEIVGKEQ KVFSDRFEQD GMAFEIPSVN LFSFNNSYGA CKTCDGFGKI
LGIDIDKVIP NRTLSVYEGA IYPWKSQSMH KWLQPLIENE RYKDFPIHRP YKDLTAEEQT
FLWKGDGNFK GIDGFFQYLQ EKIHKIQYRI LLSRYRGKTT CPDCQGTRIR KDASYVKIEG
KAITELLLMP IAEVARFFNS LSLPVHAQQI ADRLLVEIRN RLSYMEQVGL SYLTLNRQSA
TLSGGEYQRI KLATALGSTL VDTLYILDEP TIGLHPRDTQ KLIDILVALK NLGNTVIVVE
HEEELMQVAD QLIDIGPEAG SKGGELVFQG DWDALKNFNQ SHTARYLNGI ETIPVPTQRR
KPQYNITFKG IRENNLKDVD VTIPLGVLTV ITGVSGSGKS TLVKRIIYPA LANKLEIYKE
KPGSFDSLEG DYDLIKYIEF VDQNPIGKSS RSNPVTYVKA YEHIRQLFAD QPLAKQRSYQ
PSYFSFNVEG GRCEACQGEG KITIEMQFMA DITLTCESCH GKRFKQEALE IKYKDHDIAS
VLDMTVDDAS YFFRDQTSIY RKLKPLRDVG LGYVTLGQPS SSLSGGEAQR LKLATYLDKS
HQQEHTLFIF DEPTTGLHVH DISKLLTAIN NLISVGNSVL VIEHTTELIK CADWIIDLGP
EGGDQGGEIV FSGTPEDMIQ LSNNHTAEYL RKKLI