Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0226 |
Symbol | |
ID | 6376268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 251825 |
End bp | 254632 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642681414 |
Product | hypothetical protein |
Protein accession | YP_001957399 |
Protein GI | 189501682 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.544357 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAACC ATACGCCTTA CCATGAAGTT GACCAGCTTG ACCCCAGACA TTTCATCATT GTAAAAGGAG CAAGGGTAAA TAATCTAAAA AATGTAAATG TAGCTATTCC ACGTAACCAA CTAGTAGTGA TTACAGGATT GTCTGGTTCT GGCAAGTCTT CCTTGGCTTT TGATACACTG TTTGCCGAGG GCCAGCGTAT GTACATAGAA AGCTTAAGTT CTTACGCACG TCAGTTTTTA GGGAAATTAG AAAAACCAGA TGTAGATTAT ATTCGTGGCA TTTCCCCTGC GATTGCTATT GAGCAAAAAA CCAATAATAA AAATCCACGC TCTACAGTAG GTACTATCAC AGAGATATAT GAATATTTAA AACTTCTCTA TGCTAGAATT GGCACCACTT ACTCACCCAT TAGCGGACAA AAAGTTACTA GGAATACCGT ATCTGATGTA GTTGATTACA TTTATAGCCA TCCTGCCGGA ACAAAAGCTA TCATCTCCTA CCCTTTACAA CTAGATGCAA ATGCTAACCC TATAGAGCGT TTAAAAGTAG AACTAGCAAA AGGATTTACT AGAATCATAT ATGCAGGAAA AACAACTTTT ATTGAAGATA TTCTCAGTAG CGAGCAAGCC TTAACATCTC ATGATATATC TATATTAATA GATCGGTTTG TAGTGGACCA AGAAGATCGC GACAATCAGT TTAGAATAGC CGATTCTATC CAAACTGCTT TTTTTGAAGG CAAAGGAACT TGCTATGTGG AAATTGTAGG TAAAGAACAG AAAGTGTTCT CTGATCGTTT TGAACAAGAC GGAATGGCCT TTGAAATACC TTCTGTAAAC CTTTTTAGTT TTAACAACTC TTATGGTGCC TGCAAAACGT GCGATGGATT TGGGAAAATA TTGGGTATTG ATATTGATAA AGTTATACCT AATAGAACGC TTTCTGTTTA CGAAGGAGCT ATTTATCCCT GGAAAAGCCA ATCCATGCAT AAATGGTTAC AGCCACTTAT AGAAAACGAG CGGTATAAAG ATTTCCCTAT CCATAGGCCT TACAAAGATT TAACTGCTGA AGAGCAAACA TTTTTATGGA AAGGTGATGG TAATTTTAAA GGCATCGATG GTTTCTTTCA ATATTTGCAA GAGAAAATAC ACAAAATCCA GTATAGAATT CTTTTATCTC GTTATAGAGG AAAAACGACC TGTCCTGATT GCCAAGGAAC TAGAATTAGA AAAGATGCCA GTTATGTTAA AATAGAAGGA AAAGCCATTA CTGAATTACT ACTAATGCCT ATAGCAGAGG TGGCTCGTTT TTTTAACAGC CTATCATTGC CGGTACATGC ACAACAAATT GCCGACCGAC TTCTGGTTGA GATTAGAAAT AGGCTATCTT ATATGGAACA GGTCGGATTA AGTTACCTTA CTTTAAACCG GCAATCTGCT ACCCTATCCG GTGGGGAGTA CCAAAGGATT AAGCTAGCTA CAGCCTTGGG TAGTACACTA GTAGATACGC TCTATATTTT AGATGAGCCT ACTATTGGAC TGCATCCTAG AGATACACAA AAACTAATTG ATATTTTGGT TGCCTTAAAA AATCTAGGTA ATACAGTTAT TGTAGTGGAA CATGAAGAAG AGCTAATGCA AGTGGCTGAT CAGCTAATTG ACATAGGGCC TGAAGCAGGT TCTAAAGGTG GAGAATTGGT ATTTCAAGGA GATTGGGATG CCTTAAAAAA TTTCAACCAA AGCCACACTG CTCGTTACTT AAACGGCATA GAAACCATTC CTGTACCTAC CCAGAGACGC AAGCCTCAAT ATAACATTAC TTTTAAAGGC ATTAGAGAGA ATAATTTAAA GGATGTAGAT GTAACCATTC CATTAGGTGT ATTGACCGTT ATAACAGGCG TGAGTGGCTC AGGAAAATCT ACCTTAGTAA AAAGAATTAT ATACCCAGCT TTAGCCAATA AGCTAGAAAT ATATAAAGAA AAACCAGGTA GTTTTGATAG TTTAGAAGGC GATTATGATC TAATTAAGTA TATAGAGTTT GTAGATCAAA ACCCTATTGG AAAATCCTCT CGATCTAATC CGGTTACTTA TGTAAAAGCC TATGAGCACA TCCGACAGCT ATTTGCAGAC CAGCCATTAG CAAAGCAGCG AAGTTATCAA CCCTCCTATT TTTCATTTAA TGTAGAAGGA GGCAGATGTG AAGCATGCCA AGGAGAAGGT AAAATAACTA TAGAAATGCA ATTTATGGCC GATATAACAC TGACATGTGA AAGCTGTCAT GGCAAACGAT TTAAGCAAGA AGCATTAGAA ATAAAATATA AGGATCATGA TATTGCATCT GTGCTAGACA TGACTGTCGA TGATGCTTCT TATTTCTTTA GAGACCAAAC CAGCATTTAC AGAAAACTAA AACCTTTGCG TGATGTAGGC TTAGGATATG TTACACTAGG GCAGCCTTCT AGCTCCCTAA GTGGTGGAGA AGCACAGCGC CTCAAGCTAG CAACCTACTT AGATAAGAGC CACCAGCAAG AACATACTTT GTTTATCTTT GATGAACCTA CCACAGGACT CCATGTGCAT GATATTAGTA AGTTGCTCAC AGCAATCAAT AACCTAATTA GTGTGGGAAA TAGCGTGCTG GTTATAGAAC ATACAACAGA ACTTATTAAA TGTGCCGACT GGATTATAGA TTTAGGCCCA GAGGGCGGAG ACCAAGGAGG AGAAATTGTG TTTAGTGGTA CCCCAGAAGA TATGATTCAG CTTAGCAATA ACCATACAGC CGAATATCTT AGAAAGAAAT TGATATAG
|
Protein sequence | MNNHTPYHEV DQLDPRHFII VKGARVNNLK NVNVAIPRNQ LVVITGLSGS GKSSLAFDTL FAEGQRMYIE SLSSYARQFL GKLEKPDVDY IRGISPAIAI EQKTNNKNPR STVGTITEIY EYLKLLYARI GTTYSPISGQ KVTRNTVSDV VDYIYSHPAG TKAIISYPLQ LDANANPIER LKVELAKGFT RIIYAGKTTF IEDILSSEQA LTSHDISILI DRFVVDQEDR DNQFRIADSI QTAFFEGKGT CYVEIVGKEQ KVFSDRFEQD GMAFEIPSVN LFSFNNSYGA CKTCDGFGKI LGIDIDKVIP NRTLSVYEGA IYPWKSQSMH KWLQPLIENE RYKDFPIHRP YKDLTAEEQT FLWKGDGNFK GIDGFFQYLQ EKIHKIQYRI LLSRYRGKTT CPDCQGTRIR KDASYVKIEG KAITELLLMP IAEVARFFNS LSLPVHAQQI ADRLLVEIRN RLSYMEQVGL SYLTLNRQSA TLSGGEYQRI KLATALGSTL VDTLYILDEP TIGLHPRDTQ KLIDILVALK NLGNTVIVVE HEEELMQVAD QLIDIGPEAG SKGGELVFQG DWDALKNFNQ SHTARYLNGI ETIPVPTQRR KPQYNITFKG IRENNLKDVD VTIPLGVLTV ITGVSGSGKS TLVKRIIYPA LANKLEIYKE KPGSFDSLEG DYDLIKYIEF VDQNPIGKSS RSNPVTYVKA YEHIRQLFAD QPLAKQRSYQ PSYFSFNVEG GRCEACQGEG KITIEMQFMA DITLTCESCH GKRFKQEALE IKYKDHDIAS VLDMTVDDAS YFFRDQTSIY RKLKPLRDVG LGYVTLGQPS SSLSGGEAQR LKLATYLDKS HQQEHTLFIF DEPTTGLHVH DISKLLTAIN NLISVGNSVL VIEHTTELIK CADWIIDLGP EGGDQGGEIV FSGTPEDMIQ LSNNHTAEYL RKKLI
|
| |