Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0942 |
Symbol | |
ID | 6377182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1207945 |
End bp | 1210797 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642682070 |
Product | hypothetical protein |
Protein accession | YP_001958031 |
Protein GI | 189502314 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.971464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAAC AAGAAACAAA TTTAAAAAAG CCAAATGCAG ATATTCATGG TTACGACCAT GTAGAGGTGT TGGGTGCTAG AGTACATAAC CTTAAAAATA TTGATGTGCG CTTTCCTAGG AATAAGCTGG TGGTTATTAC TGGGCTGAGT GGTAGTGGAA AATCTTCTTT AGCATTTGAT ACGATTTATG CAGAAGGGCA ACGACGTTAT ATAGAAACTT TTATAAGCTA TGCACGTTCC TTTATTGGTG ATTTAGAGCG TCCGGATGTA GATAAAATTA ATGGATTAAG TCCGGTCATT GCTATTGAGC AAAAAACAAC TTCTAGAAAC CCACGGTCTA CAGTAGGTAC AGTAACAGAA ATTTATGACT ATTTGCGTTT GCTATTTGCT AAAGTTGCAG ATGCCTACTC CTATACAAGT GGCCACAAAA TGGTTAAGCA GACAGCTGAA CAAATAGAAA ATCATATACT TCAGATATTT TCCGGTAAAC AGCTTTTCTT GCTGGCACCT GTAGTAAAAG GTAGAAAAGG GCACTATAGA GAGCTGTTTC AACAAATTAG CAGCATGGGA TTTACTAAGG TCAAAATAGA TGGAGAGATA AAAGATGTTG TAGCTAAACT GCAAGTAGAC CGTTACAAGA TACACGATAT AGAAATTGTG GTTGACCGGT TGGTTGTCGA TAAGGAAGAT AAGCAACGGT TACAAAATTC TTTGCATACA ACCTTACAGC ATGGTAAAGG TGTGGCAATG ATACAGGATG AGTTAGGACA GATTCACTAT TTTTCTAAAT CTTTTATGGA TCCTGTTACC GGCCTCTCTT ACGATGAGCC TGCGCCTAAT ACTTTTTCTT TTAACTCGCC CTATGGCGCT TGTACCACCT GTGAAGGATT AGGGACCATA TCGCAGGTTG ATATAGATGC GATTATACCC GATAAGTCCT TGAATGTACA ACAAGGGGGT ATTTTGCCAT TAGGGCCTGA AGCGAAAACA GATATCTTTA AAACAATAAA AAGTTTATTT GAACATTATC AGGTACCGTT CGATACACCT ATTCAAAAGC TGTCCCAACA GTTGCTGGAT ACTATTCTAT ATGGGCAAGA AGAAATTACT GAGCAGGAAT CTACTAGTAA AGAAAAAAAG CGTGTGATAA AACGGTTCGA GGGTATAATA CCTGCCTTAG ATAACCTAGA AAATAAAACT ACTGCAAAAG AACGTGCTAT ACAGGAATCG TACAGACGGG ATATAACATG TCCTGAATGT CAAGGTGCTA GGCTTAAAAA AGTAGCATTA TACTTTAAAA TAGCTGATAA GAACATTGCT GAGTTGGCCA ATATGAACCT GCAACAGCTT CATGGTTTCC TTGAAGAGCT TATGCCTAAG CTAGATAACC GTCAGCAAAT TATTGCTAGT GAGTTACTTA AAGAGCTTAA AAAGCGTATC CATTTCTTGC TTAACGTAGG ATTATATTAT TTGTCACTAA ATCGACCACT AAGAACGCTT TCTGGTGGAG AAGCACAAAG GATTAGGCTG GCTACACAAA TTGGTACACA GCTCGTAGGT GTATTGTATA TTCTAGATGA GCCTAGCATC GGGCTACATC AGCGTGACAA CATGAGTCTG ATCCAGGCCC TTCATGATTT AAGAGACTTA GGTAATTCTG TGATGGTAGT GGAGCATGAT AAAGATATGA TGTTGGAATC AGACTATATT ATTGATATAG GTCCTGGAGC TGGAAAAAAT GGAGGTAAAG TTGTGGCTGC TGGCACACCA ACCGAATTCT TAAAGCAAGC CAGCACAACA GCTGAATTTT TATCGGGTGT TCGACAAATT GCTATTCCAT CTACTCGTAG GCAAGGAAAT GGTAATGTGT TGACACTTGC AGGTTGCACA GGCAATAATC TTAAAAATGT AACACTAAAT TTACCATTGG GTAAGCTTAT TTGTATTTCT GGGGTTTCGG GTAGTGGTAA GTCCACGTTA ATCCATCAAA CGCTGTATCC TATTCTTCAG AAATATCTAT ATAAATCTTA TGCCAATCCG CTTCCTTACA CAAGTATAAC CGGATTAGAA CATTTAGATA AAGTGGTGGA AATAGACCAG AAGCCTATAG GGAGAACTCC CCGTTCCAAT CCTTCTACCT ATACCAATGT TTTTACAGGC ATCCGTAATT TATTTTCACA ATTGCCAGAA GCGAAGATAC GAGGTTATCA ACCTGGTAGA TTTTCTTTTA ATGTAAGTGG AGGAAGGTGT GAAACTTGTC AAGGTGGAGG TATGCGTGTC ATAGAAATGG ACTTTTTACC CGATGTATAT GTGCATTGCG AAACCTGCCA AGGAAAACGT TATAACCGAG AAACCTTGGA AGTACAATAT AAGGGTAAGT CAATCTCTGA TGTATTAGAT ATGACCATTA GCAATGCTGT AGAATTTTTT GATAAATATC CACATATCCG TAAGATCATA CAAATTTTGG AGGATGTAGG CTTAGGTTAC CTTACTTTAG GTCAGCCTGC TACCACTTTG TCAGGCGGTG AAGCACAACG CGTAAAATTG GCTACCGAAC TAGCAAAAAG GGATACAGGT AAAACCTTTT ACATACTCGA TGAACCCACA ACAGGTTTAC ATTTTCAAGA TATTCAGCAC CTGTTAGATG TGCTCAATAA ATTAACCGAT AAAGGCAATA CTGTACTAAT TATTGAGCAT AACCTAGATA TTATCAAGGT TGCAGATTAT ATCATTGATG TGGGCCCAGA GGGAGGCGAA CAAGGTGGGC AGATTGTAGC AGAAGGAACA CCAGAAGAGC TTATTCAACA CCCATATAGC CACACTGCCA AATTTCTTAA AATGGAAATG TAA
|
Protein sequence | MSKQETNLKK PNADIHGYDH VEVLGARVHN LKNIDVRFPR NKLVVITGLS GSGKSSLAFD TIYAEGQRRY IETFISYARS FIGDLERPDV DKINGLSPVI AIEQKTTSRN PRSTVGTVTE IYDYLRLLFA KVADAYSYTS GHKMVKQTAE QIENHILQIF SGKQLFLLAP VVKGRKGHYR ELFQQISSMG FTKVKIDGEI KDVVAKLQVD RYKIHDIEIV VDRLVVDKED KQRLQNSLHT TLQHGKGVAM IQDELGQIHY FSKSFMDPVT GLSYDEPAPN TFSFNSPYGA CTTCEGLGTI SQVDIDAIIP DKSLNVQQGG ILPLGPEAKT DIFKTIKSLF EHYQVPFDTP IQKLSQQLLD TILYGQEEIT EQESTSKEKK RVIKRFEGII PALDNLENKT TAKERAIQES YRRDITCPEC QGARLKKVAL YFKIADKNIA ELANMNLQQL HGFLEELMPK LDNRQQIIAS ELLKELKKRI HFLLNVGLYY LSLNRPLRTL SGGEAQRIRL ATQIGTQLVG VLYILDEPSI GLHQRDNMSL IQALHDLRDL GNSVMVVEHD KDMMLESDYI IDIGPGAGKN GGKVVAAGTP TEFLKQASTT AEFLSGVRQI AIPSTRRQGN GNVLTLAGCT GNNLKNVTLN LPLGKLICIS GVSGSGKSTL IHQTLYPILQ KYLYKSYANP LPYTSITGLE HLDKVVEIDQ KPIGRTPRSN PSTYTNVFTG IRNLFSQLPE AKIRGYQPGR FSFNVSGGRC ETCQGGGMRV IEMDFLPDVY VHCETCQGKR YNRETLEVQY KGKSISDVLD MTISNAVEFF DKYPHIRKII QILEDVGLGY LTLGQPATTL SGGEAQRVKL ATELAKRDTG KTFYILDEPT TGLHFQDIQH LLDVLNKLTD KGNTVLIIEH NLDIIKVADY IIDVGPEGGE QGGQIVAEGT PEELIQHPYS HTAKFLKMEM
|
| |