Gene Aasi_0942 details


Gene Information

Locus tag: Aasi_0942
Symbol:
ID: 6377182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1207945
End bp: 1210797
Gene Length: 2853 bp
Protein Length: 950 aa
Translation table: 11
GC content: 39%
IMG OID: 642682070
Product: hypothetical protein
Protein accession: YP_001958031
Protein GI: 189502314
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
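
The Start bp, End bp, Gene Length and Protein Length values listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the gene length includes a single terminal stop codon. The function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1207945
END_BP = 1210797
GENE_LENGTH_BP = 2853
PROTEIN_LENGTH_AA = 950


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1210797 - 1207945 + 1 gives 2853 bp, and 2853 / 3 - 1 gives 950 codons, matching the listed gene and protein lengths.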


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.971464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAAAC AAGAAACAAA TTTAAAAAAG CCAAATGCAG ATATTCATGG TTACGACCAT 
GTAGAGGTGT TGGGTGCTAG AGTACATAAC CTTAAAAATA TTGATGTGCG CTTTCCTAGG
AATAAGCTGG TGGTTATTAC TGGGCTGAGT GGTAGTGGAA AATCTTCTTT AGCATTTGAT
ACGATTTATG CAGAAGGGCA ACGACGTTAT ATAGAAACTT TTATAAGCTA TGCACGTTCC
TTTATTGGTG ATTTAGAGCG TCCGGATGTA GATAAAATTA ATGGATTAAG TCCGGTCATT
GCTATTGAGC AAAAAACAAC TTCTAGAAAC CCACGGTCTA CAGTAGGTAC AGTAACAGAA
ATTTATGACT ATTTGCGTTT GCTATTTGCT AAAGTTGCAG ATGCCTACTC CTATACAAGT
GGCCACAAAA TGGTTAAGCA GACAGCTGAA CAAATAGAAA ATCATATACT TCAGATATTT
TCCGGTAAAC AGCTTTTCTT GCTGGCACCT GTAGTAAAAG GTAGAAAAGG GCACTATAGA
GAGCTGTTTC AACAAATTAG CAGCATGGGA TTTACTAAGG TCAAAATAGA TGGAGAGATA
AAAGATGTTG TAGCTAAACT GCAAGTAGAC CGTTACAAGA TACACGATAT AGAAATTGTG
GTTGACCGGT TGGTTGTCGA TAAGGAAGAT AAGCAACGGT TACAAAATTC TTTGCATACA
ACCTTACAGC ATGGTAAAGG TGTGGCAATG ATACAGGATG AGTTAGGACA GATTCACTAT
TTTTCTAAAT CTTTTATGGA TCCTGTTACC GGCCTCTCTT ACGATGAGCC TGCGCCTAAT
ACTTTTTCTT TTAACTCGCC CTATGGCGCT TGTACCACCT GTGAAGGATT AGGGACCATA
TCGCAGGTTG ATATAGATGC GATTATACCC GATAAGTCCT TGAATGTACA ACAAGGGGGT
ATTTTGCCAT TAGGGCCTGA AGCGAAAACA GATATCTTTA AAACAATAAA AAGTTTATTT
GAACATTATC AGGTACCGTT CGATACACCT ATTCAAAAGC TGTCCCAACA GTTGCTGGAT
ACTATTCTAT ATGGGCAAGA AGAAATTACT GAGCAGGAAT CTACTAGTAA AGAAAAAAAG
CGTGTGATAA AACGGTTCGA GGGTATAATA CCTGCCTTAG ATAACCTAGA AAATAAAACT
ACTGCAAAAG AACGTGCTAT ACAGGAATCG TACAGACGGG ATATAACATG TCCTGAATGT
CAAGGTGCTA GGCTTAAAAA AGTAGCATTA TACTTTAAAA TAGCTGATAA GAACATTGCT
GAGTTGGCCA ATATGAACCT GCAACAGCTT CATGGTTTCC TTGAAGAGCT TATGCCTAAG
CTAGATAACC GTCAGCAAAT TATTGCTAGT GAGTTACTTA AAGAGCTTAA AAAGCGTATC
CATTTCTTGC TTAACGTAGG ATTATATTAT TTGTCACTAA ATCGACCACT AAGAACGCTT
TCTGGTGGAG AAGCACAAAG GATTAGGCTG GCTACACAAA TTGGTACACA GCTCGTAGGT
GTATTGTATA TTCTAGATGA GCCTAGCATC GGGCTACATC AGCGTGACAA CATGAGTCTG
ATCCAGGCCC TTCATGATTT AAGAGACTTA GGTAATTCTG TGATGGTAGT GGAGCATGAT
AAAGATATGA TGTTGGAATC AGACTATATT ATTGATATAG GTCCTGGAGC TGGAAAAAAT
GGAGGTAAAG TTGTGGCTGC TGGCACACCA ACCGAATTCT TAAAGCAAGC CAGCACAACA
GCTGAATTTT TATCGGGTGT TCGACAAATT GCTATTCCAT CTACTCGTAG GCAAGGAAAT
GGTAATGTGT TGACACTTGC AGGTTGCACA GGCAATAATC TTAAAAATGT AACACTAAAT
TTACCATTGG GTAAGCTTAT TTGTATTTCT GGGGTTTCGG GTAGTGGTAA GTCCACGTTA
ATCCATCAAA CGCTGTATCC TATTCTTCAG AAATATCTAT ATAAATCTTA TGCCAATCCG
CTTCCTTACA CAAGTATAAC CGGATTAGAA CATTTAGATA AAGTGGTGGA AATAGACCAG
AAGCCTATAG GGAGAACTCC CCGTTCCAAT CCTTCTACCT ATACCAATGT TTTTACAGGC
ATCCGTAATT TATTTTCACA ATTGCCAGAA GCGAAGATAC GAGGTTATCA ACCTGGTAGA
TTTTCTTTTA ATGTAAGTGG AGGAAGGTGT GAAACTTGTC AAGGTGGAGG TATGCGTGTC
ATAGAAATGG ACTTTTTACC CGATGTATAT GTGCATTGCG AAACCTGCCA AGGAAAACGT
TATAACCGAG AAACCTTGGA AGTACAATAT AAGGGTAAGT CAATCTCTGA TGTATTAGAT
ATGACCATTA GCAATGCTGT AGAATTTTTT GATAAATATC CACATATCCG TAAGATCATA
CAAATTTTGG AGGATGTAGG CTTAGGTTAC CTTACTTTAG GTCAGCCTGC TACCACTTTG
TCAGGCGGTG AAGCACAACG CGTAAAATTG GCTACCGAAC TAGCAAAAAG GGATACAGGT
AAAACCTTTT ACATACTCGA TGAACCCACA ACAGGTTTAC ATTTTCAAGA TATTCAGCAC
CTGTTAGATG TGCTCAATAA ATTAACCGAT AAAGGCAATA CTGTACTAAT TATTGAGCAT
AACCTAGATA TTATCAAGGT TGCAGATTAT ATCATTGATG TGGGCCCAGA GGGAGGCGAA
CAAGGTGGGC AGATTGTAGC AGAAGGAACA CCAGAAGAGC TTATTCAACA CCCATATAGC
CACACTGCCA AATTTCTTAA AATGGAAATG TAA
 
Protein sequence
MSKQETNLKK PNADIHGYDH VEVLGARVHN LKNIDVRFPR NKLVVITGLS GSGKSSLAFD 
TIYAEGQRRY IETFISYARS FIGDLERPDV DKINGLSPVI AIEQKTTSRN PRSTVGTVTE
IYDYLRLLFA KVADAYSYTS GHKMVKQTAE QIENHILQIF SGKQLFLLAP VVKGRKGHYR
ELFQQISSMG FTKVKIDGEI KDVVAKLQVD RYKIHDIEIV VDRLVVDKED KQRLQNSLHT
TLQHGKGVAM IQDELGQIHY FSKSFMDPVT GLSYDEPAPN TFSFNSPYGA CTTCEGLGTI
SQVDIDAIIP DKSLNVQQGG ILPLGPEAKT DIFKTIKSLF EHYQVPFDTP IQKLSQQLLD
TILYGQEEIT EQESTSKEKK RVIKRFEGII PALDNLENKT TAKERAIQES YRRDITCPEC
QGARLKKVAL YFKIADKNIA ELANMNLQQL HGFLEELMPK LDNRQQIIAS ELLKELKKRI
HFLLNVGLYY LSLNRPLRTL SGGEAQRIRL ATQIGTQLVG VLYILDEPSI GLHQRDNMSL
IQALHDLRDL GNSVMVVEHD KDMMLESDYI IDIGPGAGKN GGKVVAAGTP TEFLKQASTT
AEFLSGVRQI AIPSTRRQGN GNVLTLAGCT GNNLKNVTLN LPLGKLICIS GVSGSGKSTL
IHQTLYPILQ KYLYKSYANP LPYTSITGLE HLDKVVEIDQ KPIGRTPRSN PSTYTNVFTG
IRNLFSQLPE AKIRGYQPGR FSFNVSGGRC ETCQGGGMRV IEMDFLPDVY VHCETCQGKR
YNRETLEVQY KGKSISDVLD MTISNAVEFF DKYPHIRKII QILEDVGLGY LTLGQPATTL
SGGEAQRVKL ATELAKRDTG KTFYILDEPT TGLHFQDIQH LLDVLNKLTD KGNTVLIIEH
NLDIIKVADY IIDVGPEGGE QGGQIVAEGT PEELIQHPYS HTAKFLKMEM
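
The sequences above are printed in blocks of 10 residues, 60 per line, so they need to be joined before any analysis. The following is a rough sketch of how the blocked gene sequence could be cleaned and its GC content recomputed for comparison with the listed value. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence listing, used as sample input only.
    first_row = (
        "ATGTCTAAAC AAGAAACAAA TTTAAAAAAG "
        "CCAAATGCAG ATATTCATGG TTACGACCAT"
    )
    seq = clean_sequence(first_row)
    print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions over the full gene sequence gives a value to compare against the GC content field. A single row is not expected to match the whole-gene figure.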