Gene Aasi_0916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0916 
Symbol 
ID6377020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1171781 
End bp1173838 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content34% 
IMG OID642682050 
Producthypothetical protein 
Protein accessionYP_001958011 
Protein GI189502294 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0756379 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGAGAA ATATAGGGCG TAAAATAATA CAGATTAGTA GCTTAACTAT AAGTTTTATA 
CTAGGATGTT TCTGTAATCT CCTAGCTAAA GAAAAAATTT ATGAAAAAGA GTTGCCTTAC
GCCATAGCTT CTTATTATAT ACCATCTCTT TCAAGCCATA AACAGGAGTC TAAGCAGGTA
TCCATTAATA AAAAAAAGCG ACAACAAATT TGGGAGGCAA AGTTGCCGTC TTTAAGTAAA
CAAGAAAAGG CAGCTATTAT ATTTGAGCAG ATATTTGCCA AAAGCCCAGC TATTAATAGT
TTACAAGCTC AACAAGAACC CATCTTAAAT AATTATGCTT GGCATGACCT AAAATTATTT
TATGGTACTA ATTCTAATCC TAAATGTAAT CTGCTTCATC AGATTGATAG AACAATAAGT
TGGTTAGGTA AAGGAGTACT GGCCATATTT ATAGCAACGC CTACAAGTAG TATACAAGAG
TTGCGTACTA GACAAAGTAT CATACAATTT CTTTTAAAAG AAGACTTCAC TTTTAATCAA
CTTATAAAAG TTTACCAAGA TTATACTTAT ATAGAAGAAA ACTTGTTATC ATTTTGGTCT
AACCATGACC CCCTTTTTAA TCGTTTTTAT AAAAGCTTTA TTCAAAACAC TTTATACTTT
AACCTTCCTA ACAGTAGCAG ATATAATAAG GCAAGCCGTG CATTAGAGCT ACGTAAAAGG
CTTTATTTTG ATTCAGTACT AGCTTTACAT GGTACTTCCA TTAGCTTTCA AAGTTATTAT
CTATTTAAAG TATTAACAGA TCGGGCTACA TTTAAAAGGA TAATTTCTAA AGATCATCCT
ATGTGGATTA GAATATTAGC TTGGTATCCT TTAGCTACTA TCCCTATAGA AATACTTAGT
ATTAAAAAGC ACTATGATAG TTGGTCAACG CCTTTAAAAT ACTTAGCCGC CCGGCTAGCT
GATATACAGC AATGGGTATT GTTAGCTAAA GAAGTAAATA CTATTGTAGC TGGTTGTCCG
GAATTAGAAA ACCTATATGG AAAATATTTA ATAAACATAA GAAAGCTACT TGCTCAACCT
AATACTACGG AAATGGGAAG GCTTATTGAT CGTCTTTGTA GCTTGCCACT AAAAAAATGG
TCTGTATTTT TTAATAACAA TGGTAAGCTA TTACAAACTT ACAAGTTGTT TGTAAAGCAC
AAAAATAAGT TTACAGATGC ATTATATGAC TTTGGTAGGC TCGATACTTT TTTATCTATA
GCTCAGTTGC TAAAAGAAAC ACAGAGTACC CAGAGAAAAA CTACTTATAC TTTTACTCGA
TTTCTTGACC GTAGCGAAGA GTCTAAGCCA TACATATCGC TTGTTGATAT GTGGAGTCCT
TTTTTAGATG TAAAGAAGGC TATTACTAAT ACCATTACAA TGAGTAGTGA GCGTGAAGTG
CGCAATATGA TTTTAACGGG CCCTAATGCT GGTGGAAAGT CAGTGTTCAT AAGTGGTATG
GCCATAAGCT TATTGCTAAG CCAGACATTA GGAATTGCTC CTGCAAGCTC GGCTACCATT
ACACCATTTA ATAAAATTAA CACTTACTTA GATGTTTCGG GTGATATTGC TGAAGGTAAA
TCGCTTTTTA TGGCAGAGGT ACAGCGTGCT CAGCAACAAC TTGATACTAT CATGGGCTTG
CAGGAAGGTG AATTCAGCTT TAGTGTTATG GACGAAATAT TTAGTGGTAC TAATCCTCTA
GAAGGTGAGG CCGCTGCTTA CAGTATTGTT CATTATCTTT CAAAGTATAC TAATAATTTA
AGTATTGTAG CTACTCATTT CCCCAAATTA ACGTTGCTAC CAGAGCGGGC CCCACATAGT
GGCTTTGCTA ACTATAAGGT TTTTGTAGGT GTACAGAAAG AGACTGGCCA ACTAATTTAT
ACTTACAAAG TAGCACCAGG AAAATCTAAC CAAACAATTG CTTTAGATAT TTTAAAAGAG
CAAGGTTATG ATATAAAGAT GTTAGAAGAA GCAAAGGACA TACTATCCCA TCCAGAGCAT
TATCAGGCTA GTTTTTAA
 
Protein sequence
MERNIGRKII QISSLTISFI LGCFCNLLAK EKIYEKELPY AIASYYIPSL SSHKQESKQV 
SINKKKRQQI WEAKLPSLSK QEKAAIIFEQ IFAKSPAINS LQAQQEPILN NYAWHDLKLF
YGTNSNPKCN LLHQIDRTIS WLGKGVLAIF IATPTSSIQE LRTRQSIIQF LLKEDFTFNQ
LIKVYQDYTY IEENLLSFWS NHDPLFNRFY KSFIQNTLYF NLPNSSRYNK ASRALELRKR
LYFDSVLALH GTSISFQSYY LFKVLTDRAT FKRIISKDHP MWIRILAWYP LATIPIEILS
IKKHYDSWST PLKYLAARLA DIQQWVLLAK EVNTIVAGCP ELENLYGKYL INIRKLLAQP
NTTEMGRLID RLCSLPLKKW SVFFNNNGKL LQTYKLFVKH KNKFTDALYD FGRLDTFLSI
AQLLKETQST QRKTTYTFTR FLDRSEESKP YISLVDMWSP FLDVKKAITN TITMSSEREV
RNMILTGPNA GGKSVFISGM AISLLLSQTL GIAPASSATI TPFNKINTYL DVSGDIAEGK
SLFMAEVQRA QQQLDTIMGL QEGEFSFSVM DEIFSGTNPL EGEAAAYSIV HYLSKYTNNL
SIVATHFPKL TLLPERAPHS GFANYKVFVG VQKETGQLIY TYKVAPGKSN QTIALDILKE
QGYDIKMLEE AKDILSHPEH YQASF