Gene Aasi_0323 details

Gene Information

Locus tag: Aasi_0323
Symbol:
ID: 6377691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 372557
End bp: 375367
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table: 11
GC content: 40%
IMG OID: 642681502
Product: hypothetical protein
Protein accession: YP_001957486
Protein GI: 189501769
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
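
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function and constant names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 372557
END_BP = 375367
GENE_LENGTH_BP = 2811
PROTEIN_LENGTH_AA = 936


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```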


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.875781
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACTA CTAATAAACC TAAATTATTC CTCTTAGATG CGTTGGCCCT TATTTACCGG 
GCTCATTTCG CTTTTATTAA GAATCCTAGA ATTACTTCAA AAGGCCTTAA TACCAGTGCT
ACTTTGGGTT TTACCAACAC TTTAGTAGAG GTCATTACTA AAGAGAAGCC AAGCCACATC
ATTGTAGCCT TTGATACTGG AGCACCTACG CATAGACATA CGGCTTTCCC AGCTTATAAA
GAACACAGGC CTTCCCAGCC CGAAGATATT ACGGTTGCTA TTCCCTATGT AAAAAAAATC
TTAAAAGCCT TTCGTATACC TGTACTATTG CTCGAAGGCT ATGAGGCGGA TGATATTATT
GGAACGTTGG CTCGGCAGGC TGCCGTACAA GGGTTCGAAG TGTACATGAT GACGCCTGAT
AAAGATTTTG CACAATTGGT AGATGATCAT ATTTATATAT ATAAGCCAGC TTTTATGGGC
AATGGCGTAG CCATTCTAGA TAGGCAGGCG GTATTAGAAA AATGGGGTAT ATCGAACGTA
GATCAAATAA GGGATCTGCT TGCACTCCAG GGCGATGCTG TGGATAATAT TCCGGGTATA
CCCAGTATTG GAATAAAAAC AGCACAGAAA CTAATACAGC AGTTTGGGAC TTTAGAAAAC
TTATTAGCCA ACGCTGATCA GTTAACTGGG AAGCTGCAAG AGAATGTAGT AAAATATGCA
CAGCAGGGCA TTTTATCTAA AGAATTGGCT ACGATACATA CAGAAGTGCC TATACAGTTT
GATGCTGAAG AGAGTCGGTA CCAAGGTCCT GATCCTGTAG CGCTTAAGGA AATTTTTCAA
GAGTTAGAAT TTAATTCGCT AACAAGACGT TTATTAGGAG AAGACAACTA TAACACAAAA
AGGACTCCCG GCACTCAAGC CAACTTATTT GATTTCACAC CTACTGACCA GCCAACCGCA
GAAACGGCTT CTGTATATCT TGATCCTGCT CCATTTCGTA ATATTTATAC TACAAAGCAT
CAATATTATC TTATAGATAC ACCGTCTTTA CGGCAAAATT TGATTAATTA TCTAAAACTT
CAGGATACTT TCTGCTTTGA TACAGAAACT ACTAGCCTAG ATCCCTACCA AGCGCAGCTT
GTCGGGATTT CATTTGCTTA CTACCCTGGT GAGGCGTACT ATGTACCTAT TCCAGCAGAA
AGGAAGGCTG CGCAAGCAAT TATTGAAGAG TTCCGCCCAT TACTTGAAAG TACTACCCAG
TGTAAAGTAG GCCAGAATTT AAAATACGAT AACCTTATTC TACGCACTTA TGGCATAGAA
GTAGCGCCTC CTATTTTTGA TACCATGGTA GCGCATTACC TGGTAGCACC TGATAAACCT
CATAACATGA ATGCTATTGC CGAAAGCTAT CTTAATTATG CACCTATTCC TATAGAGGCT
TTAATTGGTT CTAGAAAGTC TACGCAGAAA AGCATGCGGA TGGTGGATGT GGAATTGGTT
AAAGAGTATG CTTGCGAAGA TGCAGATATT ACCCTACAGC TTAAAAGGCT TTTAGAACTA
AATATTAAAC AAGAAAATTT ATCAAAGCTA TTTTATGAAA TAGAAATTCC GTTGGTACAG
GTGCTTACTG CCATGGAGTA CCAGGGAGTG CAAATAGACA CTCAAGTGCT ACGGGAAATT
TCTGTTACGT TGGGTACAGA GCTTGCAGCT TTAGAAAAAG AGATTCATAG GTTAGCAGGG
CATGCGTTTA ACATCAGTTC TCCTAAGCAG CTTGGTGAAA TATTATTTGA TAAGCTGAAA
ATTGCTGGAA ATAATAAAAA GACAAAATCA GGGCAATATG CTACTGGCGA GCTGGTGCTA
GCCGATTTAA CAAAGGACCA TCCCATTGCA GCTAACATAT TAGATTACAG GGAACTACAA
AAATTAAAAT CCACATATGT AGATGCTTTA ATTGATCTAA TTTCTCCCTT TGATGGGAAA
GTGCATACTT CTTATAACCA AACCGTAGTA ACCACGGGTC GGCTTAGTTC TACCAACCCG
AACTTACAAA ATATTCCTAT TCGTACAGAA AAGGGTAGGG CTATCCGTAA AGCTTTTGTT
CCCAGTAAGC CAGATCATGT GTTGCTTTCT GCTGATTATT CACAAATAGA ATTACGTATT
ATGGCTTCTT TCTCACAAGA TGAACATATG ATAGAGGCAT TTAAAGCAGG AAAGGATATT
CATACGGCTA CAGCGAGTAA GCTTTTTAAA GTACCGCTTG AACAGGTAGA TGAATCCATG
AGACGCCAAG CTAAGACAGC CAACTTTGGC ATTATATATG GTATTTCTGG TTTTGGACTT
GCACAACGGC TTGGTATTTC GCGGTCGGAA GCCATAGCCA CCATTCAAGC TTATTTTCAA
GAATTTCATG CTATAAAAGC CTATATGGAT AGGGTTATTG GCCAAGCTAG AGAGCAAGGC
TATGTAACTA CCTTAATGGG ACGAAAACGA TATCTAAGGG ATATCAACTC TAGGAATTCA
ACCTTACGTG GATTTGATGA GCGGAATGCC ATCAACACGC CTATACAAGG TACTGCTGCT
GAAATGATAA AACTAGCTAT GGTACATATT TATGAATGGC TACAAAAAGA AAAACTTCAA
TCTAAATTAA TTTTACAAGT ACACGATGAA CTTGTATTTG ATGTGCCCCA GCATGAGATA
GAAATCATGC GAGAGCATGT AGCTTACTTC ATGAAAAATT CACTTCCTTT AGCAGGAGTA
CCTGTAGAGG TACAAATAGG CCTTGGGAAA AATTGGGAAG AAGCACACTA A
 
Protein sequence
MATTNKPKLF LLDALALIYR AHFAFIKNPR ITSKGLNTSA TLGFTNTLVE VITKEKPSHI 
IVAFDTGAPT HRHTAFPAYK EHRPSQPEDI TVAIPYVKKI LKAFRIPVLL LEGYEADDII
GTLARQAAVQ GFEVYMMTPD KDFAQLVDDH IYIYKPAFMG NGVAILDRQA VLEKWGISNV
DQIRDLLALQ GDAVDNIPGI PSIGIKTAQK LIQQFGTLEN LLANADQLTG KLQENVVKYA
QQGILSKELA TIHTEVPIQF DAEESRYQGP DPVALKEIFQ ELEFNSLTRR LLGEDNYNTK
RTPGTQANLF DFTPTDQPTA ETASVYLDPA PFRNIYTTKH QYYLIDTPSL RQNLINYLKL
QDTFCFDTET TSLDPYQAQL VGISFAYYPG EAYYVPIPAE RKAAQAIIEE FRPLLESTTQ
CKVGQNLKYD NLILRTYGIE VAPPIFDTMV AHYLVAPDKP HNMNAIAESY LNYAPIPIEA
LIGSRKSTQK SMRMVDVELV KEYACEDADI TLQLKRLLEL NIKQENLSKL FYEIEIPLVQ
VLTAMEYQGV QIDTQVLREI SVTLGTELAA LEKEIHRLAG HAFNISSPKQ LGEILFDKLK
IAGNNKKTKS GQYATGELVL ADLTKDHPIA ANILDYRELQ KLKSTYVDAL IDLISPFDGK
VHTSYNQTVV TTGRLSSTNP NLQNIPIRTE KGRAIRKAFV PSKPDHVLLS ADYSQIELRI
MASFSQDEHM IEAFKAGKDI HTATASKLFK VPLEQVDESM RRQAKTANFG IIYGISGFGL
AQRLGISRSE AIATIQAYFQ EFHAIKAYMD RVIGQAREQG YVTTLMGRKR YLRDINSRNS
TLRGFDERNA INTPIQGTAA EMIKLAMVHI YEWLQKEKLQ SKLILQVHDE LVFDVPQHEI
EIMREHVAYF MKNSLPLAGV PVEVQIGLGK NWEEAH
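
As a quick cross-check of the two sequences, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch using only the Python standard library. It relies on NCBI translation table 11 assigning amino acids to codons in the same way as the standard code (the tables differ in which start codons are permitted). The names `translate`, `gene_prefix`, and `gene_suffix` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI's compact layout (bases ordered T, C, A, G).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
gene_prefix = "ATGGCTACTA CTAATAAACC TAAATTATTC"
gene_suffix = "AATTGGGAAG AAGCACACTA A"

print(translate(gene_prefix))  # compare with the start of the protein sequence
print(translate(gene_suffix))  # compare with the end of the protein sequence
```

Pasting the full gene sequence into `translate` should give a string that can be compared directly with the protein sequence once the spaces and line breaks are removed from both.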