Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0323 |
Symbol | |
ID | 6377691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 372557 |
End bp | 375367 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642681502 |
Product | hypothetical protein |
Protein accession | YP_001957486 |
Protein GI | 189501769 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.875781 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACTA CTAATAAACC TAAATTATTC CTCTTAGATG CGTTGGCCCT TATTTACCGG GCTCATTTCG CTTTTATTAA GAATCCTAGA ATTACTTCAA AAGGCCTTAA TACCAGTGCT ACTTTGGGTT TTACCAACAC TTTAGTAGAG GTCATTACTA AAGAGAAGCC AAGCCACATC ATTGTAGCCT TTGATACTGG AGCACCTACG CATAGACATA CGGCTTTCCC AGCTTATAAA GAACACAGGC CTTCCCAGCC CGAAGATATT ACGGTTGCTA TTCCCTATGT AAAAAAAATC TTAAAAGCCT TTCGTATACC TGTACTATTG CTCGAAGGCT ATGAGGCGGA TGATATTATT GGAACGTTGG CTCGGCAGGC TGCCGTACAA GGGTTCGAAG TGTACATGAT GACGCCTGAT AAAGATTTTG CACAATTGGT AGATGATCAT ATTTATATAT ATAAGCCAGC TTTTATGGGC AATGGCGTAG CCATTCTAGA TAGGCAGGCG GTATTAGAAA AATGGGGTAT ATCGAACGTA GATCAAATAA GGGATCTGCT TGCACTCCAG GGCGATGCTG TGGATAATAT TCCGGGTATA CCCAGTATTG GAATAAAAAC AGCACAGAAA CTAATACAGC AGTTTGGGAC TTTAGAAAAC TTATTAGCCA ACGCTGATCA GTTAACTGGG AAGCTGCAAG AGAATGTAGT AAAATATGCA CAGCAGGGCA TTTTATCTAA AGAATTGGCT ACGATACATA CAGAAGTGCC TATACAGTTT GATGCTGAAG AGAGTCGGTA CCAAGGTCCT GATCCTGTAG CGCTTAAGGA AATTTTTCAA GAGTTAGAAT TTAATTCGCT AACAAGACGT TTATTAGGAG AAGACAACTA TAACACAAAA AGGACTCCCG GCACTCAAGC CAACTTATTT GATTTCACAC CTACTGACCA GCCAACCGCA GAAACGGCTT CTGTATATCT TGATCCTGCT CCATTTCGTA ATATTTATAC TACAAAGCAT CAATATTATC TTATAGATAC ACCGTCTTTA CGGCAAAATT TGATTAATTA TCTAAAACTT CAGGATACTT TCTGCTTTGA TACAGAAACT ACTAGCCTAG ATCCCTACCA AGCGCAGCTT GTCGGGATTT CATTTGCTTA CTACCCTGGT GAGGCGTACT ATGTACCTAT TCCAGCAGAA AGGAAGGCTG CGCAAGCAAT TATTGAAGAG TTCCGCCCAT TACTTGAAAG TACTACCCAG TGTAAAGTAG GCCAGAATTT AAAATACGAT AACCTTATTC TACGCACTTA TGGCATAGAA GTAGCGCCTC CTATTTTTGA TACCATGGTA GCGCATTACC TGGTAGCACC TGATAAACCT CATAACATGA ATGCTATTGC CGAAAGCTAT CTTAATTATG CACCTATTCC TATAGAGGCT TTAATTGGTT CTAGAAAGTC TACGCAGAAA AGCATGCGGA TGGTGGATGT GGAATTGGTT AAAGAGTATG CTTGCGAAGA TGCAGATATT ACCCTACAGC TTAAAAGGCT TTTAGAACTA AATATTAAAC AAGAAAATTT ATCAAAGCTA TTTTATGAAA TAGAAATTCC GTTGGTACAG GTGCTTACTG CCATGGAGTA CCAGGGAGTG CAAATAGACA CTCAAGTGCT ACGGGAAATT TCTGTTACGT TGGGTACAGA GCTTGCAGCT TTAGAAAAAG AGATTCATAG GTTAGCAGGG CATGCGTTTA ACATCAGTTC TCCTAAGCAG CTTGGTGAAA TATTATTTGA TAAGCTGAAA ATTGCTGGAA ATAATAAAAA GACAAAATCA GGGCAATATG CTACTGGCGA GCTGGTGCTA GCCGATTTAA CAAAGGACCA TCCCATTGCA GCTAACATAT TAGATTACAG GGAACTACAA AAATTAAAAT CCACATATGT AGATGCTTTA ATTGATCTAA TTTCTCCCTT TGATGGGAAA GTGCATACTT CTTATAACCA AACCGTAGTA ACCACGGGTC GGCTTAGTTC TACCAACCCG AACTTACAAA ATATTCCTAT TCGTACAGAA AAGGGTAGGG CTATCCGTAA AGCTTTTGTT CCCAGTAAGC CAGATCATGT GTTGCTTTCT GCTGATTATT CACAAATAGA ATTACGTATT ATGGCTTCTT TCTCACAAGA TGAACATATG ATAGAGGCAT TTAAAGCAGG AAAGGATATT CATACGGCTA CAGCGAGTAA GCTTTTTAAA GTACCGCTTG AACAGGTAGA TGAATCCATG AGACGCCAAG CTAAGACAGC CAACTTTGGC ATTATATATG GTATTTCTGG TTTTGGACTT GCACAACGGC TTGGTATTTC GCGGTCGGAA GCCATAGCCA CCATTCAAGC TTATTTTCAA GAATTTCATG CTATAAAAGC CTATATGGAT AGGGTTATTG GCCAAGCTAG AGAGCAAGGC TATGTAACTA CCTTAATGGG ACGAAAACGA TATCTAAGGG ATATCAACTC TAGGAATTCA ACCTTACGTG GATTTGATGA GCGGAATGCC ATCAACACGC CTATACAAGG TACTGCTGCT GAAATGATAA AACTAGCTAT GGTACATATT TATGAATGGC TACAAAAAGA AAAACTTCAA TCTAAATTAA TTTTACAAGT ACACGATGAA CTTGTATTTG ATGTGCCCCA GCATGAGATA GAAATCATGC GAGAGCATGT AGCTTACTTC ATGAAAAATT CACTTCCTTT AGCAGGAGTA CCTGTAGAGG TACAAATAGG CCTTGGGAAA AATTGGGAAG AAGCACACTA A
|
Protein sequence | MATTNKPKLF LLDALALIYR AHFAFIKNPR ITSKGLNTSA TLGFTNTLVE VITKEKPSHI IVAFDTGAPT HRHTAFPAYK EHRPSQPEDI TVAIPYVKKI LKAFRIPVLL LEGYEADDII GTLARQAAVQ GFEVYMMTPD KDFAQLVDDH IYIYKPAFMG NGVAILDRQA VLEKWGISNV DQIRDLLALQ GDAVDNIPGI PSIGIKTAQK LIQQFGTLEN LLANADQLTG KLQENVVKYA QQGILSKELA TIHTEVPIQF DAEESRYQGP DPVALKEIFQ ELEFNSLTRR LLGEDNYNTK RTPGTQANLF DFTPTDQPTA ETASVYLDPA PFRNIYTTKH QYYLIDTPSL RQNLINYLKL QDTFCFDTET TSLDPYQAQL VGISFAYYPG EAYYVPIPAE RKAAQAIIEE FRPLLESTTQ CKVGQNLKYD NLILRTYGIE VAPPIFDTMV AHYLVAPDKP HNMNAIAESY LNYAPIPIEA LIGSRKSTQK SMRMVDVELV KEYACEDADI TLQLKRLLEL NIKQENLSKL FYEIEIPLVQ VLTAMEYQGV QIDTQVLREI SVTLGTELAA LEKEIHRLAG HAFNISSPKQ LGEILFDKLK IAGNNKKTKS GQYATGELVL ADLTKDHPIA ANILDYRELQ KLKSTYVDAL IDLISPFDGK VHTSYNQTVV TTGRLSSTNP NLQNIPIRTE KGRAIRKAFV PSKPDHVLLS ADYSQIELRI MASFSQDEHM IEAFKAGKDI HTATASKLFK VPLEQVDESM RRQAKTANFG IIYGISGFGL AQRLGISRSE AIATIQAYFQ EFHAIKAYMD RVIGQAREQG YVTTLMGRKR YLRDINSRNS TLRGFDERNA INTPIQGTAA EMIKLAMVHI YEWLQKEKLQ SKLILQVHDE LVFDVPQHEI EIMREHVAYF MKNSLPLAGV PVEVQIGLGK NWEEAH
|
| |