Gene Aasi_0024 details


Gene Information

Locus tag: Aasi_0024
Symbol:
ID: 6376578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 30465
End bp: 32255
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 35%
IMG OID: 642681224
Product: excinuclease ABC subunit C
Protein accession: YP_001957210
Protein GI: 189501493
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
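
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(30465, 32255)
print(length, protein_length_aa(length))  # prints "1791 596"
```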


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCCA AGCAATATGA AATAACAGAA ATCGGAAAAT TACCTAACCA ACCTGGAGTT 
TATTTCTTTT ATAATAAACA GCAACAGATT ATCTATGTGG GCAAAGCAAA GGACCTAAAA
AAGAGAGTTT CTAGCTACTT TAACAAAAAC CAACAGGATA ATTTAAAAAC TAGGAGAATG
GTAGAGGAAA TAGCTGCTAT TAGTTTTACA ATAGTTAATT CTGAGTATGA AGCTTTGCTG
CTAGAAAATA ACCTTATCAA GAGATTTCAA CCACGTTATA ATATTTTACT GAAAGATGAT
AAAAGCTTCC CTTACATATG CATTACACGC GAGCGGTTTC CAAAAGTCAT TACTACCAGG
CAACCAGGTG CTAAGCTAGG ACAATATTAT GGTCCATTCA CAGACCTTAA AAACATGTAT
CAAATCCTAG ATCTCATTAA GAGTCTATAT ACCCTCCGCA CTTGCAACTA TAATTTATTA
GAAGAAAACA TCTGTAAACA TAAATTTAAA GTTTGTTTAG AATATCATAT AGGACATTGT
AAAGGACCTT GTGAAGGATT GCAAACCGAA GAAGCTTATA ACCAAGACAT TCAACAAATT
GAGCATTTTC TTAAAGGCAA TATCAATGCT GTCAAAAAAC ATTTTAAAGA AAAGATGTCC
CAAGCAGCAG CAGCAATGGA CTATAAGAAT GCACAAGCTT ACAAAGAGAA AATAGCAGCT
TTAGAAAACT ACCAATCAAA ATCCTTGGTA GTCAATCCGC AAGTAGGAAA TCTAGATGTA
TTTGCTATTG TATCTGATGC TAAAGCTGCT TTTATTAGCT ATTTACAAAT AAAAGAGGGA
GCTATTATCT GTACACAAAA TACAGAGATT AAAAAGCAGC TAGAAGAACT AGATGAAGAT
ATACTTCCTT TAATCATTTT ACATTTTAGA GAAAAGTATG CAAGTAATCC ATCAGAGATT
TTAGTCAACA TACCTATTGC TACTACCTTT GACAAAGCAA TGCTCACTGT ACCTAGAATA
GGAGATAAAA AGAAATTAGT AGAATTAGCT ACCAAAAATG TACTCTTTCT AAAAAGAGAA
GCATTACTAC GCCAAGAAGA TAACCAAAAC CGTGCTAATA AGACGCTTGT ATTATTACAA
CATGACTTAC AATTAAAAGA TTTACCCTTG CATATCGAAT GCTTTGATAA TTCTAATATA
CAAGGCACAC ATCCCGTAGC AGCCATGGTT GTCTTCCAAC ATGGAAAGCC TGCAAAAAAA
GAGTATCGGC ATTTTAATAT TAAAACTGTT GTGGGGCCTG ATGATTTTGC ATCCATGCGA
GAAATAGTTA CTAGAAGGTA CAGAAGGTTG ATAGAGGAAA ATCAAAAACT TCCTGATCTC
ATTGTTATTG ATGGTGGTAA AGGGCAGTTG GGTGCTGCAG TAGCAGCTTT GCAAGAATTG
GGTATTTATG GACAGATACC AATTATAGGA ATAGCGAAAC GTCTGGAGGA AATCTACTTT
CCAGAAGATA GTTACCCTAT TCACATAAGC AAACAATCCC CCTCTTTGAA ACTGCTACAG
CAGATACGGA ACGAAGCACA TAGGTTTGCC ATTACTTTCC ATAGAGACAA GCGTAGTAAA
ACCAGCCTTA AAAGCCAACT AGAGTCTATA CCGGGCGTAG GAGAAAAAAC AATCACCACA
CTTTTACAAC ACTTTGGTTC TGTCCAAAAC ATTAAAGAAG CGAGTCTAAA GGCATTGGCA
AGCCAGGTTG GTAATAAAAG GGCAATACAA ATTAAGGAAC ACTTGAAGTA G
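
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before any statistic such as GC content can be computed from it. A minimal sketch, with hypothetical helper names, of how that could be done:

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGGAAGCCA AGCAATATGA AATAACAGAA ATCGGAAAAT TACCTAACCA ACCTGGAGTT"
print(len(clean_sequence(first_line)))  # prints "60"
```

Passing the full sequence text to `gc_percent` gives the value for the whole gene, which can be compared with the GC content reported in the record.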
 
Protein sequence
MEAKQYEITE IGKLPNQPGV YFFYNKQQQI IYVGKAKDLK KRVSSYFNKN QQDNLKTRRM 
VEEIAAISFT IVNSEYEALL LENNLIKRFQ PRYNILLKDD KSFPYICITR ERFPKVITTR
QPGAKLGQYY GPFTDLKNMY QILDLIKSLY TLRTCNYNLL EENICKHKFK VCLEYHIGHC
KGPCEGLQTE EAYNQDIQQI EHFLKGNINA VKKHFKEKMS QAAAAMDYKN AQAYKEKIAA
LENYQSKSLV VNPQVGNLDV FAIVSDAKAA FISYLQIKEG AIICTQNTEI KKQLEELDED
ILPLIILHFR EKYASNPSEI LVNIPIATTF DKAMLTVPRI GDKKKLVELA TKNVLFLKRE
ALLRQEDNQN RANKTLVLLQ HDLQLKDLPL HIECFDNSNI QGTHPVAAMV VFQHGKPAKK
EYRHFNIKTV VGPDDFASMR EIVTRRYRRL IEENQKLPDL IVIDGGKGQL GAAVAALQEL
GIYGQIPIIG IAKRLEEIYF PEDSYPIHIS KQSPSLKLLQ QIRNEAHRFA ITFHRDKRSK
TSLKSQLESI PGVGEKTITT LLQHFGSVQN IKEASLKALA SQVGNKRAIQ IKEHLK
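
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that step follows; it assumes the forward reading frame starting at the first base, and it relies on table 11 assigning amino acids to codons as the standard code does (the two differ in the permitted start codons, which this sketch does not model). The names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGGAAGCCA AGCAATATGA AATAACAGAA"))  # prints "MEAKQYEITE"
```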