Gene Aasi_0859 details

Gene Information

Locus tag: Aasi_0859
Symbol:
ID: 6376947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1090739
End bp: 1092535
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 34%
IMG OID: 642681996
Product: hypothetical protein
Protein accession: YP_001957957
Protein GI: 189502240
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
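
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the listed gene length only fits that convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1090739
end_bp = 1092535

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1797 bp
print(protein_length, "aa")  # 598 aa
```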


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00473
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAACTA AAAAGTATTT ACCTAACGAT GTACGCTTAC TACCTGACCA GCCTGGTATA 
TACTTGTTTT ATAATAAGAA AGATGAAATT ATATATGTTG GAAAATCTAT TAATTTAAGA
AAGAGGGTTA GCAGCTATTT TAGTAAAAGT CAAATAAAAA AGGCAACTCC TAAAACTCAA
AGAATGGTAG CTGATATACA AGCCATTAGC TTCACCATTG TTGATTCTGA ATATGATGCC
TTAGTATTAG AAAACAACTT AATTAAATCT ATTCGTCCTG AGTACAATAT TTTGTTAAAA
TATGGCAAAG GCTATTCCTA TATCTGTATT ACTAATGACC GGTTTCCTAA AGTCATTACT
ACTCGCCAAG TAAACACTAA ACTAGGAAGA TATTATGGCC CTTTCAATAA CTTATATTAT
ATGTACCAAA TGATGGACCT TATCCAGCGG GTATACGGTC CTCGGACCTG TAACTATAAT
CTCTCCAAGC AGAACATAAA GAAACACAAG TTTAGGGTAT GCTTATTATA CCATATACGT
AAGTGTAAAG GACCTTGTCA AGGTTTACAA ACCGAAGATG CCTATAAATT AGACATTGCC
CAAATAGACC ACCTTCTACG AGGAAATTTA AATAAGGTTA AGAAGGACTT CAAAGAAAGA
ATGCAGCAAG CAGCTCTCAG ATTAGCTTAT AAGGAAGCTC ACGACTATAA AATGAAATTA
GCTTCTTTGG AAGCTTACCA AGCAAAATCA CTTGTTGCTA ATCCTAATTT AGGAAACTTA
GATGTTTTTG GTATCGTATC AGATGAAGAA GTAGCTTTTA TAAGCTACCT ACAGGTTAAA
AACGGTGCTA CCAACTTTGC ACAAACTATA CAGGTTAAAA AAAAGCTAGG TGAAGCAGAT
ACCGATATAC TCCCTCTTAT TATTCTTAAC TTCAGGGAAA TGTATAACGG TACTGCTCCG
GAAGCCTTAG TAAATGTTGT ACCACTTATC AAACTTGATA AACTTACTCT GACCAGCCCT
AAAATTGGAG ACAAGAAAAA GCTAGTAGAC CTTGCTATCA AAAATGCATT ATTTTTAAAG
AAAGAGTATT TACTTAAAAA AGAAGAAAAC CAAAATAGGC CCTACAAAAC ACTACAATTA
TTACAGCAAG ACCTACAATT AAAAAGCCTA CTTTTACACA TTGAATACTT TAACAATGCT
AGTATACAAG AAAACAATAC TGCAGCAGCC TTAGTTGTGT TTAAAAATGG GAAGCCTGCA
AAAAGAGAAT ATCAAAATTT TAATATTAGA ACTGTTGTAG AACCTATTGA TCTTACTTCT
GTGCATGAAA TAGTCAGTAG AAGGTATAAA CGACTAATAG AAGAAAAGAG TACGCTACCT
GATTTGATTA TAATTGACGG AGAAACAACA CAACTTAATG TTGCAGTACA AGCCTTACAA
GAAATAGGAA TTTATGGACA AGTACCAATC ATTAGTATAG CTAAACGATT AGAAGAAATT
TATTATCCTG GTGATACTTA CCCTACTTGC CTAAGCAAAC AATCACCATC TTTAAAGTTA
TTACAACAAG TCCGTAACGA AGCTCACCGA TTTGCTATTG GCATTCACCA CAGCCAATTC
AGTAACATCA GTCTTATTAG CCAGCTACAA AACATACCAG GTATCGGTAA CAAAACGGTT
GATAAGCTAG TACAGCAATT TCAATCTGTT GACAATATTA AAGCTGCAAG TTTAGAAACA
CTAGCTGAGT ATGTAGGTAC TAGCAGAGCA ACCCAACTTA AAGCATATTT ACAATAA
 
Protein sequence
METKKYLPND VRLLPDQPGI YLFYNKKDEI IYVGKSINLR KRVSSYFSKS QIKKATPKTQ 
RMVADIQAIS FTIVDSEYDA LVLENNLIKS IRPEYNILLK YGKGYSYICI TNDRFPKVIT
TRQVNTKLGR YYGPFNNLYY MYQMMDLIQR VYGPRTCNYN LSKQNIKKHK FRVCLLYHIR
KCKGPCQGLQ TEDAYKLDIA QIDHLLRGNL NKVKKDFKER MQQAALRLAY KEAHDYKMKL
ASLEAYQAKS LVANPNLGNL DVFGIVSDEE VAFISYLQVK NGATNFAQTI QVKKKLGEAD
TDILPLIILN FREMYNGTAP EALVNVVPLI KLDKLTLTSP KIGDKKKLVD LAIKNALFLK
KEYLLKKEEN QNRPYKTLQL LQQDLQLKSL LLHIEYFNNA SIQENNTAAA LVVFKNGKPA
KREYQNFNIR TVVEPIDLTS VHEIVSRRYK RLIEEKSTLP DLIIIDGETT QLNVAVQALQ
EIGIYGQVPI ISIAKRLEEI YYPGDTYPTC LSKQSPSLKL LQQVRNEAHR FAIGIHHSQF
SNISLISQLQ NIPGIGNKTV DKLVQQFQSV DNIKAASLET LAEYVGTSRA TQLKAYLQ
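
The gene and protein sequences above are printed in blocks of 10 residues, so they need the whitespace removed before reuse. The following is a minimal sketch, not the database's own pipeline, of how the GC content and the translation could be recomputed from the gene sequence. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ in permitted start codons, which this sketch ignores). The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the usual TCAG order; assumed valid for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGAAACTA AAAAGTATTT ACCTAACGAT"
print(translate(first_blocks))  # METKKYLPND
```

Pasting the full gene sequence into `translate` should reproduce the listed protein sequence, and `gc_percent` on the same input can be compared with the GC content field of the record.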