Gene Aasi_1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1422 
Symbol 
ID6377524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1825284 
End bp1826984 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content38% 
IMG OID642682493 
Producthypothetical protein 
Protein accessionYP_001958443 
Protein GI189502726 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.432451 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAAA GATGGGTGTT TCAAACTATG CCAGAGCCTA CTGTGGTTGC ATCATTAGCC 
AATGCTATTG GCGTAGATAG CTCTATTGCT GCTTTATTGG TACAAAAGGG AGTAACTGAT
TTTAATCAGG CAAAAGATTA TTTTAGACCT TCGTTAGCCG ATCTGCACGA TCCGTTTATT
ATGAAGGATA TGGATAAGGC AGTAGGCCGT TTACTGCGAG CTATTGAGAA CCAAGAAGAA
ATATTAATCT ATGGCGACTA TGATGTGGAT GGAGTTACAT CAGTATCGTT GGTATATGGC
TTTTTGAAGC AATATCATAG TAAGCTACAA TTCTATATTC CAGATAGATA TAAAGAAGGT
TATGGGGTTT CTAAGCAAGC TATCAACTGG GCTATTGAGA CAGGTATTTC CCTAATTATA
ACATTAGACT GTGGTATAAA AGCCACAGAA TGTATTCAAC AAGCCCAGAT AGCAGGAATT
GATGTAATTG TATGTGATCA CCATGAACCA GGTGAAGGCC TGCCACCTGC CTATGCAATT
TTAGATCCTA AACAAAAAAC ATGTCCTTAT CCTTTTAAAG AGCTTTCTGG TTGTGGGGTT
GGTTTTAAGT TACTTCAAGC TTTTGTATTA AAGCAAAACA TTTCATTAGA CATCCTCTAT
CGATACCTGG ACTTAGTAGC TATTAGCATT GCCTGCGACT TAGTTCCACT CACGGACGAA
AATAGAATAC TTGCTTACCA TGGACTAAAG CGCTTAAATA ATAGTCCTTC ATTTGGCATA
CAAGCTATTA TACAAGTTGC TAATTTACCC TGGACATTAG GAATTTCACA ATTGGTTTTT
GGTTTAGGGC CTCGCCTTAA TGCAGCAGGC AGGGTGGACC ATGGTAGCTT AGCTGTAAAT
TTACTTTTGG CAGAAGATTC AGTGACTGCT AGCTCGCTAG CACGTCAGAT TGAAGACAAA
AATGGTTTTA GAAGATACTT AGACAGTACT ATTACAGCAG AAGCACTTCA GTTGATGAAA
GCTAGCGATG CAAGTTTAAT AGCTAAAACT ACGGTGCTTT TTAAAGAAGA CTGGCACAAG
GGAGTAATTG GTATTGTTGC TTCTAGATGT ATAGAACATT ATTACCGTCC TACCATCATT
CTTACAGCTT CTGGTAATAA GGCTACAGGT TCTGCACGTT CTGTGGCAGG CTATAATATA
TATGAGGCCA TTACCGAATG TGCTGAGCTA TTGGAACAAT ATGGAGGGCA TGCCCATGCA
GCAGGCCTTA CATTACCTTT AGAAAATATT ATATCTTTTC AAGAACGTTT TGAGCAAGTT
GTCTCTAGCA CTATTTCGCA AGAATTGCTT ATTCCTGTTC AAAAAATAGA TTTGTTGCTG
CCGCTTGAAA AGGTTACCAA TAAGTTTTAT AATATTCTTA ACCAGATGGC TCCTTTTGGT
AATGGTAATA TGAGGCCTGT ATTTGCAACA GAGTTAGTGA TAGCTACTAA ATATAGAATT
CTTAAAGAAA ATCATTTAAA ACTAACGGTA CAAGAACCTG CTTCAGGCAT CTGTATAGAC
GCTATTGGTT TTGGATTAGC AAAATATGCC CATTTGGTGT GTGACCGTAA GCCATTTAAG
ATAGCTTATA CCATAGAGAG AAATAATTAC CAAGGGCAAG TAAATTTACA ACTTAATATT
AAAGGTTTAC AGCCAATATA G
 
Protein sequence
MDKRWVFQTM PEPTVVASLA NAIGVDSSIA ALLVQKGVTD FNQAKDYFRP SLADLHDPFI 
MKDMDKAVGR LLRAIENQEE ILIYGDYDVD GVTSVSLVYG FLKQYHSKLQ FYIPDRYKEG
YGVSKQAINW AIETGISLII TLDCGIKATE CIQQAQIAGI DVIVCDHHEP GEGLPPAYAI
LDPKQKTCPY PFKELSGCGV GFKLLQAFVL KQNISLDILY RYLDLVAISI ACDLVPLTDE
NRILAYHGLK RLNNSPSFGI QAIIQVANLP WTLGISQLVF GLGPRLNAAG RVDHGSLAVN
LLLAEDSVTA SSLARQIEDK NGFRRYLDST ITAEALQLMK ASDASLIAKT TVLFKEDWHK
GVIGIVASRC IEHYYRPTII LTASGNKATG SARSVAGYNI YEAITECAEL LEQYGGHAHA
AGLTLPLENI ISFQERFEQV VSSTISQELL IPVQKIDLLL PLEKVTNKFY NILNQMAPFG
NGNMRPVFAT ELVIATKYRI LKENHLKLTV QEPASGICID AIGFGLAKYA HLVCDRKPFK
IAYTIERNNY QGQVNLQLNI KGLQPI