Gene Aasi_0155 details

Gene Information

Locus tag: Aasi_0155
Symbol:
ID: 6376586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 177034
End bp: 178155
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 34%
IMG OID: 642681345
Product: A/G-specific adenine glycosylase
Protein accession: YP_001957330
Protein GI: 189501613
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
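
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 177034
end_bp = 178155
reported_gene_length = 1122
reported_protein_length = 373

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Every codon encodes one residue except the terminating stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1122 0 373
print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the reported gene length (1122 bp) and protein length (373 aa) are consistent with the coordinates.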


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAACAT TTTTATCAGG TTTACAAAGT CCTAATGTAG ATCTACTTAC TAATTACTTT 
GCCATAAAGC TTATCGAATG GTACCAACAC CATCATCGAG CTTTACCTTG GCGTGAAACA
AAAGATCCTT ACAAAATCTG GCTTTCGGAA ATTATTTTAC AGCAGACCAG GGTGGCACAA
GGACTTCCGT ATTATCAACG TTTTATTGAG AACTATCCTA CTATACACGA TCTAGCATCA
GCTAGTGAAA CAGCCATATT AAGGGTATGG CAGGGATTAG GGTATTATAC AAGAGCTAGA
AATTTACATG CCTGCGCACG TACAATAGTA ACACAATTTC AGGGTAAATT TCCTAACAAC
TACAAAGCAT TATTATCTTT ACCAGGTATA GGTGTTTATA CAGCTGCAGC CATTGCTTCT
ATTGCTTTTA AAGAGCCTAT TCCGGTAATA GATGGTAATG TATATCGAGT TTTAGCAAGA
ATATTTGACA TAGAAACAGC CATCAATAGT ACTAAAGGCA AACATATTTT CAACCAACTA
GCCCAAACAC TAATTTCTAA AACTGCGCCA GATATATATA ATCAAGCCAT CATGGAATTT
GGAGCCATAC AGTGTACTCC TTTAAAGCCC CTATGTAATA CTTGTATATT TAAGATGGAT
TGCTCAGCAT TTCTTGCCAA TAAACAGCAT CTATTACCTG TTAAAGAAGC CAAAGTAAAG
ATTAAACAAC GTTTTTTTCA TTATCTATGC ATCCAACTAG ATGATGACCA ATTATTTATG
AAAAGTAGAA AGCCAGGAGA CATTTGGACA GGCTTATATG ATTTTTACTT AGTAGAAGAA
AGCGAACGTA AAGAGTTTGA TCAATTAGAA GATGAACTAG TACAATTGAT AAAAAAACAC
CAACTCTATA TCGAAAAGGT TCCTACAGTT TATAAGCATA TACTTACTCA TAGGGTACTT
TATGCATCAT TTTTTAAAAT CATTGTTACA AAAGCCTTTT TGGCAGATGC CAAAATTCTA
TTGGAAGATA GCCTTACTGA TACCTTTTCC ATAGAAGCCA CTAAATTCCT ACCGAAACCT
AAACTAATCT GCAATTTTTT AGAAGAATAT TTATATATTT AA
 
Protein sequence
MKTFLSGLQS PNVDLLTNYF AIKLIEWYQH HHRALPWRET KDPYKIWLSE IILQQTRVAQ 
GLPYYQRFIE NYPTIHDLAS ASETAILRVW QGLGYYTRAR NLHACARTIV TQFQGKFPNN
YKALLSLPGI GVYTAAAIAS IAFKEPIPVI DGNVYRVLAR IFDIETAINS TKGKHIFNQL
AQTLISKTAP DIYNQAIMEF GAIQCTPLKP LCNTCIFKMD CSAFLANKQH LLPVKEAKVK
IKQRFFHYLC IQLDDDQLFM KSRKPGDIWT GLYDFYLVEE SERKEFDQLE DELVQLIKKH
QLYIEKVPTV YKHILTHRVL YASFFKIIVT KAFLADAKIL LEDSLTDTFS IEATKFLPKP
KLICNFLEEY LYI
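
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. The record lists translation table 11, which uses the standard codon assignments but allows alternative start codons such as TTG; an initiating codon is read as methionine. The code below is a hedged illustration rather than the tool that produced this record: `translate_cds` and `gc_percent` are hypothetical helper names, and the start-codon set is a deliberately small subset of those permitted by table 11.

```python
# Standard codon assignments in NCBI order (T, C, A, G), shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only a subset of the table 11 start codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna):
    """Return the percentage of G and C bases in a sequence."""
    dna = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 42 bases of the gene sequence shown above.
first_30 = "TTGAAAACAT TTTTATCAGG TTTACAAAGT"
last_42 = "AAACTAATCT GCAATTTTTT AGAAGAATAT TTATATATTT AA"

print(translate_cds(first_30))  # MKTFLSGLQS
print(translate_cds(last_42))   # KLICNFLEEYLYI
```

The two outputs match the first ten and last thirteen residues of the protein sequence above. Applying `gc_percent` to the full 1122-bp gene sequence would be the way to compare against the GC content reported in the record; the short fragments used here are not expected to reproduce that figure.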