Gene Aasi_0210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0210 
Symbol 
ID6376399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp229067 
End bp230305 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content37% 
IMG OID642681398 
Producthypothetical protein 
Protein accessionYP_001957383 
Protein GI189501666 
COG category[R] General function prediction only 
COG ID[COG1408] Predicted phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTG GCCTATTTTT TATTCTGTTT TTAGTACTTA TAGATATATA TTTCTATCAA 
GGATTAAAAA TTATTACATC ACCTCTACAG CCAAGTAGTA GGGTAATTAT TTATACCTTA
TATGGAATTT TTGTTACGAG TACAATCGTA ACTGTGCTGA CTTATGGGTG GCTAAGATGG
GGATTAAGAA ACAATTTTGT GAAGTATAGG GTCATTCCAG CTTGGTTTAT CAAATACATT
ACTAAGCTCT TTAGTGTTTC TTTCCTCTTT GTAGATGATA TTGCCAGATC AGCTAGATGG
TTGGTATATA AAATAGCTTA CCTCTTTAAT GATACAATAA CAGATAGTGC GCTAATGCCT
AGATCGGTTA TGGTGACAAA AGCAGGTATG TTGGCCGCTA CAGTGCCTCT AGTCACATTG
AGCTATGGTA TACTAGTAGG CGCACATGAT TATAGGGTAA GAAGATTACG TATAAAATTA
CCGCATCTCC CTAATGCTTT TCACGGATTA AAGATTGGTC AGTTATCTGA TATACACACA
GGTAGCTTTT TTAACAAAAA GGCAGTAGCA GGTGGCGTGG ATATGCTGCT ACGTGAAAAG
CCAGATGTGA TTTTCTTTAC CGGCGATCTG GTAAATGATA CGGCCGATGA AGTGAAAGAG
TATATTCCTA TTTTTAGCAG GTTAAAAGCA CCGTTAGGTA TTTATTCTGT TTTGGGTAAT
CATGATTATG GAGACTATGT GCCTTGGCCT TCAATTACAG CTAAACAGAA GAATTTGCAA
GATTTACGTA ATGCGCATCA GCTAATGGGA TGGACGCTGC TCATTAATGA GCATATTATA
CTTACAGAAG GGGCTGACAA ATTGGCTATA ATTGGGATAG AAAATTGGGG GCTACAATTT
TCACAATATG GTAAATTAGT ACAAGCATAC CAGGGTACAG CAGACATCCC TGTAAAATTG
TTATTATCCC ATGATCCTAG CCACTGGGAT GCGGAAGTAC GACCAAAATT TAGTGATATA
GACATTACCT TTGCAGGTCA TACACATGGG TTTCAATTTG GTATTGAAAT AGGCACATTT
AAATGGAGCC CTGTACAATA TCAATACAAA CAATGGGCTG GATTATATCA GCAAGGTGCC
CAGTATCTAT ATGTTAATAG AGGGTTTGGT TATTTAGGTT ATCCAGGTAG AATCGGTATT
TTACCTGAAA TTACTATTGT AGAGCTTGTA AAGGAGTAA
 
Protein sequence
MKRGLFFILF LVLIDIYFYQ GLKIITSPLQ PSSRVIIYTL YGIFVTSTIV TVLTYGWLRW 
GLRNNFVKYR VIPAWFIKYI TKLFSVSFLF VDDIARSARW LVYKIAYLFN DTITDSALMP
RSVMVTKAGM LAATVPLVTL SYGILVGAHD YRVRRLRIKL PHLPNAFHGL KIGQLSDIHT
GSFFNKKAVA GGVDMLLREK PDVIFFTGDL VNDTADEVKE YIPIFSRLKA PLGIYSVLGN
HDYGDYVPWP SITAKQKNLQ DLRNAHQLMG WTLLINEHII LTEGADKLAI IGIENWGLQF
SQYGKLVQAY QGTADIPVKL LLSHDPSHWD AEVRPKFSDI DITFAGHTHG FQFGIEIGTF
KWSPVQYQYK QWAGLYQQGA QYLYVNRGFG YLGYPGRIGI LPEITIVELV KE