Gene Aasi_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0040 
Symbol 
ID6376354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp54870 
End bp58199 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content38% 
IMG OID642681238 
Productmetal-dependent phosphohydrolase 
Protein accessionYP_001957224 
Protein GI189501507 
COG category[E] Amino acid transport and metabolism
[K] Transcription
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases
[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATCG ATACTATTCT ATTTACCGCT TTTTTAAGTG TTAACTTGAT TATAGGCTTA 
CTAGCTGGAA GGCGTGTAAA AAGCTTACGC GATTTCTCAA TTGGCAATAA AGATTTCTCG
ACTGCTACGC TGACCTCTAC TGTTGTAGCT ACCTCGGTAG GGGGTGGATT TCTATTTTAT
GCTTTGCAGA ATATTTATAC CAGTGGGCTA CAATTTATTC TAGTTACAGC AGGAGGAACG
ATATGCTTGC TATTGATTGG CCAGGTCTTA TCTGTTCGGA TGGGGGAGTT TTTAAATAAT
TTATCGGTAG CTGAGTCTAT GGGGGATCTA TATGGGCCCG CTGTGCGTAT CATTACTGCT
ATAAGTGGAA TTTTGAGGGC TATTGGTGCC ATTGCCCTAC AATTCCAGGT GATCGCTAAA
ATGTTAACCT TATTGTTAGG GTTACAAGGA CCTTCAGTTA CTATTGCTGC TGCCTCTATT
GTTATTCTGT ATTCAGCCTT TGGGGGTATC CGTTCGGTTC TTATTACTGA TCTATTCCAG
TTCATTGCTT TTGTTATTTT TATTCCTATC CTAGCCTTGA TCGTCTGGAA CCATGTGAAA
AATCCTAGTC AGGTGTTACA TACCATCACT ACCAACCCCA TTTTTAGTTT TAAAGCACTC
CTATCGTGGA ATCCTAAATT GTTAAGTGCG CTAGCTTTAA TGTTATATTT TATTATTCCT
GGTATGAATC CCCCTATATT TCAGCGGATA GCCATGGCAA AGACTATTGA ACAAATTAAG
TCCTCTTTCA CTTACGCAGC AGGCATTACC CTGATTATGA TTATATCAGT AGCTTGGATT
GCTATTTTAC TGCTTGCAGA TAATCCTAAT TTAGAGCCTA GCGGGCTTGT CAATCACCTT
ATTAACCAAT ATGCTTATCC AGGTCTGAAA GGGCTTATTG CAATTGGTAT TACAGCGATG
GCCATGTCTA CAGCAGACTC TGATCTTAAT GCTGCGGCTG TGTTAGCTGT GAATGATATT
ATTAAGCCAC GTAAAAGTGA TTGGGTGGAA TCTATTACAA TAACTAGATT GCTTTCCCTA
GGATTGGGGC TATGTGCGCT AGCACTAGCC ATCCATACTA CAGATTTATT AGAACTCTTA
TTGCTCTCGG GGAGTTTTTA TATGCCTATT GTGACCGTAC CGTTATTACT GGCTATTTTT
GGTTTTCGTA GCAGTAATAG AGCAGTTCTT ATAGGCATGA CAGCTGGTTT TATAACTGTT
GTTGGGTGGA ACGTTTTCTT AGCTCATACA GATATCAGTA GCCTGATGCC AGGTATGATG
GCTAATTTGC TATTTTATAT GATCAGCCAT TATGTTTTAC AAGAAAAAGG GGGCTGGGTA
GGTATTAAAG AGAAAGGCCC GCTTTTAGCA GCTAGACAAA GCCGCTGGGA AAGCTGGAGA
AGGTTCGTTT ATACTATTAA GCATCCCCAT ACTTATACAT ATCTACAAAA GAATCTTCCG
ACTTATGAAG TTGTTTATAC GTTGTTTGCT ATTTATGTAA TAGGAGCTAC TTATGCTTCA
TTCTTCACCA TACCAGAAGC AATCGTTACC CATCACCAAA AGCTCTATGA CTTTGCCGTA
CATTCTGTTT TAATTGCAAC AGCTGGTTTT CTAACTTATC CGGCATGGCC GCCTACTTTC
AAAGCTAAAT GGTTTATTGC CTTTGCTTGG CCGATAGGAA TATTTTATAT TCTCTTTGTT
GTGGGAACCA TTTTAGTCCT GATGAGCGGC TTTCACCAAG TGCAAGTGAT GATTTTTATG
TTGAATCTGG TTATGGCTGC TTTTCTACTA TCTTGGCCAC TTATGTTGCT GCTTTCAACT
GCAGGCATGG CTATAGGCAG CGTGGTCTTA TATCTATATT GCGGCAACTT ACATTGTAGC
GATGTTGATC TGGCGGGTGA ATTCAAAGTG GTTTATGGCA TCCTTTTACT GAGCAGCTTT
CTAATTGCCA TCTTTAGATT TAAAGAGAAT AAGAAGAAAT TAGAAAGTAA AAATATTTAT
CTAGCTAGGT TGTATGAAGA AAAGAGTAAC GAGCTAGCAG AAATTTTAGG CTATAGAGAA
CAAATAATAA AAGAACTAAG TGAGGATGAA AAAAGATTAT TTGATGATAC TACGGCTGCT
TATATACAAC AAATCATCTA CCGAATGACA GATTATATGC GTTTAGAAGT AACTACAATT
AATTTAGATC AGCTTTTGTT AGAAGTTAAA GATATTCTTA AGCTCAAAGA GCTTGATAAC
ATGACTCAAT GGATAACTAA AAGACTGACA AAAGAAGAAT CCATTCATGG CGATGCAGCA
AAACTCAAGC AGTTATTAGT CAACGCTATC TTATATATTC AAGAGCACAA CCTATCGAAT
CAACCTATCA CCGTGATAGT AGAAGATGCC AAGCTAGGTC ATCGAGTAGA TTATATTAAA
GATTATACCA GGCAATTAGC AGCTTTAAAA TTTACCATTA CCATAGAAAA GGATATACCG
ACTAAAAAAG ATCTTTATAT GATCGATCAA CTGCCTTTGT TAAGTCAACA TACTAGAAAA
GGTAAATTAA TAGAAAATGC TCGTATCATC CATGCACATT ATGGGTATGC CAATCTAGAC
AACGAGCACA TGCAGGTGTA TGTACTTCCG GCGAATGTAA GAGAAGTAAG AGGCAAAGTA
ATGGAATTAT TAAGAGAGCC TGTAGAAGTA GATGGAGAAG AAGTAAGACA TCCATTGGCT
ATCGAGCTTG AAAAAGAGTT AATGGATAAA ATAAAAGGGA AAAAGATAGA TGGTAAGGTT
ATTAATAAGG CACTGGATAC TATTAAAAGA TACCATGCAG GCATTAAAAG GAAATCAGGC
GAACCTTTCT TTACGCATCC TATTGCTGTA GCATTAATTT TATTGGAATA CTGCCAAGAT
CAAGATGCAG TGGTAGCAGC ATTACTGCAT GATAAGGTAG AAGATACCAG TTTATCGCTC
ATACAAATCA GGGCTATATT TGGAGAAAAA GTAGCTTTTA TAGTGAGTAA AGTAACCAAC
CTAGAAGATA ATTTGCGTAG AGTAAGTTCA GTAGACCATG AGAATGTCTA TCGTTTGATG
AATTATGAAG ATGAGCGGGC CGCTTTTGTG AAACTTGCAG ATAGATTACA TAACATGCGC
ACTATCAGTG GTCATTCTTC ACTTGCCAAG CAAAAACATA TAGCCAATGA GACCTTGAAT
TTCTTTGTCC CGCTAGCTAA AAATTTAGGA TTAGAAACTA TAGCAAGAGA ATTAGAAAAG
CTAAGTTTAG CAGTATTGGG TAAAAGGTAA
 
Protein sequence
MSIDTILFTA FLSVNLIIGL LAGRRVKSLR DFSIGNKDFS TATLTSTVVA TSVGGGFLFY 
ALQNIYTSGL QFILVTAGGT ICLLLIGQVL SVRMGEFLNN LSVAESMGDL YGPAVRIITA
ISGILRAIGA IALQFQVIAK MLTLLLGLQG PSVTIAAASI VILYSAFGGI RSVLITDLFQ
FIAFVIFIPI LALIVWNHVK NPSQVLHTIT TNPIFSFKAL LSWNPKLLSA LALMLYFIIP
GMNPPIFQRI AMAKTIEQIK SSFTYAAGIT LIMIISVAWI AILLLADNPN LEPSGLVNHL
INQYAYPGLK GLIAIGITAM AMSTADSDLN AAAVLAVNDI IKPRKSDWVE SITITRLLSL
GLGLCALALA IHTTDLLELL LLSGSFYMPI VTVPLLLAIF GFRSSNRAVL IGMTAGFITV
VGWNVFLAHT DISSLMPGMM ANLLFYMISH YVLQEKGGWV GIKEKGPLLA ARQSRWESWR
RFVYTIKHPH TYTYLQKNLP TYEVVYTLFA IYVIGATYAS FFTIPEAIVT HHQKLYDFAV
HSVLIATAGF LTYPAWPPTF KAKWFIAFAW PIGIFYILFV VGTILVLMSG FHQVQVMIFM
LNLVMAAFLL SWPLMLLLST AGMAIGSVVL YLYCGNLHCS DVDLAGEFKV VYGILLLSSF
LIAIFRFKEN KKKLESKNIY LARLYEEKSN ELAEILGYRE QIIKELSEDE KRLFDDTTAA
YIQQIIYRMT DYMRLEVTTI NLDQLLLEVK DILKLKELDN MTQWITKRLT KEESIHGDAA
KLKQLLVNAI LYIQEHNLSN QPITVIVEDA KLGHRVDYIK DYTRQLAALK FTITIEKDIP
TKKDLYMIDQ LPLLSQHTRK GKLIENARII HAHYGYANLD NEHMQVYVLP ANVREVRGKV
MELLREPVEV DGEEVRHPLA IELEKELMDK IKGKKIDGKV INKALDTIKR YHAGIKRKSG
EPFFTHPIAV ALILLEYCQD QDAVVAALLH DKVEDTSLSL IQIRAIFGEK VAFIVSKVTN
LEDNLRRVSS VDHENVYRLM NYEDERAAFV KLADRLHNMR TISGHSSLAK QKHIANETLN
FFVPLAKNLG LETIARELEK LSLAVLGKR