Gene Aasi_0839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0839 
Symbol 
ID6377080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1064041 
End bp1067358 
Gene Length3318 bp 
Protein Length1105 aa 
Translation table11 
GC content37% 
IMG OID642681978 
Producthypothetical protein 
Protein accessionYP_001957939 
Protein GI189502222 
COG category[E] Amino acid transport and metabolism
[K] Transcription
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases
[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0994214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGATA TCATCATTTT TATTGTTTTC TTGCTCCTTA ACCTTATTAT AGGAATCCTT 
TACCGTGGTA AGCAACAATC TTTTAAAGAA TATGCCATCG GAAATAAAAA CTTTTCTACA
GCTGCACTAA TTGCTACCCT TGTGGCTACC CAGGCTTCTG GTAGCATGTT TCTTAATGGA
CTAGAACAAA CTTACACCAA TGGGTTATAC TATGTCCTGG CAAGTGTAAT AGGAAGTTTC
TTGGGGTTAT TAATTACAGG GTACATCATA GGGCCTCGTT TAGGGAACTT TTTAAACTGC
GTTTCTATAG CCGATATCAT GAGCAGCGTT TATGGTAAAG GTGTACAATT GATTACTGCT
ATCTGCACCG TATTAGCAAC AATAGGCTAC ATAGCTATTC AATTTAAAGT AATATCAAGA
ATTTTGGAGG CCCTATTTAA CTATCAAGGA CCTGAAGTTA CTGTTATTGC AGCTACCATT
ATTATTATTT ATTCAGCGTT TGGAGGCATT AAATCCGTTA CTTTTACAGA TGTCATTCAA
TTCATTACTT TTGGCACTTT GCTTCCTGTT TTAGCCTTGG CCATTTGGCA CCACATTCCC
AATATTAGCC AGGTAATACA TACCCTCTCA GAAAATCCTA ATTTTAATTT TGAACACGTG
ATCAGTTGGA GTCCTCAGTT TATAGGGACC TTGGCTTTTA TGTTTGTGCT TATAGTTCCT
GGTTTACCTC CTGAGATATT TCAGCGTATG GCCATGGCAC GAAATACTAG TCAGATAAAA
CGTGCTATCG CCTATGCGGC GGTACTTTGG GTTTTAATAC AACTATTTAT AGTCTGGATC
GCTGTTTTAC TTTTGTCTGA TAATCCACAT TTAACTACCA ATCAAGTAGT AGGGTATATG
ATTAAGACAT ACACCTATAC AGGCCTTAAA GGATTTTTAG GGGTAGGTGT TATTGCTTTG
GCTATGTCCA CAGCAGACAG TGCGCTAAAC TCTTGTGCGG TACTAGTTGC TAATGATATT
CTCCCTCCTT TAAAAATTAC TAAACAAGCC TCTGTACAAG CAGCTGCTAT TGCGACTTTT
GCCATAGGAT TTTTGGCTGT CCTATTAACG CTCTCTATTC AAAATATTTT ACAAATTATA
TTGCTCTTTT CTAATTTCTC TTTGCCTATC CTCACTATCC CCATGTTGCT TACTATCTTC
GGATTTCGTA CTAGTAGGCG AGTTATTTAC ATAGGTATGG GGGCTGGTTT TGTTATGACT
GCTCTTCTAT TACTCTATTT CAAAAATGTT AATAGTTTCT TTCCCGGTAT GATGGCTAAC
CTCATTTTCC TAGTAGGTAG CCACTATTTG CTCAAAGAAG AAGGGGGTTG GATAAAACAA
GAAACAGAAG CAGGGATAGA AGCCATTACC TGGAAAGATC GATGGACTCA ATTAAAGTCC
TTTAACTTAC CAGCTTACCT TGAAAAAAAT CTGCCTGAAC AAGAGTATTA TTATCCTTTG
CTAGCCTTTT ATTCACTTAT TGCTACGTAT GTCTCGCTAT ACCATTTACC ACATGTTATC
GAGCAAGAGT ATCTAATGCT TTATAGAACC ATCCAATATT CCGTACTTGT GATTACTACA
AGTTTATTGG GCTTTCAAAT TTGGCCAGCT CCACTAAAAA ACAAGCGATT ATTAGCATGG
GTGTGGCCTT TACTTATCTT CTATACGCTC TTTTTTGTAG GAGGCATGAT TGTGATTATG
AGTGGATTTC AATCAAGTCA AATTTTAATA TTTATGCTAA GCCTTGTGAT GATTGCCTTA
TTTACTTATT TGCCATTAGC GCTCACATTA GCTATAGCTG GAATAATAGT TGCTGCAGGG
GTATTTAAGT GGGTTACAGG CCAAGACATC TCACCTAATC AAGCAATACC TATTTCTTTT
CATTTTAGTC ATGGGTTGCT CTTATTTAGT AGTTTACTCA TAGCCTTATT TAGATTTAAA
CAAAGTAAAG AAACTCTGCA AAAGAAGAAT GCTTACCTAA TGACTCTTAA TAACCAGTCT
AAAGAAGAAA TGCAGGAGGT ACTTAGATAT AGGGAGGAAA TACTAAAAGA CCTTACTAAA
GAAGATACGC TTCTTTTTGA TGAGACGATG GCTGCTTATA TGAGGCAGGC TATTTATCGC
ATGACAGATT ACATGCGACT CGATGTAAAA AGGATAAGTA TGGAGGAGTT AATTTCTGAA
GTTAGAGAAA CTATCAAGTT AAGTGACTTT CCACAAATGC CAGACCTTTT AATCCAGAAT
AAATCCAAGC AACCTACTAT AGAAGTAGAT AGGGAAAGAA TAAAGCAAAT GCTTGCAAAC
AGTATTACAT ACCTTTATCA AAGTAGCAAT GCCACAAGGC CCATTACAAT TGCCTTAGAA
GATGCCAAAT TAGGCTACAG TGTTGCCCAT ATGCAAAACT ATGCGCGGGA ATCAAATGCC
ATTAGGATTG TTTTAACAAT AGAAAAGTCT TTACCAAAAA TACAACCTAT TTACAATCTC
AATCACAATT CTATAGTGGA CCAAGTGGCC CGTGATAGAA ACCGCCGTAT GCTCTTGCAA
AATGCTCGTA TTGTAGATGC TCATTATGGT TATGCAGAAC TAGACAGCCA TATTATTACC
CATTTATATG TTATTCCTGT AAACCTCCGA GAGGTACGAG GTAAGGTCAT GGAAATTATT
AGAGAGCCTG CTGTTGTGGA TCCAGCAGAG TTAAATCACC CCCTGTCAAT TAAGGTTGAA
GAAGAGCTCA AAAATAAATT AACTGCTAAA GGAATAGATT TTAAACCTAT AAGTAAAGCA
CTCAATGTAA TAAAAAGATA CCATGCAGGA GTAAAAAGAA AGTCAGGAGA GCCTTTCTTT
ACGCATCCTA TTGCTGCTGC TTTAATCCTT TTAGAATACT GCCAGGACCC AGATGCAGTT
ATAGCAGCAC TTTTGCATGA TACAGTGGAA GACACCGGGC TTTCTCTTAT TCAGATAAAG
ATTATGTTTG GAGAGACAGT CGCTTTTTTA GTGCAAAAGG TGACTAATTT AGAAGAAAAT
AGAAAACGCC TGATGCTTGA AGATCATGAA AACATTGCAC GTCTTATCAA TTATGAAGAC
AAGCGAGCAG CCTATGTAAA ACTAGCTGAT CGCATGCATA ACATGCGCAC CATCAGTGGC
CATTCTTCCC TTGCCAAACA GAAGCATATT GCTTCTGAAA CCTTAAATTT CTTTGTACCT
CTGGCAAAAA ATCTAGAATT GACAACTGTA TCACAGGAAT TGGAAATATT AAGTCTAGAA
GTACTAAAGA AAAAATAA
 
Protein sequence
MIDIIIFIVF LLLNLIIGIL YRGKQQSFKE YAIGNKNFST AALIATLVAT QASGSMFLNG 
LEQTYTNGLY YVLASVIGSF LGLLITGYII GPRLGNFLNC VSIADIMSSV YGKGVQLITA
ICTVLATIGY IAIQFKVISR ILEALFNYQG PEVTVIAATI IIIYSAFGGI KSVTFTDVIQ
FITFGTLLPV LALAIWHHIP NISQVIHTLS ENPNFNFEHV ISWSPQFIGT LAFMFVLIVP
GLPPEIFQRM AMARNTSQIK RAIAYAAVLW VLIQLFIVWI AVLLLSDNPH LTTNQVVGYM
IKTYTYTGLK GFLGVGVIAL AMSTADSALN SCAVLVANDI LPPLKITKQA SVQAAAIATF
AIGFLAVLLT LSIQNILQII LLFSNFSLPI LTIPMLLTIF GFRTSRRVIY IGMGAGFVMT
ALLLLYFKNV NSFFPGMMAN LIFLVGSHYL LKEEGGWIKQ ETEAGIEAIT WKDRWTQLKS
FNLPAYLEKN LPEQEYYYPL LAFYSLIATY VSLYHLPHVI EQEYLMLYRT IQYSVLVITT
SLLGFQIWPA PLKNKRLLAW VWPLLIFYTL FFVGGMIVIM SGFQSSQILI FMLSLVMIAL
FTYLPLALTL AIAGIIVAAG VFKWVTGQDI SPNQAIPISF HFSHGLLLFS SLLIALFRFK
QSKETLQKKN AYLMTLNNQS KEEMQEVLRY REEILKDLTK EDTLLFDETM AAYMRQAIYR
MTDYMRLDVK RISMEELISE VRETIKLSDF PQMPDLLIQN KSKQPTIEVD RERIKQMLAN
SITYLYQSSN ATRPITIALE DAKLGYSVAH MQNYARESNA IRIVLTIEKS LPKIQPIYNL
NHNSIVDQVA RDRNRRMLLQ NARIVDAHYG YAELDSHIIT HLYVIPVNLR EVRGKVMEII
REPAVVDPAE LNHPLSIKVE EELKNKLTAK GIDFKPISKA LNVIKRYHAG VKRKSGEPFF
THPIAAALIL LEYCQDPDAV IAALLHDTVE DTGLSLIQIK IMFGETVAFL VQKVTNLEEN
RKRLMLEDHE NIARLINYED KRAAYVKLAD RMHNMRTISG HSSLAKQKHI ASETLNFFVP
LAKNLELTTV SQELEILSLE VLKKK