Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0839 |
Symbol | |
ID | 6377080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1064041 |
End bp | 1067358 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642681978 |
Product | hypothetical protein |
Protein accession | YP_001957939 |
Protein GI | 189502222 |
COG category | [E] Amino acid transport and metabolism [K] Transcription [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0994214 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGATA TCATCATTTT TATTGTTTTC TTGCTCCTTA ACCTTATTAT AGGAATCCTT TACCGTGGTA AGCAACAATC TTTTAAAGAA TATGCCATCG GAAATAAAAA CTTTTCTACA GCTGCACTAA TTGCTACCCT TGTGGCTACC CAGGCTTCTG GTAGCATGTT TCTTAATGGA CTAGAACAAA CTTACACCAA TGGGTTATAC TATGTCCTGG CAAGTGTAAT AGGAAGTTTC TTGGGGTTAT TAATTACAGG GTACATCATA GGGCCTCGTT TAGGGAACTT TTTAAACTGC GTTTCTATAG CCGATATCAT GAGCAGCGTT TATGGTAAAG GTGTACAATT GATTACTGCT ATCTGCACCG TATTAGCAAC AATAGGCTAC ATAGCTATTC AATTTAAAGT AATATCAAGA ATTTTGGAGG CCCTATTTAA CTATCAAGGA CCTGAAGTTA CTGTTATTGC AGCTACCATT ATTATTATTT ATTCAGCGTT TGGAGGCATT AAATCCGTTA CTTTTACAGA TGTCATTCAA TTCATTACTT TTGGCACTTT GCTTCCTGTT TTAGCCTTGG CCATTTGGCA CCACATTCCC AATATTAGCC AGGTAATACA TACCCTCTCA GAAAATCCTA ATTTTAATTT TGAACACGTG ATCAGTTGGA GTCCTCAGTT TATAGGGACC TTGGCTTTTA TGTTTGTGCT TATAGTTCCT GGTTTACCTC CTGAGATATT TCAGCGTATG GCCATGGCAC GAAATACTAG TCAGATAAAA CGTGCTATCG CCTATGCGGC GGTACTTTGG GTTTTAATAC AACTATTTAT AGTCTGGATC GCTGTTTTAC TTTTGTCTGA TAATCCACAT TTAACTACCA ATCAAGTAGT AGGGTATATG ATTAAGACAT ACACCTATAC AGGCCTTAAA GGATTTTTAG GGGTAGGTGT TATTGCTTTG GCTATGTCCA CAGCAGACAG TGCGCTAAAC TCTTGTGCGG TACTAGTTGC TAATGATATT CTCCCTCCTT TAAAAATTAC TAAACAAGCC TCTGTACAAG CAGCTGCTAT TGCGACTTTT GCCATAGGAT TTTTGGCTGT CCTATTAACG CTCTCTATTC AAAATATTTT ACAAATTATA TTGCTCTTTT CTAATTTCTC TTTGCCTATC CTCACTATCC CCATGTTGCT TACTATCTTC GGATTTCGTA CTAGTAGGCG AGTTATTTAC ATAGGTATGG GGGCTGGTTT TGTTATGACT GCTCTTCTAT TACTCTATTT CAAAAATGTT AATAGTTTCT TTCCCGGTAT GATGGCTAAC CTCATTTTCC TAGTAGGTAG CCACTATTTG CTCAAAGAAG AAGGGGGTTG GATAAAACAA GAAACAGAAG CAGGGATAGA AGCCATTACC TGGAAAGATC GATGGACTCA ATTAAAGTCC TTTAACTTAC CAGCTTACCT TGAAAAAAAT CTGCCTGAAC AAGAGTATTA TTATCCTTTG CTAGCCTTTT ATTCACTTAT TGCTACGTAT GTCTCGCTAT ACCATTTACC ACATGTTATC GAGCAAGAGT ATCTAATGCT TTATAGAACC ATCCAATATT CCGTACTTGT GATTACTACA AGTTTATTGG GCTTTCAAAT TTGGCCAGCT CCACTAAAAA ACAAGCGATT ATTAGCATGG GTGTGGCCTT TACTTATCTT CTATACGCTC TTTTTTGTAG GAGGCATGAT TGTGATTATG AGTGGATTTC AATCAAGTCA AATTTTAATA TTTATGCTAA GCCTTGTGAT GATTGCCTTA TTTACTTATT TGCCATTAGC GCTCACATTA GCTATAGCTG GAATAATAGT TGCTGCAGGG GTATTTAAGT GGGTTACAGG CCAAGACATC TCACCTAATC AAGCAATACC TATTTCTTTT CATTTTAGTC ATGGGTTGCT CTTATTTAGT AGTTTACTCA TAGCCTTATT TAGATTTAAA CAAAGTAAAG AAACTCTGCA AAAGAAGAAT GCTTACCTAA TGACTCTTAA TAACCAGTCT AAAGAAGAAA TGCAGGAGGT ACTTAGATAT AGGGAGGAAA TACTAAAAGA CCTTACTAAA GAAGATACGC TTCTTTTTGA TGAGACGATG GCTGCTTATA TGAGGCAGGC TATTTATCGC ATGACAGATT ACATGCGACT CGATGTAAAA AGGATAAGTA TGGAGGAGTT AATTTCTGAA GTTAGAGAAA CTATCAAGTT AAGTGACTTT CCACAAATGC CAGACCTTTT AATCCAGAAT AAATCCAAGC AACCTACTAT AGAAGTAGAT AGGGAAAGAA TAAAGCAAAT GCTTGCAAAC AGTATTACAT ACCTTTATCA AAGTAGCAAT GCCACAAGGC CCATTACAAT TGCCTTAGAA GATGCCAAAT TAGGCTACAG TGTTGCCCAT ATGCAAAACT ATGCGCGGGA ATCAAATGCC ATTAGGATTG TTTTAACAAT AGAAAAGTCT TTACCAAAAA TACAACCTAT TTACAATCTC AATCACAATT CTATAGTGGA CCAAGTGGCC CGTGATAGAA ACCGCCGTAT GCTCTTGCAA AATGCTCGTA TTGTAGATGC TCATTATGGT TATGCAGAAC TAGACAGCCA TATTATTACC CATTTATATG TTATTCCTGT AAACCTCCGA GAGGTACGAG GTAAGGTCAT GGAAATTATT AGAGAGCCTG CTGTTGTGGA TCCAGCAGAG TTAAATCACC CCCTGTCAAT TAAGGTTGAA GAAGAGCTCA AAAATAAATT AACTGCTAAA GGAATAGATT TTAAACCTAT AAGTAAAGCA CTCAATGTAA TAAAAAGATA CCATGCAGGA GTAAAAAGAA AGTCAGGAGA GCCTTTCTTT ACGCATCCTA TTGCTGCTGC TTTAATCCTT TTAGAATACT GCCAGGACCC AGATGCAGTT ATAGCAGCAC TTTTGCATGA TACAGTGGAA GACACCGGGC TTTCTCTTAT TCAGATAAAG ATTATGTTTG GAGAGACAGT CGCTTTTTTA GTGCAAAAGG TGACTAATTT AGAAGAAAAT AGAAAACGCC TGATGCTTGA AGATCATGAA AACATTGCAC GTCTTATCAA TTATGAAGAC AAGCGAGCAG CCTATGTAAA ACTAGCTGAT CGCATGCATA ACATGCGCAC CATCAGTGGC CATTCTTCCC TTGCCAAACA GAAGCATATT GCTTCTGAAA CCTTAAATTT CTTTGTACCT CTGGCAAAAA ATCTAGAATT GACAACTGTA TCACAGGAAT TGGAAATATT AAGTCTAGAA GTACTAAAGA AAAAATAA
|
Protein sequence | MIDIIIFIVF LLLNLIIGIL YRGKQQSFKE YAIGNKNFST AALIATLVAT QASGSMFLNG LEQTYTNGLY YVLASVIGSF LGLLITGYII GPRLGNFLNC VSIADIMSSV YGKGVQLITA ICTVLATIGY IAIQFKVISR ILEALFNYQG PEVTVIAATI IIIYSAFGGI KSVTFTDVIQ FITFGTLLPV LALAIWHHIP NISQVIHTLS ENPNFNFEHV ISWSPQFIGT LAFMFVLIVP GLPPEIFQRM AMARNTSQIK RAIAYAAVLW VLIQLFIVWI AVLLLSDNPH LTTNQVVGYM IKTYTYTGLK GFLGVGVIAL AMSTADSALN SCAVLVANDI LPPLKITKQA SVQAAAIATF AIGFLAVLLT LSIQNILQII LLFSNFSLPI LTIPMLLTIF GFRTSRRVIY IGMGAGFVMT ALLLLYFKNV NSFFPGMMAN LIFLVGSHYL LKEEGGWIKQ ETEAGIEAIT WKDRWTQLKS FNLPAYLEKN LPEQEYYYPL LAFYSLIATY VSLYHLPHVI EQEYLMLYRT IQYSVLVITT SLLGFQIWPA PLKNKRLLAW VWPLLIFYTL FFVGGMIVIM SGFQSSQILI FMLSLVMIAL FTYLPLALTL AIAGIIVAAG VFKWVTGQDI SPNQAIPISF HFSHGLLLFS SLLIALFRFK QSKETLQKKN AYLMTLNNQS KEEMQEVLRY REEILKDLTK EDTLLFDETM AAYMRQAIYR MTDYMRLDVK RISMEELISE VRETIKLSDF PQMPDLLIQN KSKQPTIEVD RERIKQMLAN SITYLYQSSN ATRPITIALE DAKLGYSVAH MQNYARESNA IRIVLTIEKS LPKIQPIYNL NHNSIVDQVA RDRNRRMLLQ NARIVDAHYG YAELDSHIIT HLYVIPVNLR EVRGKVMEII REPAVVDPAE LNHPLSIKVE EELKNKLTAK GIDFKPISKA LNVIKRYHAG VKRKSGEPFF THPIAAALIL LEYCQDPDAV IAALLHDTVE DTGLSLIQIK IMFGETVAFL VQKVTNLEEN RKRLMLEDHE NIARLINYED KRAAYVKLAD RMHNMRTISG HSSLAKQKHI ASETLNFFVP LAKNLELTTV SQELEILSLE VLKKK
|
| |