Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0446 |
Symbol | |
ID | 6377230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 523722 |
End bp | 525902 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642681607 |
Product | hypothetical protein |
Protein accession | YP_001957586 |
Protein GI | 189501869 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.722173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTATT TCAACATAGA TTTTCTGATT GTATATGCGT TTTTAGCTAT TACCCTGATT ATAGGCATTC GCGCAGGTAG AGGCATTAAG GATATTCGTG AGTATGCTAT TGGAAATAAA ATGTATGGGA CTCTCACCTT AACACTTACA TTTTTAGCTA CTAATATAGC AGGGATAAGT ATTATGGATG GTGCTTCAGG GGTCTTTTTC AATGGAATTG TTAGGATTAT TCCAGAAATA GGTGTAGTTA TACAAATCCT ATTTTTTGCT TTTTTTATAA CTCCTAAAGT ATTACAATTT AAAGCTGCCC TCACCTTAGG GGATGTAATG GGAGACCTGT ATGGTAGGGT TAGCAAAACC ATTGCTGGAA TACTGGGGCT CTTTTATTCT ATATCCATGG TTAGCATGGA ATTATTAGGA TTAGGTATTA CCATTGAAGC ACTCTTAGGT TTTCAAGCTA GTTGGACTAT TATTATAGGA GGTGTGTTTT TAGCACTGTA TTCAAGCTAT GGGGGTATTA AGTCAGTTAC TATCACCGAT GTATTTCAAT TTTTAATACT TATTATAGTT ATTCCACTCT TAGCTAATAT AGCTCTAAAA CATGTAGGAG GAATCAAAGT GGTATTTGCT AGTCTTCCTC CAACAAAGTT AGAAATATTT AATCACGAGA ATTTTTCTTA CTATCTTACT CTTTTCTTGT TATGGAGTAT TTTCCCTGTA GGAATTACTA GCCCTCCCAT TTTTCAAAGG TTACTAATGG GACGAGATGC ACAACAGCTA CGTAACCAAT ATTTTATTGT AAGTGTTTTT CATCCTACTT TTCAGTTATT AATTATGTTA ATTGGCTTAG CTGGATTAGT CTTATATCCA ACTATTAAAG CTAATAATAT TATTCCCCAT ATCATTCAAC AACTATTGCC TGCAGGTGGC AAAGGGTTGG CTATAGCAGG TTTACTGGCG GTAATTATGT CTACAGCTGA TTCTTATTTA AATGCCGCTG GGTTAGTATT TGCTCATGAC ATTGTTAAGC CAGTCTATGA TCGAAATGGC TTGAAAATTG ATGAACTAAA ATGTGCTAGG TACAGTACAG CTATTATAGG TATAACAGCT ATTGTCATAG CTTTGAAATC TACAAGCATG CTAGGACTAA GCTTTTTAGC TGTCAAGTTC ACAGGCCCTT TACTTATGTT CCCACTCATA GCAGGCATTA TGGGATTGAA GGTTGATAAG CAAACTTTTT ACACGGCATC ACTTACTACA CTAGGGGTAT TTGTATTCAT CAGTTGGCTA CTGCCAATAG CTTATGGTCA TTTAGGAGTG CCTATTAGTA TTTTATCTAA TGGCATCACC TTCTTTAGCA TGCATGTTAT AAAAAATAAA GGATTTGCTA TTGCAAAACA GCTGCAGTCT ACTATTGAGG TTAATATGCG CTGGCAGTTA CGCAGCAGGT CTATATTGGC TAAGCTCAAA CAATTTTTGC CTACACCTAC TAATCTGCTA GCTTATTCCA GGAATAAAGT AGATATGTAT GGAGCCCCTT ATGTTTTGTT TGGTGTATTA TTGGCCATCA ACTATATCTT GCCTTATTTC GCCTGGACTT ACGAAGCTCC ACAAACATAC AATACGCTGT TGATGATTCG TTTCATAGGA GCAGTTTTAT GTGGATTGTT GATTGTCAAA GAGAAATGGC CCCGTTTTTT ACTGCCTTAC TTACCTACTT TTTGGCATTT AACTGTACTC TATTGTCTAC CCTTTACCAA TACACTACTA TTTTTAAATA CCCAAGGAAG CATAGAATGT ATAGCTAATG TTGCCATAAC AACGCTCTTT CTTATTATTG TAGTAGATTG GATGAGTTTT GCTATACTTA TGAGCTTAGG GATTTACTTG GTCTGTGTAT TTTTTCAGTA TTTTTTTGGG AAAATAGAAC TGCCTCTTAG CTTTAGTTTG CAATATCTGT TGGTGTATCA ATTTATTTTC ATCACACTCA TAGGACTGCT ATTTGTACGT CGTAAGCCCA TTAAGAAAAC TAGTGCGTCC GATCGTTTTG CAGGCACAGA ATTAGGCGAG CAAATGGAAT TTGCCATACA GGTTACTAGC CGATTGGTAG ACCACCTGTT TTTTTATTTC AAGAACACCC ATCAAGTCTG GAGCAACGAT AGGGATGGAT TTATACATTA G
|
Protein sequence | MNYFNIDFLI VYAFLAITLI IGIRAGRGIK DIREYAIGNK MYGTLTLTLT FLATNIAGIS IMDGASGVFF NGIVRIIPEI GVVIQILFFA FFITPKVLQF KAALTLGDVM GDLYGRVSKT IAGILGLFYS ISMVSMELLG LGITIEALLG FQASWTIIIG GVFLALYSSY GGIKSVTITD VFQFLILIIV IPLLANIALK HVGGIKVVFA SLPPTKLEIF NHENFSYYLT LFLLWSIFPV GITSPPIFQR LLMGRDAQQL RNQYFIVSVF HPTFQLLIML IGLAGLVLYP TIKANNIIPH IIQQLLPAGG KGLAIAGLLA VIMSTADSYL NAAGLVFAHD IVKPVYDRNG LKIDELKCAR YSTAIIGITA IVIALKSTSM LGLSFLAVKF TGPLLMFPLI AGIMGLKVDK QTFYTASLTT LGVFVFISWL LPIAYGHLGV PISILSNGIT FFSMHVIKNK GFAIAKQLQS TIEVNMRWQL RSRSILAKLK QFLPTPTNLL AYSRNKVDMY GAPYVLFGVL LAINYILPYF AWTYEAPQTY NTLLMIRFIG AVLCGLLIVK EKWPRFLLPY LPTFWHLTVL YCLPFTNTLL FLNTQGSIEC IANVAITTLF LIIVVDWMSF AILMSLGIYL VCVFFQYFFG KIELPLSFSL QYLLVYQFIF ITLIGLLFVR RKPIKKTSAS DRFAGTELGE QMEFAIQVTS RLVDHLFFYF KNTHQVWSND RDGFIH
|
| |