Gene Aasi_0446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0446 
Symbol 
ID6377230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp523722 
End bp525902 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content36% 
IMG OID642681607 
Producthypothetical protein 
Protein accessionYP_001957586 
Protein GI189501869 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.722173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTATT TCAACATAGA TTTTCTGATT GTATATGCGT TTTTAGCTAT TACCCTGATT 
ATAGGCATTC GCGCAGGTAG AGGCATTAAG GATATTCGTG AGTATGCTAT TGGAAATAAA
ATGTATGGGA CTCTCACCTT AACACTTACA TTTTTAGCTA CTAATATAGC AGGGATAAGT
ATTATGGATG GTGCTTCAGG GGTCTTTTTC AATGGAATTG TTAGGATTAT TCCAGAAATA
GGTGTAGTTA TACAAATCCT ATTTTTTGCT TTTTTTATAA CTCCTAAAGT ATTACAATTT
AAAGCTGCCC TCACCTTAGG GGATGTAATG GGAGACCTGT ATGGTAGGGT TAGCAAAACC
ATTGCTGGAA TACTGGGGCT CTTTTATTCT ATATCCATGG TTAGCATGGA ATTATTAGGA
TTAGGTATTA CCATTGAAGC ACTCTTAGGT TTTCAAGCTA GTTGGACTAT TATTATAGGA
GGTGTGTTTT TAGCACTGTA TTCAAGCTAT GGGGGTATTA AGTCAGTTAC TATCACCGAT
GTATTTCAAT TTTTAATACT TATTATAGTT ATTCCACTCT TAGCTAATAT AGCTCTAAAA
CATGTAGGAG GAATCAAAGT GGTATTTGCT AGTCTTCCTC CAACAAAGTT AGAAATATTT
AATCACGAGA ATTTTTCTTA CTATCTTACT CTTTTCTTGT TATGGAGTAT TTTCCCTGTA
GGAATTACTA GCCCTCCCAT TTTTCAAAGG TTACTAATGG GACGAGATGC ACAACAGCTA
CGTAACCAAT ATTTTATTGT AAGTGTTTTT CATCCTACTT TTCAGTTATT AATTATGTTA
ATTGGCTTAG CTGGATTAGT CTTATATCCA ACTATTAAAG CTAATAATAT TATTCCCCAT
ATCATTCAAC AACTATTGCC TGCAGGTGGC AAAGGGTTGG CTATAGCAGG TTTACTGGCG
GTAATTATGT CTACAGCTGA TTCTTATTTA AATGCCGCTG GGTTAGTATT TGCTCATGAC
ATTGTTAAGC CAGTCTATGA TCGAAATGGC TTGAAAATTG ATGAACTAAA ATGTGCTAGG
TACAGTACAG CTATTATAGG TATAACAGCT ATTGTCATAG CTTTGAAATC TACAAGCATG
CTAGGACTAA GCTTTTTAGC TGTCAAGTTC ACAGGCCCTT TACTTATGTT CCCACTCATA
GCAGGCATTA TGGGATTGAA GGTTGATAAG CAAACTTTTT ACACGGCATC ACTTACTACA
CTAGGGGTAT TTGTATTCAT CAGTTGGCTA CTGCCAATAG CTTATGGTCA TTTAGGAGTG
CCTATTAGTA TTTTATCTAA TGGCATCACC TTCTTTAGCA TGCATGTTAT AAAAAATAAA
GGATTTGCTA TTGCAAAACA GCTGCAGTCT ACTATTGAGG TTAATATGCG CTGGCAGTTA
CGCAGCAGGT CTATATTGGC TAAGCTCAAA CAATTTTTGC CTACACCTAC TAATCTGCTA
GCTTATTCCA GGAATAAAGT AGATATGTAT GGAGCCCCTT ATGTTTTGTT TGGTGTATTA
TTGGCCATCA ACTATATCTT GCCTTATTTC GCCTGGACTT ACGAAGCTCC ACAAACATAC
AATACGCTGT TGATGATTCG TTTCATAGGA GCAGTTTTAT GTGGATTGTT GATTGTCAAA
GAGAAATGGC CCCGTTTTTT ACTGCCTTAC TTACCTACTT TTTGGCATTT AACTGTACTC
TATTGTCTAC CCTTTACCAA TACACTACTA TTTTTAAATA CCCAAGGAAG CATAGAATGT
ATAGCTAATG TTGCCATAAC AACGCTCTTT CTTATTATTG TAGTAGATTG GATGAGTTTT
GCTATACTTA TGAGCTTAGG GATTTACTTG GTCTGTGTAT TTTTTCAGTA TTTTTTTGGG
AAAATAGAAC TGCCTCTTAG CTTTAGTTTG CAATATCTGT TGGTGTATCA ATTTATTTTC
ATCACACTCA TAGGACTGCT ATTTGTACGT CGTAAGCCCA TTAAGAAAAC TAGTGCGTCC
GATCGTTTTG CAGGCACAGA ATTAGGCGAG CAAATGGAAT TTGCCATACA GGTTACTAGC
CGATTGGTAG ACCACCTGTT TTTTTATTTC AAGAACACCC ATCAAGTCTG GAGCAACGAT
AGGGATGGAT TTATACATTA G
 
Protein sequence
MNYFNIDFLI VYAFLAITLI IGIRAGRGIK DIREYAIGNK MYGTLTLTLT FLATNIAGIS 
IMDGASGVFF NGIVRIIPEI GVVIQILFFA FFITPKVLQF KAALTLGDVM GDLYGRVSKT
IAGILGLFYS ISMVSMELLG LGITIEALLG FQASWTIIIG GVFLALYSSY GGIKSVTITD
VFQFLILIIV IPLLANIALK HVGGIKVVFA SLPPTKLEIF NHENFSYYLT LFLLWSIFPV
GITSPPIFQR LLMGRDAQQL RNQYFIVSVF HPTFQLLIML IGLAGLVLYP TIKANNIIPH
IIQQLLPAGG KGLAIAGLLA VIMSTADSYL NAAGLVFAHD IVKPVYDRNG LKIDELKCAR
YSTAIIGITA IVIALKSTSM LGLSFLAVKF TGPLLMFPLI AGIMGLKVDK QTFYTASLTT
LGVFVFISWL LPIAYGHLGV PISILSNGIT FFSMHVIKNK GFAIAKQLQS TIEVNMRWQL
RSRSILAKLK QFLPTPTNLL AYSRNKVDMY GAPYVLFGVL LAINYILPYF AWTYEAPQTY
NTLLMIRFIG AVLCGLLIVK EKWPRFLLPY LPTFWHLTVL YCLPFTNTLL FLNTQGSIEC
IANVAITTLF LIIVVDWMSF AILMSLGIYL VCVFFQYFFG KIELPLSFSL QYLLVYQFIF
ITLIGLLFVR RKPIKKTSAS DRFAGTELGE QMEFAIQVTS RLVDHLFFYF KNTHQVWSND
RDGFIH