Gene Aasi_1009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1009 
Symbol 
ID6376953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1311226 
End bp1314576 
Gene Length3351 bp 
Protein Length1116 aa 
Translation table11 
GC content34% 
IMG OID642682129 
Producthypothetical protein 
Protein accessionYP_001958090 
Protein GI189502373 
COG category[E] Amino acid transport and metabolism
[K] Transcription
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases
[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.804907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCATA TTTCTATCGA TGTTATTTTA TTTAGTATTT TCTTGCTTAT CAACCTGGCA 
ATAGGACTGT TGGCTGGTGG ACGTGCAAAG AATTTGAGAG AGTATTCTAT TGGTAATAAA
AATTTTTCTA CAGCTGCACT TACTTCTACG ATTGTAGCCA CATGGATCAG TGGTAGCTTC
ATGACCTTTA AATTAACAAA AATTTATAGT GAAGGATGGT ATTTTATACT AGCTATTATT
TGTGATAATT TTACTTTACT GTTTACAGGG TTATTTTTAG CTGCAAGAAT GGGGGAATTT
TTAAAGAATA ACTCTGTAGC AGAGGCACTA GGCGATTTAT ATGGTAAGTC TGTACGTATT
ATTACAGCCA TTTGTGGCAT ATTGATTAGT ATAGGTGGCG TAGCTATACA ATTTAAAGTA
AGTGCTAAAG TACTAAGTAT CTTATTTGGA GTAAATGATA TTTATTCAAC CATAGCAGCT
GCTAGTATTG TAATTATATA TGCCGCCCTT GGAGGAATAC GTGCTGTTAC ATTTACTGAT
ATCATGCAAT TTTTTGCTTT TGGTACCTTT ATTCCACTTT TAACATTAAC CATTTGGAAT
GGAATTAAAG ATCATACTGC CATCATACAT GCCATTCAGA ATAGCCCACA GTTTGATTAT
AATACTTTGA TAGGTTCTCC TAAAAGGTTG GCTAGTTGTT TTGCCTTAAT GTTGTTCTAT
TTTATTCCAG GATTTGAGCC TGCTATATTC CAAAGAGTGA CTATGGCTAT TAATACTGGA
CAAGTAAAGC GGTCTTTTAC TTACTCATTC TTTCTATGTA CCGCTATTAC TTTATTTTCT
ATTTGGATAG GTATTTTAAT TTTAGGAACT AATTCTCAGT TGCAGCCTGA TCAAATATTT
AATTATATAG TCAATAGCTA TACTTATCCA GGGCTTAAAG GGCTTATACT CATAGGAACC
ATGTCTATGG TTATGTCTAC AGCAGACTCC CATATTAACT CTGCTGCAGT CTTATTTGCT
AATGATATTA TAAAGCCATT AAAGCTTGCA GTAAGCCAAG AAGTAAGAAT TGCTAAAGTA
TTTACTTTTT TTCTAGGTGC ATTAGCATTG TTACTAGCCT TATACAAAAC TGATTTCTTA
GAGCTTGTAC TTTTAGCATG GGGCCTTTAT ATGCCAATTG TCACAGTACC TTTATTATTG
GCTGTGTTTG GCTTTCGTAG TACCACTAAG CCTGTGCTCA TTGGTATGGC AGCAGGTTTT
ATAACTGTAC TGTTATGGGA TAAGCTGTTA GCAGATACAC AAATCAATAG TGTTATCCCA
GGCATGTTAG CTAATCTGGT ATTTTTAATG GGTAGCCATT ATATGTTGAA GTCAAGCGGA
GGATGGGTAG GCATTAAAGA TCCATACCCA TTGTTGGTTG CTAGGCAAGA AAGGCAAGAC
GCATGGAAAA AAATTATTGA CGCTATAAAA AACCCTAATA TTTTATCTTA TCTCAAACGA
AATCTTCCTG AAAAAGAAAT TATTTATTCT CTTTTTGGGC TATATGTTTT AGGAGCTACT
TATGCTTCAT TTTTTACAAT TTCTACAGAA GTTGTAGCTA ACTACCAAAA ACTGTATGAT
TATATTGCAC ATTCTGCACT TATTATTACT GCTTGTTTCA TTACTTATCC TGCTTGGCCA
CATACTCTTA AAAATAATCG TTTCATAGCT GTTGCTTGGC CATTAGCCAT TTGCTATATA
CTTTTTATAG CTGGTACAAT ACTCATGGTC ATGAGTGGTT TTCATCAAGT ACAGGTTATG
CTTTTTATAT TAAATATAGT GTTAACAGCT TTGTTGTTGG ACTGGAAGCT AATGCTTGTA
GTAGTTATTA GCGGTATATT GTCAAGTATA GGAGTTTTTT ATTTTTATAT AGGTTATATC
CCTATAATAA AAGTTGATGG GGTTTATTTA CAGTTTAAAG CTATCTATGG GCTTCCACTT
TTTATAAGCT TCTTATTAGC AATTATTAGC TTTAGGCAGA CAAAAGATCA ATTAGTTGAT
CAAACTAACT ATTTATTGCT AGCACAGAAG AAATTTCAAG ATAGACTTGT AGAGGTGGCC
AACTACAGAG AAGAACTTCT AAAAGAGCTA CAACCAGAAG AATTAAAAAT CTTTGACCAA
GCCACTTCCG CCTATCTTAA GCAGGCTATT TACCGGGTGC GTGACTATGT ACGCTTAGAC
GTAAGTGAAA GTTATATAGA CAAGTTACTG TCTGAAGTCA AAATTTTAGT TAAAGTACAT
GCAATAAAAC CTCGTCCTCA AATATTAATT AGAAATTATA CAAGTCATAA GAAGCTTCAC
GCAGATATAC CTAAAATTAA GCAACTCCTT GTTAATAGTA TCAGCTATAT ACAAAGCTAC
AATCTTCATA ATGTACCTAT TATCATAACT ATAGAAGATA CTAGATTGGG GTATGAGCTT
TCTCATATAG AAGGGTATGC TAAAAAGGTA GAGGCTTTAC GTATTACGGT GACGATTGAA
AGACAAATAC CTGCTCTTCA GGAAGTTTAT ATTAGCAAAG AGCAAGAAAA CTTTAATAAT
AGCTTATCTA TGACAGTAGA AATACTTCCT TTAGTAGAAA ATGTACGTAT TATCGATGCA
CACTATGGAT ATATAGACCA GGATCAAAAT AGACTACACA TGTACGTTAT CCCTGTTAAT
GTACGAGAAG TGCGAGGAAA GGTAATGGAG CTGATTAAAA AGTCAGCAGC AGTAGATCCT
TTGGAACTAG TCCATCCTTT AGCAGTCCAG TTAGAAGATG AATTGATAAC CAAGTTGCAA
GACACATCAA TAGATATAAC AGTTATTAAA AGAACCCTAG AGGTAATCAA GAAATATCAT
GGAGGTACCA AACGGCATTC AGGAGAGCCT TATTTTACAC ATCCTATAGC TGTTGCCATA
ATTTTGCTAG ATTATACACA AGATCAGGAT GCAATTGTGG CAGCTTTATT ACATGACACA
GTGGAAGATA CTAGTTTATC TCTTGCTCAT ATACAAGCTA TGTTTGGAGA GAAGGTAGGG
TTTTTGGTTG GAAAAGGTAC TAATCTAGAA AGTAAACTTA AAAGGATAAA TTTAGTAGAT
CATGAAAATT TACATAGGCT GATGAATTAT GAGGATGAGC GAGCAGCCTT AATAAAGTTA
GTAGATAGGT TGCATAATAT GCGTACGATA GAAGGGCATC CCTCTTTGAC TAAGCAAAAG
AGAATAGCTG GCGAAACATT AGCTTTTTTT GTGCCTATGT CAAGGCACTT ACGGCTAGAT
ACTTTAGCAC AGGAATTAGA AAAACTCAGT GTAGCAGTTT TAGGTAAATA G
 
Protein sequence
MPHISIDVIL FSIFLLINLA IGLLAGGRAK NLREYSIGNK NFSTAALTST IVATWISGSF 
MTFKLTKIYS EGWYFILAII CDNFTLLFTG LFLAARMGEF LKNNSVAEAL GDLYGKSVRI
ITAICGILIS IGGVAIQFKV SAKVLSILFG VNDIYSTIAA ASIVIIYAAL GGIRAVTFTD
IMQFFAFGTF IPLLTLTIWN GIKDHTAIIH AIQNSPQFDY NTLIGSPKRL ASCFALMLFY
FIPGFEPAIF QRVTMAINTG QVKRSFTYSF FLCTAITLFS IWIGILILGT NSQLQPDQIF
NYIVNSYTYP GLKGLILIGT MSMVMSTADS HINSAAVLFA NDIIKPLKLA VSQEVRIAKV
FTFFLGALAL LLALYKTDFL ELVLLAWGLY MPIVTVPLLL AVFGFRSTTK PVLIGMAAGF
ITVLLWDKLL ADTQINSVIP GMLANLVFLM GSHYMLKSSG GWVGIKDPYP LLVARQERQD
AWKKIIDAIK NPNILSYLKR NLPEKEIIYS LFGLYVLGAT YASFFTISTE VVANYQKLYD
YIAHSALIIT ACFITYPAWP HTLKNNRFIA VAWPLAICYI LFIAGTILMV MSGFHQVQVM
LFILNIVLTA LLLDWKLMLV VVISGILSSI GVFYFYIGYI PIIKVDGVYL QFKAIYGLPL
FISFLLAIIS FRQTKDQLVD QTNYLLLAQK KFQDRLVEVA NYREELLKEL QPEELKIFDQ
ATSAYLKQAI YRVRDYVRLD VSESYIDKLL SEVKILVKVH AIKPRPQILI RNYTSHKKLH
ADIPKIKQLL VNSISYIQSY NLHNVPIIIT IEDTRLGYEL SHIEGYAKKV EALRITVTIE
RQIPALQEVY ISKEQENFNN SLSMTVEILP LVENVRIIDA HYGYIDQDQN RLHMYVIPVN
VREVRGKVME LIKKSAAVDP LELVHPLAVQ LEDELITKLQ DTSIDITVIK RTLEVIKKYH
GGTKRHSGEP YFTHPIAVAI ILLDYTQDQD AIVAALLHDT VEDTSLSLAH IQAMFGEKVG
FLVGKGTNLE SKLKRINLVD HENLHRLMNY EDERAALIKL VDRLHNMRTI EGHPSLTKQK
RIAGETLAFF VPMSRHLRLD TLAQELEKLS VAVLGK