Gene Aasi_0449 details


Gene Information

Locus tag: Aasi_0449
Symbol:
ID: 6377245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 527486
End bp: 530866
Gene Length: 3381 bp
Protein Length: 1126 aa
Translation table: 11
GC content: 37%
IMG OID: 642681610
Product: hypothetical protein
Protein accession: YP_001957589
Protein GI: 189501872
COG category: [E] Amino acid transport and metabolism
              [K] Transcription
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases
        [COG0591] Na+/proline symporter
TIGRFAM ID:
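
The length fields in this table are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions (an assumption; the record does not state its coordinate convention). A minimal Python sketch of that arithmetic follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(527486, 530866)
print(length_bp)                  # 3381
print(protein_length(length_bp))  # 1126
```

Both results match the Gene Length and Protein Length values listed in the record.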


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.812136
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCCCT ATTACATAAC AAATACTATT ACTATGAATT ACTTCAGCAT AGATCTTCTT 
ATTGTATATG CGTTTTTAGC TACTACCCTG ATTATAGGCA TTCGGGCAGG TAGAGGCATT
AAGGATATTC GTGAGTATGC TATTGGAAAT AAAATGTATG GGGCAGCTAT ACTTGTTTTT
ACTTTTTTAG CCACTAATAT AGGTGGAGCT AGCACGCTTA ATGCTGCGGC GGATGTTTTC
TCAAATGGTA TTATTAGAAC TCTTGCTACC CTAGGAGTTA TCATACAAAT TTTCATATTT
GCTATAATTA TTGTGCCCCA TATGAAACAC TTTACTAACC ATTTGACTAT GGGTGATGTA
ATGGGTAGCT TATATGGGAG ATATAGCCAG ATCCTTACAG GTATATTAGG AACGCTGTAC
TCTTTTTGTA TGATTGGTAT GGAACTGTTT ATGCTTGGTA TTGTCTGCCA ATCATTGTTA
GGCATACCAG CCAGTTGGGG AATTATAGTG GGTGGACTTT TATTAACAAT TTATTCGGCT
TATGGAGGTA TTAAAGCAGT AACTGCTACT GATGTATTTC AATTTCTTAT TTTATTTATA
GTTATTCCAT TGATAGCTAG TATAGCCATT AACAAGGTAG GGGGTATTAA GGAAGTGTTC
CTTCAAGTTC CAGCAGATAG GTTTAAGGTA TTTACACATG AAAAATTCTC TTTTTATCTA
ACCTTATTCT TGATTTGGAG TGTTTTACCC TTAGGGCTTG TAAGTCCTCC TATTTTTCAA
CGACTTTTGA TGGCCAAACA AACAGAACAG CTACGTAAAC AATATTTAAT AGTGGCTGGT
TTAGATCCTT TGTTACGCAT AACCATTATG TTGATTGGAC TAGCTGGACT AGTGCTTTAT
CCATACATTC AGGCAGCTGA TGTTATGCCC CATATTATAA AAGAACTCTT GCCAATAGGT
ATTAGAGGAT TGGCTATAGC TGGCATGTTG GCTGTGGTTA TATCTACGGC TGATTCTTAT
TTACATACTG CTGGTTTATT GCTTGTTCAT GATGTTATTA AGCCTATTTT TGGACAAAAG
AAGATTTTTT TTAATGAATT GCATTGGACT AAATATTGTA CTTTTCTCAT AGGTACAATA
AGCATTATAA TAGGTTTAAA ATCGACTAAT CCTCTTAGCT TAAGTTTTGG AGCCATGCGC
ATAGCAGGCC CTGTTTTATT GTTTCCACTA CTAGCCGGAA TTATAGGGCT TAAACCAGAT
AAGAAGCCTT TTTATGTGTC CATGGTAATT ACCGTGTTTA CCTTTATTAT AACCACTTTC
TACTTGCCTA AAAGTCAAGC GCATTTCGGA GTGCCTATTA GTATTGTTGT TAATGCGATC
AGCTTTTTTA TTGTTCATCT TATACAAAAT CGAGGCGTTG CTATGGTTGA TAGAAATTAT
AGTGAGTCTT CCAGAATAAT ACCAAGCGCA GGTAGTAAAA CCATCCAAGA TCAGCTCAAG
TCCTATATCC CTACATTATC TAATATTATT CGATATTCAC AACAACGTGT TCGACAATAT
GGGGCTCCTT ATATCTTATT TGGGATATTT TTTACCATTA ACTTTACTTA TCCATATTTC
ATGTGGAGTT CTAGCGGACT GCAAGCCCCT AATCTAATGC TTGCATTACG ATTGGTAGGA
GCGTTCGCTT GTGGGCTTTT AATTGTGCAA TCGAAATGGC CTAAATCTTT ACTTCCTTAT
ATGCCTACCT ATTGGCACTT AACCATACTT TATTGTTTGC CTTTTATGAG TACTATGATG
TTTCTACTTA CCCAAGGTAG CACAGAGTGG CTCATTAATA TTGCCATTGT GATCATCCTG
CTTTTTATAC TGGTAGATTG GGTTACTGCT ATGATACTAG GCATCCTTGG TGTAAGCTTG
GCAGCTATAT TTTATAAATT ATTTGTCGGA GCAATACATT TCTCGTTAGA TTTTTCTTCT
AAATACTTGC TGCTATATCA AGGTATTTTT GGGTTGTTTA TTGGCCTTAT TTTTGCCCGT
AGAAAAGAAC AAAGGTTCGA CTTTCTTCTA CAACGCAACC AACAACTCAC CGAAGTCCAG
CAAAAAAACC GTGCAGAGCT GGCAGAAACT CTTGCCTACA GAGAACAACT GTTTCAAGAG
CTTAATCCAG ATGAAGCAGC CCTTTTTGAT GAGGTAACTA CGGCTTATAT CAAGCAAGCG
ATTTATCGTA TGACTGATTA TATGCGCTTA GATGTAACGT CTATAAGTCT AGATGAACTG
CAAAAAGCAT TATCAGACGC CTATAAGCTG CAAGGTATTG AACAATCTGA GCTTCTGTTT
CATAAAGATA CAAAACAGAT GGCTCTCCAA GCGGACGTTG CCAAGTTAAA GCAACTTTTA
CTTAATGCGA TCAACTACAC ACAACAATAT AACACTGATA ATAACCCTAT TACCATATCC
ATAGAGGATG CCTTACTAGG ACATGATATA GCTCACATGC AAAACTATAC GCGTAAATTG
GAAGGGTTAA AAATAACTAT TACCACAAAA CAAGCTTTAC CTCTAACCCA GCCTATTTAT
AAAATAGATC CTGCTAAATC TAGCACCTGG GTACCTCAGC ATGAAGATGA ATTCTTATTG
GTAGAAAATG CCCGGATTAT CGATGCGCAT TATGGATATA TGTATGCCAA ATCAAGGCAC
ACACAAGTGT ATGTATTCCC TGTTAAGCTA AGGGAAATTC GTGGCAAGGT GATGGAGCTT
ATCAAAGAGT CAGCAGCTGC AGACCCAGGA GAATTGAGCC ATCCGCTAGC CATACAACTC
GAGCAAGAAC TTTTAAAGAA GCTTAAAGGG ACACAAGTAG ATATAGTCCT TATTCAGAAG
GCGCTAGATA TCATCAAGAG ATACCATGGA GGTGTAAAAA GAAAATCAGG AGAACCCTTT
TTTACTCATC CTATAGCTGT AGCACTAATT TTATTAGAAT ATTCACAAGA TCAAGATGCT
ATTTTAGGAG CTTTGTTGCA TGATACAGTA GAAGATACTA GCCTATCACT CGCGCATATT
CGTATGCTTT TTGGAGAAAC AGTGGCATTT TTAGTAGCCA AAGCAACTAA TCTAGAAGAT
CGTGAGCGAC GAATAAGCTT AACCGATAAA GAAAATCTAG CTCGAATTCT AAACTATGAA
GATCCTAGGG CACCTCTGAT AAAATTATCA GATCGGTTGC ATAACATGCG TACCATCCAG
TTTCACTCTT CGGTAGCTAA ACGTAAATAT ATTTCTCAAG AGACATTAGA TTATTTTGTG
CCATTAGCAA GAAAATTAGG TTTAGAAAAA ATGTCTGTTG AACTGGAGCA GCTAAGCCGA
GCAATAGTAT TTAATAATTA A
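
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks must be removed before it is measured. The sketch below shows one way to do that and to compute a GC fraction in Python. The helper names are hypothetical, and only the first line of the listing is used as input to keep the example short, so its result describes those 60 bases and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing only (60 bases).
first_line = "ATGGCTCCCT ATTACATAAC AAATACTATT ACTATGAATT ACTTCAGCAT AGATCTTCTT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # 30%
```

Pasting the complete listing into `clean_sequence` gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields of the record.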
 
Protein sequence
MAPYYITNTI TMNYFSIDLL IVYAFLATTL IIGIRAGRGI KDIREYAIGN KMYGAAILVF 
TFLATNIGGA STLNAAADVF SNGIIRTLAT LGVIIQIFIF AIIIVPHMKH FTNHLTMGDV
MGSLYGRYSQ ILTGILGTLY SFCMIGMELF MLGIVCQSLL GIPASWGIIV GGLLLTIYSA
YGGIKAVTAT DVFQFLILFI VIPLIASIAI NKVGGIKEVF LQVPADRFKV FTHEKFSFYL
TLFLIWSVLP LGLVSPPIFQ RLLMAKQTEQ LRKQYLIVAG LDPLLRITIM LIGLAGLVLY
PYIQAADVMP HIIKELLPIG IRGLAIAGML AVVISTADSY LHTAGLLLVH DVIKPIFGQK
KIFFNELHWT KYCTFLIGTI SIIIGLKSTN PLSLSFGAMR IAGPVLLFPL LAGIIGLKPD
KKPFYVSMVI TVFTFIITTF YLPKSQAHFG VPISIVVNAI SFFIVHLIQN RGVAMVDRNY
SESSRIIPSA GSKTIQDQLK SYIPTLSNII RYSQQRVRQY GAPYILFGIF FTINFTYPYF
MWSSSGLQAP NLMLALRLVG AFACGLLIVQ SKWPKSLLPY MPTYWHLTIL YCLPFMSTMM
FLLTQGSTEW LINIAIVIIL LFILVDWVTA MILGILGVSL AAIFYKLFVG AIHFSLDFSS
KYLLLYQGIF GLFIGLIFAR RKEQRFDFLL QRNQQLTEVQ QKNRAELAET LAYREQLFQE
LNPDEAALFD EVTTAYIKQA IYRMTDYMRL DVTSISLDEL QKALSDAYKL QGIEQSELLF
HKDTKQMALQ ADVAKLKQLL LNAINYTQQY NTDNNPITIS IEDALLGHDI AHMQNYTRKL
EGLKITITTK QALPLTQPIY KIDPAKSSTW VPQHEDEFLL VENARIIDAH YGYMYAKSRH
TQVYVFPVKL REIRGKVMEL IKESAAADPG ELSHPLAIQL EQELLKKLKG TQVDIVLIQK
ALDIIKRYHG GVKRKSGEPF FTHPIAVALI LLEYSQDQDA ILGALLHDTV EDTSLSLAHI
RMLFGETVAF LVAKATNLED RERRISLTDK ENLARILNYE DPRAPLIKLS DRLHNMRTIQ
FHSSVAKRKY ISQETLDYFV PLARKLGLEK MSVELEQLSR AIVFNN
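
The protein sequence can be checked against the gene sequence by translating it codon by codon. The record specifies translation table 11, which assigns the same amino acids to codons as the standard genetic code and differs in the start codons it permits. The Python sketch below builds that codon table and translates the first and last lines of the gene listing; the `translate` helper is hypothetical and does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G;
# third position varies fastest). Shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listing.
print(translate("ATGGCTCCCT ATTACATAAC AAATACTATT ACTATGAATT ACTTCAGCAT AGATCTTCTT"))
# MAPYYITNTITMNYFSIDLL
print(translate("GCAATAGTAT TTAATAATTA A"))
# AIVFNN
```

The two outputs match the first 20 and the last 6 residues of the protein sequence listed above. The last line can be translated on its own because the 3360 bases before it are a whole number of codons.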