Gene Aasi_1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1444 
Symbol 
ID6377494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1867809 
End bp1871321 
Gene Length3513 bp 
Protein Length1170 aa 
Translation table11 
GC content37% 
IMG OID642682513 
Producthypothetical protein 
Protein accessionYP_001958462 
Protein GI189502745 
COG category[E] Amino acid transport and metabolism
[K] Transcription
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases
[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCTT ATTACATAAC AAATACTATT ACTATGAATT ACTTCAGTAT AGATTTTTTA 
ATTGTATATG CTTTCTTAGC TATTACACTT GTTATAGGCA TACGTGCAGG TAGAGGTATT
AAAAATATTA GAGAATATGC CCTGGGCAAT AAAATATATG GAGTGGCAAT ACTTGTTTTT
ACCTTTTTAG CCACTAACAT AGGTGGAGGC AGTACACTTA ATGCTGCTAG TGACGTTTTT
GCAAATGGCA TTATTGGGGC TATTGCTAGC TTTGGTGTAA CTATCCAAAT TCTTCTTTTT
GCTATTTTTA TTGTACCTCA CATCAAGTAT TTTACAACTC ACTTAACAAT AGGAGACGTA
ATGGGCAGTC TCTATGGAAA GTATAGTCAA ATACTCACAG GTATACTAGG CACCTTGTAT
TCTTTTTGTA TGATCGGTAT GCAACTATTT GTACTTGGTA TTATTTGTAA GTCATTATTA
GGTGTACATA CCAACTGGGG AATCATGGTC GGAGGATTAC TATTAACTAC TTATTCAGCG
TATGGAGGCA TTAAAGCAGT TACAGCTGCG GATATCTTTC AATTTTTAGT ACTATTTATT
GTTATTCCTC TAATAGCAGG AATGGCACTT AATAAAGCAG GAGGCCTTCA AGCAGTATTT
ACTCAAATAC CAGCAGCTAA ACTTGAAGTA TTTACTCATA AAAAGTTCTC TTTTTATTCA
ACCCTCTTTT TAATCTGGAG CGTTCTACCT TTAGGACTTA TTAGCCCTCC AATCTTCCAA
CGGCTTTTAA TGGCTAAACA TCCAGAACAA CTGCGTCAAC AGTATTTTAT TTTAGCTGGA
CTAGATCCTC TTATACGTGT AACTATTATG ATTATAGGAT TAGCTGGTCT AGTGCTGTAC
CCACAAATCC AGTCGGCTGA TGTTATGCCT CACATTATTA AGAAGCTCTT ACCAATAGGG
CTTAAAGGCT TTGCCATAAC AGGTGTGCTT GCTATTGTCA TATCGACAGC TGATTCTTAT
TTGCATACGG CTGGGTTATT GTTAGTGCAC GATATTATAA AGCCTATTAT TGGCCGAAAG
CGAATTTTTC TAAATGAATT GAAATGGGCT CAGTATAGTA CTTTTTTAAT AGGTGTTGTA
AGTATTATAA TAGGCTTGAG TTCCACCAAT GCCCTTGACT TAAGTTTTGG AGCTATGCGT
ATAGCAGGGC CAGTGCTATT GTTCCCTCTG CTAGCTGGGA TAATAGGCAT TAAGTCTGAT
AAAAAATCTT TTTATGGGGC CATGGTAGTG ACGCTCATTA CCTTTGCAAT AACTACCCTA
CTTTTACCTA AATCTCATAG CCCTTTAGGG ATGCCCATTA GTATTGTAGT CAATGCAATC
AGCTTTTTGG TTATTCACTT GATACAAAAT GGAAAAATTG TGATGGTAGA TAGGAGTGCT
TATGAGATCT CTAACACTTT AGCAAAACCT CATCCTAAGT CTATCCAAAA ATGGCTAAGC
CATTACTTAC CCAATTTATT GAATATCATT CAGTACTCTC AAGAACGTAT CAAGCAGTAC
GGGGCCCCTT ATATCCTATT TGGAATATTT TACGGAATTA ACTTTACTTA TCCCTATTTT
ATGTGGAGTG CTAGTAGCTC GCCTGCTGCT AATTTAATGC TTACACTTAG GCTAATAGGA
GCCCTAGCCT GTGGACTATT AATTGTCCAA GCAAAATGGC CTAAAAGGCT GCTTCCTTAT
ATGCCCGCTT ATTGGCATGT GACCATCCTG TATTGCTTGC CTTTTATGAG CACGATGATG
TTTATGCTAA CGCAAGGTAG CACAGAATGG CTCATTAATA TAGCTATTAT GATTATTCTG
CTTTTCATTC TAGTGGATTG GGCTAGTGCC CTAGTTCTTG GAAGTTTAGG TATTATACTC
GCTTTTGCTT GCTACTCCTT ATTTGTAGGT AAAATAAACT TATCCTTAGA CTTTTCTTCG
AAGTATTTAT TATTCTACCA AACTATCTTT GGACTTCTTA TTGGGCTTAT ATTTGCTCGC
AGAAAAGAGC AGCGTTTTGA TAGGTTAGCT ACCGATAATC AAACATTAAC GCTTGTCGAT
CAAGAAAACA AAGAAGCCTT ATTAGAAATT TTTAAAGAAA AAATACGCCT ACTTAAAACA
CTTAAGCGAG CTGGAGTACA AGACCTTACA AAAGCGGTGA GCTTAGTAAA AGAACTACAT
ATACAAGAAA AACAAGGCTT TAAGGAGGCA ACAGTTGTAC GCAATACACT TAATCAATTA
CAAAATACAC TTACCCCCAT GGCCGTAGCT TTAGAGCGAA TCGAAAGTAG AGCTACTGAT
TATATGAGAT TGGAAATTAA GCCGATTGCT ATAGGTAATT TACTAGCAGC CGTGCAAGCT
AAGTTTTCCG ATACAAAACT TTACATTAAA AATAGCAGTG TTTGCCAAGA ACTGATTTGC
GACCCTAAAC ACACTCAAAA AATGCTAGTG AATGGTATAG AAGCATTAAA AGCTTCCAAA
GAGGAAGAAG AAACTATTTA TATCACTTTA GCCGATACAG CGCTTACTTA TCCGCTTCCT
TCTGTTAAAA AAGATAAAAG TTATATAAAA AGAGTATCAG CTCTTGCTTT TATTTTAAGT
ACTACACCTG ATTTTCCTAG TATCCAACCA GCTTACAAAG CTCAAATGCA TACTGGTGCT
TTGCCTATGC CGGAAACACC AGTATCATTC TTGTTAGTTT CTAATCAGCG TATTGTAAAG
GCGCATTATG GCTATACGAA TATAAACATT AGTAAGCAAG AAAATTATGC TATGCATTGC
TATGTTTTGC CTACCCGTGT TAGTGATGTA AGGCCTCGTG ATATGGATGA TCCTTATATG
GAGCTAGGTG CTGAGTTAGT AAGAGCTGAT GATAGCTTCC CAGGAGCTCT TGAGCAAGAA
AAAACTTTTC TAGCTGCTGT CAAGCAAAAG AGTAATGCCA ATCTAACTGC TATAGAAACA
GCCATTGAGA TGATCAAGTG GTATCATGGG CCTGTAAGAC GTAAATCAGG AGAACCTTTT
TATTTACATC CCTTAGCAGT GGCGCATATT GTGCTAGACT ATAATACAGA TGAATCCACC
ATCTTAGGGG CTTTACTCCA TGATACAGTA GAAGACACGC CTATGCTCTT AGAAAATCTG
GAAATGATGT TTGGGAAAGA GGTAGTGAGT ATTGTAGCTG GGGTAACGCA TTTTGAAAGT
ATGCAAGATA GCTTTTACAA GGTTCAACTA GCCCCACACG AAAATATAAT GATGCTTTTG
GGCGTAGAAG ACAAGCGAGT ATTATATGTT AAGATAGCAG ATCGTATGCA TAATATACGT
ACCATTGAAG GTCATAGTTC TTATGCTAAG AAAAAGCAAA TTGCAGAAGA AACACTTCAA
TTTTTTGTGC CGCTAGCTCA AAAATTAGAT CTAGAAGCAG CTGCTGCAGA ATTACAAGAA
CGAAGTGTAG CCGTTATTAA TCAGCAAAAA TGA
 
Protein sequence
MAPYYITNTI TMNYFSIDFL IVYAFLAITL VIGIRAGRGI KNIREYALGN KIYGVAILVF 
TFLATNIGGG STLNAASDVF ANGIIGAIAS FGVTIQILLF AIFIVPHIKY FTTHLTIGDV
MGSLYGKYSQ ILTGILGTLY SFCMIGMQLF VLGIICKSLL GVHTNWGIMV GGLLLTTYSA
YGGIKAVTAA DIFQFLVLFI VIPLIAGMAL NKAGGLQAVF TQIPAAKLEV FTHKKFSFYS
TLFLIWSVLP LGLISPPIFQ RLLMAKHPEQ LRQQYFILAG LDPLIRVTIM IIGLAGLVLY
PQIQSADVMP HIIKKLLPIG LKGFAITGVL AIVISTADSY LHTAGLLLVH DIIKPIIGRK
RIFLNELKWA QYSTFLIGVV SIIIGLSSTN ALDLSFGAMR IAGPVLLFPL LAGIIGIKSD
KKSFYGAMVV TLITFAITTL LLPKSHSPLG MPISIVVNAI SFLVIHLIQN GKIVMVDRSA
YEISNTLAKP HPKSIQKWLS HYLPNLLNII QYSQERIKQY GAPYILFGIF YGINFTYPYF
MWSASSSPAA NLMLTLRLIG ALACGLLIVQ AKWPKRLLPY MPAYWHVTIL YCLPFMSTMM
FMLTQGSTEW LINIAIMIIL LFILVDWASA LVLGSLGIIL AFACYSLFVG KINLSLDFSS
KYLLFYQTIF GLLIGLIFAR RKEQRFDRLA TDNQTLTLVD QENKEALLEI FKEKIRLLKT
LKRAGVQDLT KAVSLVKELH IQEKQGFKEA TVVRNTLNQL QNTLTPMAVA LERIESRATD
YMRLEIKPIA IGNLLAAVQA KFSDTKLYIK NSSVCQELIC DPKHTQKMLV NGIEALKASK
EEEETIYITL ADTALTYPLP SVKKDKSYIK RVSALAFILS TTPDFPSIQP AYKAQMHTGA
LPMPETPVSF LLVSNQRIVK AHYGYTNINI SKQENYAMHC YVLPTRVSDV RPRDMDDPYM
ELGAELVRAD DSFPGALEQE KTFLAAVKQK SNANLTAIET AIEMIKWYHG PVRRKSGEPF
YLHPLAVAHI VLDYNTDEST ILGALLHDTV EDTPMLLENL EMMFGKEVVS IVAGVTHFES
MQDSFYKVQL APHENIMMLL GVEDKRVLYV KIADRMHNIR TIEGHSSYAK KKQIAEETLQ
FFVPLAQKLD LEAAAAELQE RSVAVINQQK