Gene Aasi_0355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0355 
Symbol 
ID6376774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp416213 
End bp419548 
Gene Length3336 bp 
Protein Length1111 aa 
Translation table11 
GC content34% 
IMG OID642681524 
Producthypothetical protein 
Protein accessionYP_001957508 
Protein GI189501791 
COG category[E] Amino acid transport and metabolism
[K] Transcription
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases
[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0552395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGATA TAGCCATTTT TGTTCTCTTC TTACTTGCCA ATCTTGCTAT AGGTATTTTT 
TACCGTGGTA AATCGAAGTC CTTTCAAGAA TATAGCATTG GCGATAAAAA ATTTTCTACA
GCTACCCTAA CTGCTACTAT GGTAGCTACT TGGGCTTCAG GAGGATATTT ATTTAATTCT
TTGGAGGAAA ACTATGCTAG TGGTTTAACG TTTATCTTGC CAGCCGTGTT AGGAAGTGCC
TTAGGTCTAC TTATTGTTGG TTATATTATA GGGCCACGAT TAGCACCCTT TTTAAATAAT
GTTTCTATGG CTGATGCTAT GGGAAGTATA TATGGTAAAT CTATACAAGT TATTTTTGCT
ATTAGTAGTG TTTTAAGTAG CATAGGTTTA ATAGCTATTC AATTTAAAGT AATTTCTAGG
ATTCTACCTA TTCTTTTTGA TTATAAAGGC CCTGAACTTA CTATTATAGC TGCTATGATT
ATTACTGTTT ATGCGGCTTT TGGTGGAGTT AAAGCTGTTA CTTTTACAGA TGTAGTGCAG
TTTATTACTT TTGGAACATT ACTTCCTGTT TTGGCGCTTG CTGTTTGGCA TCACATACCC
AATCCGAAAC AAATCACTGA TGTTTTGGTT CATAGTCCTA ATTTTAATTT TAGCAATGTA
ATAGGTTGGA ATTCTAAGTT TGTAGATGCG TTAATTTTAA TGTATTATTT TATGTCTCCT
ACTATACCAC CTATACTTTT TCAACGTTTA TTAATGTCCC GGAATGTTAC TCAAATGAAA
CGTTCAATTA CCTATACAAC AGGTATCATG CTGTTAATAG ACCTACTTAT CATATGGATA
ACCATATTAA TTTTAGCAGA TAACGCAAAT CTTCAAAGTT CCAAAATTAT TCAATATATA
GTAGATACAT ATATGTATCC AGGACTAAAA GGTTTATTTG GAGTAGGTGT TATTGCATTA
GCTATGTCTA CTGCAGATTC TGCTTTGAAT GCCAGTTCAG TAATTGTTAC TAATGATATC
TTACCACCAT TAGGAATTAC TAAGAAACCT TTGGTGTCTA CAGCTTCTTT AGCAACTTTA
GTTATAGGTG GGTTAGGATT ATTATTGTCC TTATCTATCC AAAATATTTT AAAAATACTT
CTGTTTTCTG CTAACTTTTA TATGCCTTTA GTAGATGTGC CTGTACTACT AACTGTTTTT
GGATTTAGGA CAAGTAAACG ATCTATGTTG ATTGGGATTA CAGCAGGCTT TGTAACAACT
GCCTTTCTAA TGATTTTCTT CAAGGATGTT AATAGCTTTG TTCCAGGTAC ACTTGCCAAC
TTAATCTTCT TGCTAGGTAG TCATTATCTT TTAGAAGAGC CAGGAGGATG GAGAAAAATA
GAAAAAATAG AGCCGCTTGA TATACGACAA GCATATCCTA AAACGTGGAA AGATAAGCTA
ATAGCATTTA GAAATATTAG GCTAGTTGTT TATCTAGAAA AGAGTCTTCC TAATAAAGAG
TATTATTATC CCTTATTTGC ATTCTATCTG CTTACAGCTA CCTATGTATC TTTATATAAT
GTTCCTCATG GTATAGAAAA AGAGTACCTA GTTATATACA GAACTATTCA ATATTCAATT
CTTATTATTG CTACTAGCTT GTTCTCTTTT CATATTTGGC CTACAACTTT AAAAAACAAA
ACTTTTTTAA GCTGGTTATG GTCCTTTATT GTTTTTTATA GCTTGTTTTT TGTGGGGGGT
ATATTAGTAA TCTTAAGTAA ATTTCAGTCA GATCAAGTAT TAATATTCAT GCTAAACTTG
GTTATGGCTG TATTGTTGCT GTATTGGCCT GTGGCTATTA CATTGGCACT AAGCGGCGTA
GTTGCCGCTG TCTTACTATT CAAATGGGGT TTACACTCTC ATTTAGTTGC TGGTGGATTG
TCACAAATTT CCTTCCGATT AGGCTATGGA GTACTGCTAT TTAGCAGTTT CCTAATTGCT
TTATTTAGAT ATAAACAGGC CCATGCTAAG TTAACAGCAC GCCAGCAGCT TTTAGAAAAA
ATCAACCAAG AAACCAATAC GCGTTTATTA AAAGCATTAC AATATAGAGA AGAGTTACTA
GAAGAGTTGA AGCCGGACGA AGTAGCTCTA TTTGATAGTA CTACTGCTAG CTATATTAAT
CAAGTTATAT ATAGGGTAAG AAATTACTTA CGTTTAGAAG TAAGTGAAGC TACTTGCCAA
AAGCTTATAG AGGAAATGAT GGCTACCCTG GAGCTACAAG AGTTAAAAGT GCTCCCTAAA
ATCATAAAAG AAACTAAACA TACCAGTTTA CAAGGAGATA TAGATAAGAT TTTAAAGTTG
CTTGTAAATG CTGCCTTTTA CTTACAAAGG TATCAGAAAT CTAATCAACC TATACTGTTA
GGTTTAGAAG ATGCTACTTT AGGTTATGAA GTATCCTATA TAAAAGATTA TATAAAGGAA
ATAGATGCTT TAAAGATAAT TTTTACAACA GACAGCAAGC TGCCTGTCAC TCCAGAATTG
TATAAAATTA TTCCTGATAA GCCAGTCATG TATTTGCCTC ACAGTGAAGA AGATCTAAAC
TTAACCGAAA ATGCACGCAT CATAGATGCA CATTATGGTT ATTTCGAAGT ACTCTCATTA
CCTACAGGTT ATACACATAT TTATGTATTA CCTGTTAATG TGCGCGAAGT AAGAGGAAAA
GTAATGGAAC TGATTAGGAA ACCTGTGGCT GCTGATCCAG AAGAACTAGC TCACCCTTTT
TCCATACAAG TAGAAAAGGA GCTATTTGAG AAATTAGAAG GTACTAAAGT AGACAAATAT
ACAATCACTA AAGCATTAGA TTTAATCAAA AGATATCATA GCGGTGTTAA ACGAAAATCA
GGCGAACCGT TTTTTACGCA TCCTATAGCA GTAGCCTTGA TAGTATTACA GTACTCGCAA
GACCAAGATG CCATAATTGC TGCTTTACTC CATGATACGG TAGAAGATAC CAGCATAAGT
TTATCACAAT TAGAAGCTAC CTTTGGGACT ACTGTTGCCT TTTTGGTACG CAAAGCAACT
AATCTGGAAG ATAAGCTAAA AAGAATTTCG TTAGCTGATT ATGAGAATAT ACAGCGCTTA
ACATACTACG AAGACTCGAG AGCTGCTTTA GTAAAGCTAG CTGATAGGTT GCATAATATG
CGTACGGTCA AAGGACATTC TTCACTTACT AAACAAAAGA ATATAGCCAG TGAAACCTTA
AACTTCTTTG TGCCTTTGGC TAACTATCTT AAGTTAACTG ATATAGCACA AGAGTTGGAA
AAGTTAAGTT TAGAAGTATT AGCTAAAAAA GGATAA
 
Protein sequence
MIDIAIFVLF LLANLAIGIF YRGKSKSFQE YSIGDKKFST ATLTATMVAT WASGGYLFNS 
LEENYASGLT FILPAVLGSA LGLLIVGYII GPRLAPFLNN VSMADAMGSI YGKSIQVIFA
ISSVLSSIGL IAIQFKVISR ILPILFDYKG PELTIIAAMI ITVYAAFGGV KAVTFTDVVQ
FITFGTLLPV LALAVWHHIP NPKQITDVLV HSPNFNFSNV IGWNSKFVDA LILMYYFMSP
TIPPILFQRL LMSRNVTQMK RSITYTTGIM LLIDLLIIWI TILILADNAN LQSSKIIQYI
VDTYMYPGLK GLFGVGVIAL AMSTADSALN ASSVIVTNDI LPPLGITKKP LVSTASLATL
VIGGLGLLLS LSIQNILKIL LFSANFYMPL VDVPVLLTVF GFRTSKRSML IGITAGFVTT
AFLMIFFKDV NSFVPGTLAN LIFLLGSHYL LEEPGGWRKI EKIEPLDIRQ AYPKTWKDKL
IAFRNIRLVV YLEKSLPNKE YYYPLFAFYL LTATYVSLYN VPHGIEKEYL VIYRTIQYSI
LIIATSLFSF HIWPTTLKNK TFLSWLWSFI VFYSLFFVGG ILVILSKFQS DQVLIFMLNL
VMAVLLLYWP VAITLALSGV VAAVLLFKWG LHSHLVAGGL SQISFRLGYG VLLFSSFLIA
LFRYKQAHAK LTARQQLLEK INQETNTRLL KALQYREELL EELKPDEVAL FDSTTASYIN
QVIYRVRNYL RLEVSEATCQ KLIEEMMATL ELQELKVLPK IIKETKHTSL QGDIDKILKL
LVNAAFYLQR YQKSNQPILL GLEDATLGYE VSYIKDYIKE IDALKIIFTT DSKLPVTPEL
YKIIPDKPVM YLPHSEEDLN LTENARIIDA HYGYFEVLSL PTGYTHIYVL PVNVREVRGK
VMELIRKPVA ADPEELAHPF SIQVEKELFE KLEGTKVDKY TITKALDLIK RYHSGVKRKS
GEPFFTHPIA VALIVLQYSQ DQDAIIAALL HDTVEDTSIS LSQLEATFGT TVAFLVRKAT
NLEDKLKRIS LADYENIQRL TYYEDSRAAL VKLADRLHNM RTVKGHSSLT KQKNIASETL
NFFVPLANYL KLTDIAQELE KLSLEVLAKK G