Gene Aasi_1080 details


Gene Information

Locus tag: Aasi_1080
Symbol:
ID: 6377559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1391821
End bp: 1393620
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 36%
IMG OID: 642682193
Product: hypothetical protein
Protein accession: YP_001958154
Protein GI: 189502437
COG category: [R] General function prediction only
COG ID: [COG3500] Phage protein D
TIGRFAM ID: [TIGR01646] Rhs element Vgr protein
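
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1391821
end_bp = 1393620

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1800
print(protein_length)  # 599
```

Both results agree with the listed Gene Length (1800 bp) and Protein Length (599 aa).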


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.752206
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACTT CCGCAACTTC TAGCGACTTA GTTAATTATT TTATTAAAGT TGGCACTAAA 
GAAATAGATA AAGAATTTCC TATTTTTTCT ATTACTGTAA ATAAGACCAT TAATAAAATT
CCTTATGCAA AAATTCTCTT GCATGATGGA GATCCTGCGG AACAGTCGTT TCTAATTAGT
AATAGTAACA TATTTGAAAT AGGAAAAGTT GTAGAAATTC AATTAGGCTA TCAGTCTGAC
TTAACGGTTG TATTTGAAGG TGTCATTACC AAACATAGTT TAAAAGCTAA TAACTACACA
AATCCCTATT TAGAAGTATA TTGTAAAGAT ATTGCTTACC AAACTACCTT AGTTCCCAAA
ACAATTAGTT TTGCAGACAC TACTGATAGT GGAGCTTTGG AGAACATATT GAAGAACTAC
GAAGGCGATA TAGAGCAACA TATTAGCTCT ACAACAATCA AACATGAAAT TCTAGCACAA
CAAGATACTA CGGACTGGGA TTTTATTAAC GTGCGTGCAG AAGCTAACGG ACAAGTAGTA
ATAGTAGATG ATGGGACCCT TACTACAAAA AAGCCCAATA GTAGCCAAAC ACCTAGCTAC
ACATTTACTT ATGGGGTTGA TATTTATGGG TTAGATTTAG AAATGGATGC CAGTACACAA
TGGCAACAAG CAGGCGGCAA AATATGGAAA CATGATGAAC AAGCATGTGA AGTTGTTAAT
GCAAGTACTG TGGGAGAAAA ATCGTTTGGT GCTACTTCAC ATGCTAAACT AGCAGGTACT
AACAAACAGG AACCTATTAC GTTTATTCAT GGAGGTAATG TTACTAAAGA AGAGATGAAA
AGCTTCTCCA CAGGCCTGTT GGAGCTTAAC AGGTTAGCAA AAATCAGAGG AAAAGTGACT
GTGCAAGGAA TTGCTGATCT TAAACCAGAC AGTACAATCA AAATTGATAA AGGGGCTGAT
AATTTTGAGG GCAATGCTTA CGTAAGTGGG GTGCACCACC GGTTAGAAGG AGGACAATGG
TTTACAGATA TAACTGTGGG CCTGCCTAAC GAACGTTATA TGCGTAAATA CAACAATATA
GCAGGCCTCC CAGCAGCCGG AATGCTTTGT CCTGTCTACG GATTACAAAT AGGTATAGTA
AAAGAATTAT ATAAACAAGA AGACCCAGAC CCTAATTACC GAATTTTTGT AAATATCCCT
ATCATTCACC AACCGAACGA GGGTATTTGG TGTAGAGTAG CTTCATTTTA CGCTTCTAAA
GGCATAGGTG CCTTTATTAT GCCAGAAAAA GAAGATGAAG TTATCATTGG CTTTGTTAAT
GATGATTTTA GATCACCGGT CATAGTAGGC TCTCTATATA GTGGCAGTAA ACATAAAACT
CCTATTCAGC AAGACCCGGA AAATAATATC AAAGCACTGG TAACCAGAAG CAAGTTAGAG
ATGACTTTTA ATGATAAAGA TAAAGCAATT GTATTTCAAA CTCCAGGAGG AAGGACTATT
AGCATTTCTG ACAAAAGTGG TACAATAGAA ATTACGAATG GTAACGCGAA CAAGATAATT
CTAGGTAAGC AGAATGTTGA AATTATTAGT AATAAAGATA TAATCTTAAA TGCCAAAGGA
AGTATCAACT TACAGGCAAC AGACAGTGTA CAGATAAAAG GAAACAATAA AGTAGAACTT
AGTGGCATGA ATGTATCTGC CAACGCAACA ATGAAAGCTT CATTGGTAGG CAATTCAAGT
GCGCAGGTAC AATCTAGTAT GTCTACTATA ATAAAAGGAA CTATAGTACA AATAAATTAA
 
Protein sequence
MSTSATSSDL VNYFIKVGTK EIDKEFPIFS ITVNKTINKI PYAKILLHDG DPAEQSFLIS 
NSNIFEIGKV VEIQLGYQSD LTVVFEGVIT KHSLKANNYT NPYLEVYCKD IAYQTTLVPK
TISFADTTDS GALENILKNY EGDIEQHISS TTIKHEILAQ QDTTDWDFIN VRAEANGQVV
IVDDGTLTTK KPNSSQTPSY TFTYGVDIYG LDLEMDASTQ WQQAGGKIWK HDEQACEVVN
ASTVGEKSFG ATSHAKLAGT NKQEPITFIH GGNVTKEEMK SFSTGLLELN RLAKIRGKVT
VQGIADLKPD STIKIDKGAD NFEGNAYVSG VHHRLEGGQW FTDITVGLPN ERYMRKYNNI
AGLPAAGMLC PVYGLQIGIV KELYKQEDPD PNYRIFVNIP IIHQPNEGIW CRVASFYASK
GIGAFIMPEK EDEVIIGFVN DDFRSPVIVG SLYSGSKHKT PIQQDPENNI KALVTRSKLE
MTFNDKDKAI VFQTPGGRTI SISDKSGTIE ITNGNANKII LGKQNVEIIS NKDIILNAKG
SINLQATDSV QIKGNNKVEL SGMNVSANAT MKASLVGNSS AQVQSSMSTI IKGTIVQIN
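
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tooling that generated this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons), and this gene starts with ATG. Only the first line of the gene sequence is used here as an excerpt.

```python
# Sketch only: helper names are hypothetical.
# Codon-to-amino-acid assignments shared by NCBI tables 1 and 11,
# enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCACTT CCGCAACTTC TAGCGACTTA GTTAATTATT TTATTAAAGT TGGCACTAAA"
print(translate(first_line))  # MSTSATSSDLVNYFIKVGTK
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the listed GC content and protein length to be checked in the same way. The sketch does not handle ambiguous bases such as N.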