Gene Aasi_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0804 
Symbol 
ID6376999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1018660 
End bp1020141 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content35% 
IMG OID642681946 
Producthypothetical protein 
Protein accessionYP_001957909 
Protein GI189502192 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.288732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAC TTAAAAAGTT GGCTAGTGAT ACAGCTATAT ATGGACTTAG CAGTATTATA 
GGTAGGGTGC TTAACTATCT GCTAGTACCA TTTTATACTA GCTTGCTTTT GCCTGCTGAA
TATGGTATTG TTACCGAATT ATATGCGTAT GCTGCTTTTT TGAATATTAT TTATGGCTAT
GGGATGGAAA CAGCCTATTT TAGGTTTGCT ACGCAAGGTT CTCCCATAGA GGTATTTAAA
CTTACCAGTA GCTTGTTAAC TTTAAGTAGT CTATTATTTT CAAGCTTATT AGCATCTCTC
GCTCCTCTCC TTAGTCGTTG GTTAGGCTAT TCAGGCCATG AACATTATGT TTACTACTTG
GCAGCTATTT TGGCTGTTGA TACCATATTG TTGGTTCCTT TCGCACAGTT ACGTTTTTCT
AACCAATCAT TTTTGTTTGC CCAAGCAAAA TGTTTACAAA TAGCCTTAAA TATAATTTTT
AATCTTTTGT TGCTGTATAT ACTTCCAGGA ATCTATACAG GTAAGTTTTT GTACTCATTC
AAGCCTTTCG TACAACTTAT CTATAATCCA GCCAACCATA TAGAATATAT TTTCTTAGCT
AATTTAATGG CTAATTTATG CGTGTTACCT ATTTTGGGTA AGCCACTCAT CCATTTTAAA
TTTAAAATAG ATTGGCAAAA GCTAAGGCCT ATGATTATAT ATGCTTTGCC TTTATTGGTT
ATGGGGTTAG CTGGAACTAC CAACGAAATG CTGGCTAGGG CTTTGCTGAA GCATCTATTA
CCATCCAATT TTTATTCCGG ACAGAGTAAG GAGGCAATAG TAGGTATTTT TGGAGCTTGC
TATAAGCTTG CTGTCTTAAT GTCGCTAGCA ATCCAAGCCT TTCGTTATGC AGCTGAACCT
TTTTTTTTCA CACATGCACA AGACAAGCGC TCTCCTCAGC TTTTTAGTAA AATTATGCAA
GGATATGTAT TGGTCGCTTG CTTCATCTGG TTTGCTATTA GTGTTAACTT AGATATATTA
GGTTATATAT TTCTTAGAAA CCCAGCATAT CGGGCAGGCA TTGAAATTGT TCCTTACCTT
TGTCTAGCAT ACATATGGTT AGGCATCTAT TATAATCTTT CAGTGTGGTT TAAGCTAGCT
AACAAAACAT ATTATGGTAG TGTCATAACT CTTATAGGAG CAGGTATTAC TATACTGTTG
AATGTTTTAT TAGTACCTTA TTATGGGTAT TGGGGTAGTG TATGGGCAAC TGTAATCAGT
TACCTAATTA TGGCTGTAAT TTGTTATTGT AAAGGACAGC AATACTACGC TGTTCCTTAT
AAGACTGGTT ATGCACTATT TTTTATGCTA GTTACACTAC TTTTGATAAT AGTAATACGT
CAAATACAGT ATGCTACTTG GGCTTATGCT TTGGTTAGTA ATATAGGGTT TACACTTGTA
TTCGGGCTGG TTATATATAG AGCTATGCGC AGATCTTTAT AG
 
Protein sequence
MNALKKLASD TAIYGLSSII GRVLNYLLVP FYTSLLLPAE YGIVTELYAY AAFLNIIYGY 
GMETAYFRFA TQGSPIEVFK LTSSLLTLSS LLFSSLLASL APLLSRWLGY SGHEHYVYYL
AAILAVDTIL LVPFAQLRFS NQSFLFAQAK CLQIALNIIF NLLLLYILPG IYTGKFLYSF
KPFVQLIYNP ANHIEYIFLA NLMANLCVLP ILGKPLIHFK FKIDWQKLRP MIIYALPLLV
MGLAGTTNEM LARALLKHLL PSNFYSGQSK EAIVGIFGAC YKLAVLMSLA IQAFRYAAEP
FFFTHAQDKR SPQLFSKIMQ GYVLVACFIW FAISVNLDIL GYIFLRNPAY RAGIEIVPYL
CLAYIWLGIY YNLSVWFKLA NKTYYGSVIT LIGAGITILL NVLLVPYYGY WGSVWATVIS
YLIMAVICYC KGQQYYAVPY KTGYALFFML VTLLLIIVIR QIQYATWAYA LVSNIGFTLV
FGLVIYRAMR RSL