Gene Aasi_0298 details

Gene Information

Locus tag: Aasi_0298
Symbol:
ID: 6377735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 344014
End bp: 345192
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 34%
IMG OID: 642681479
Product: hypothetical protein
Protein accession: YP_001957464
Protein GI: 189501747
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
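
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(344014, 345192)
print(length)                  # 1179
print(protein_length(length))  # 392
```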


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.343578
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGG ATGGTACAGA AATTTATACA AAAGAAACTA AACCTAACAT TAAATCAATA 
ACCTTAAGGT ATCTAAAATT ACTATTTAGA TGGGTCAATA AATACAAATG GTTGTTATTA
CTACTGTTAC TATTTTTGTT TATTATTTCA GTTAGAAAGG AAAAGCCACA AGTTATAACC
TTACAAGGGC AAGCATTAGG GAGAAATTAT ACTGTACAAT ATAAAGTAAA AGGAGATGCT
AACTACCAAA CTGAGATTGA AGCTTTGTTA GCTGATGTAT CACAAGCGTT AGATATTTCC
AACAAAGATT CAGAGGTAGC TAAGTTTAAT AGACATAACT GTACAGCATT TCATTTTGAG
TCTCCTTATT TATATCCTAT CCTTGATAAA AGCAAAGAAA TATATAATAG AACACAAGGA
GCTTTTGACC CTACGGTAGC CCCTCTTATT AAACTGTGGA AAAACAACCT ACAGAAAGGT
ATACCTCCTG CTAATTCACA AATACAAGCT TTACAAGAAT ATGTTGGTTT AGACTATGTA
GTAGTGAATC AGAAGCGAGT AAAAAAACTG AAAGAAGGAG TTACAGTTGA TTTAAGCAGT
ATCATTTCTA GTTATGCGAT AGACGTGATA GTAGCTTTTT TACATTCTAA AGGTGTAAAA
GATTTGTGTA TAGAATTAGG TAATGAAGCG GTAGCACATG GGATAAATAG TGACAAACAG
CCGTGGCAAG TAAAACGAAC CATAACTGAA AATAAGTTTA TAATTGAACC TTTTTCTATA
CACGGCAAGC TAACTGACAA AGCTATTTCT ATAGTTAGGC AGTATGCTCC TTGGGACAGT
GAACAAAACA TGCATATTAT TATTAACCCA CAAACAGGCT ATCCCGCTCA TGGAAATATA
ATAGCTGCTT CCGTACTAGC AAATGACTGT ACGACAGCAA GCGCATATGC CACGGCTATT
CTTACAAAAG ATTTTGATGG AGCTTTAAAA ATGCTTGAAA CCATCGATAG CATAGAAGTG
TTTCTGATAT ACCAAGACGA GCAGGGTAAA GTGGAGTTTT ACAATTCTAA GGGACTACAC
ATACAACCTA AGGAAGGCGT TCAAGGAATT TATCTTGAAA ATAAAAAAGC AGTAGCAGAA
GATTCTTCTA AGGAATCAAA AGTACAGGCT GATAATTAA
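
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any composition statistic is computed. The following is a minimal sketch of that cleanup and of a GC fraction, assuming GC content means (G + C) divided by the total number of bases. How the source database computed and rounded its 34% figure is not stated, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence: 7 of the 20 bases are G or C.
first_blocks = clean_sequence("ATGAAAATGG ATGGTACAGA")
print(len(first_blocks))
print(gc_fraction(first_blocks))
```

Pasting the full gene sequence into clean_sequence and passing the result to gc_fraction gives the value for the whole gene.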
 
Protein sequence
MKMDGTEIYT KETKPNIKSI TLRYLKLLFR WVNKYKWLLL LLLLFLFIIS VRKEKPQVIT 
LQGQALGRNY TVQYKVKGDA NYQTEIEALL ADVSQALDIS NKDSEVAKFN RHNCTAFHFE
SPYLYPILDK SKEIYNRTQG AFDPTVAPLI KLWKNNLQKG IPPANSQIQA LQEYVGLDYV
VVNQKRVKKL KEGVTVDLSS IISSYAIDVI VAFLHSKGVK DLCIELGNEA VAHGINSDKQ
PWQVKRTITE NKFIIEPFSI HGKLTDKAIS IVRQYAPWDS EQNMHIIINP QTGYPAHGNI
IAASVLANDC TTASAYATAI LTKDFDGALK MLETIDSIEV FLIYQDEQGK VEFYNSKGLH
IQPKEGVQGI YLENKKAVAE DSSKESKVQA DN
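
The protein sequence corresponds to the gene sequence read codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so a standard codon table is enough for a quick check. The sketch below is a plain-Python illustration with hypothetical names, not the tool used to produce this record. It stops at the first stop codon and does not handle ambiguous bases.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments are
# those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence.
print(translate("ATGAAAATGG ATGGTACAGA A"))  # MKMDGTE
```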