Gene Aasi_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0220 
Symbol 
ID6376640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp243573 
End bp245258 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content34% 
IMG OID642681408 
Producthypothetical protein 
Protein accessionYP_001957393 
Protein GI189501676 
COG category[N] Cell motility 
COG ID[COG3225] ABC-type uncharacterized transport system involved in gliding motility, auxiliary component 
TIGRFAM ID[TIGR03521] gliding-associated putative ABC transporter substrate-binding component GldG 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.447855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTTA GTAAGCATAC TTATAAGAAT AGATGGTTGA TATTATTAAC TTTTGTATCA 
GTTATAGGAA TAGCTAATAA ACTAGCTTAT TATTTTCCTT TACGGGTAGA TTTAACAGAG
GATAAGCGCT ATTCTCTACA TTCCTCTACA AAAGCATTAC TCAGTCGTTT AGAATCTGAG
TTGCAAGTAA ATATATATTT ATCAGGAGAT TTACCAAATG AATTTAAGCA ATTACAGACT
GGTGTAATTG CACTTTTAGA AGAATTTAAA GCTTATGCAC GTTGCCCTGT TACTTGCCAC
GTAATAGACC TAAGCAAAGA ACCTGCTGAT AAGCGTAAAG AAATACTAAA GAAGTTGTTG
GAAAAGAAAA TAGAACCTAC TAACCTTTAT AGGCAAGCAC ATGGAAAACG AATAGAAAAT
CTAATTTATC CAGGTGCTAT TCTCTCTTAC CAGGGGAATG AAGTGGGTGT TATGATGTTG
AAGGCTGATA AAATGACACC TACTAACAAG ATGGTAAGCC AATCTATAGA AAATCTTGAA
TATGAATTTA TCAGTTCGCT GGCTAAGTTA ATTGATTATA AAAATGTAAA AATAGGCCTT
ATTAAAGGAC ATGGAGAACC CAATACCACT CAGTTACAAG GGCTATCACA AGCGCTAGGA
GAATTGTATG AAGTTCATAA CGTAATGCTT TCTCAAGTGT TTGAATTATC TAACTATGCC
GCTTTATTGA TCACTAAGCC TCAAGAGGGT TTTACAGAGT CTGAAAAGTA CGTATTAGAT
CAGTATATTA TGCAAGGGGG TAAGGTCTTA TTTTTTTTAG ATCGCTTGAA AATTAATATG
GACAATCTAG CTAGTGGAAA TTCAATTGCA CTTCCGTTAG AGCTTAATTT AGATGACCAA
TTATTCAGGT ATGGAGTAAG AATTAACCCA GACCTGATAA AAGACTTACA GGCTGGTGTC
TATCCTATTA TAGTTGGTAA AATGGGCAAT CAGCCTCAAC TCAAGCTTCT TCCTTGGCCT
TTCTTTCCTA TTATTAATAA TTTCTCCGAC CATTTAATAA CTAAAAATAT AAATGCTATA
TATACACAAT TTATTAGTAG TATAGATATT GTGAAGGTGG AAGGTGTCAT TCAAACTCCC
CTTTTATATA GCTCTCCATA TAGTCTTAAA GCGACTACTC CTGTGTATGT AGACCTAGAA
TCACTAAGAA GAGCACCTGA TACAAACCTA TATAAGCAAG GCCCTATTCC ATTAGCTTGC
CTATTAGAAG GTAAATTTAA TTCATTGTAT AAAAATAGGA TGCTTCCTAA AGGCATAGAT
GCTACACAGT TTCTTGCTGT GAGCCAGCCT ACTAAAATAC TTGTTGTAGC AAGTGGCAGT
ATTGTACTCA ATGCTGTTTC CCCAAAAGAT CAGCAAGCGT TACCATGGGG TTATGATCCA
TTTTTGCAAC AAAGCTTTGC TAATCAAGAC TTTGTGTTGG GTGTACTTGC TTATATGTTA
GAAGAATCAG GGGTTATTAA TGCCAAGCGT AAAACAGTTA AACTGCGTCT TTTAGACAAT
TTAAAAGTAA CTAGAGAAAG GTTGTATTGG CAGTTGTTAA ATATAGTTAC ACCTATTTTT
ATGTTGGTGC TTATGGGAAT ACTTTGGCAT ATAGTCCGAA GAAAAAGATA TAGGGTAAAA
ATTTAG
 
Protein sequence
MRFSKHTYKN RWLILLTFVS VIGIANKLAY YFPLRVDLTE DKRYSLHSST KALLSRLESE 
LQVNIYLSGD LPNEFKQLQT GVIALLEEFK AYARCPVTCH VIDLSKEPAD KRKEILKKLL
EKKIEPTNLY RQAHGKRIEN LIYPGAILSY QGNEVGVMML KADKMTPTNK MVSQSIENLE
YEFISSLAKL IDYKNVKIGL IKGHGEPNTT QLQGLSQALG ELYEVHNVML SQVFELSNYA
ALLITKPQEG FTESEKYVLD QYIMQGGKVL FFLDRLKINM DNLASGNSIA LPLELNLDDQ
LFRYGVRINP DLIKDLQAGV YPIIVGKMGN QPQLKLLPWP FFPIINNFSD HLITKNINAI
YTQFISSIDI VKVEGVIQTP LLYSSPYSLK ATTPVYVDLE SLRRAPDTNL YKQGPIPLAC
LLEGKFNSLY KNRMLPKGID ATQFLAVSQP TKILVVASGS IVLNAVSPKD QQALPWGYDP
FLQQSFANQD FVLGVLAYML EESGVINAKR KTVKLRLLDN LKVTRERLYW QLLNIVTPIF
MLVLMGILWH IVRRKRYRVK I