Gene Aasi_1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1118 
Symbol 
ID6376799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1433971 
End bp1434960 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content35% 
IMG OID642682228 
Producthypothetical protein 
Protein accessionYP_001958188 
Protein GI189502471 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.420783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAGT TAGCAGGCTA TTATAAAAAT CAAGGACAAG AATCTCGAGC AACTATAATG 
GATAGTATGT CAAAAGATAA AGAAGTTACT GGATCTGCCG AGTGGCACTT AGGTAAGCTG
TATGAAAATG GTTGGGGGAT AACTAAAGAT TGTAAAAAAG CTATAGCATG GTATCAAAGC
GCGAGTTATC AAAATCATAC GGAAGCACAA TGTAGGCTTG GTAGGATTTA TGAGAATGGT
ATAATAAATG GTATGATAAC AGAGAAAGAT GAACAAGAAG CGAGAGACTG GTATGAAAAA
GCTGCTGAAA GGGGAAGTTC AGTAGCAAGG AATGCATTAT GTTCTATGTA TGAAAAGGCT
GTAAGAGTAA GACAAGAAGA TATGGAAGCA CAATATAACC TAGGAGTAAT GTATTACAAG
TGCTGGGGAG TAGATAAAAA TTATCAAGAA GCTAAAGAAT GGTATGAAAA AGCTGCGGAG
CAAGGATACG CGAAAGCACA ACATACTTTA GCAGCAATGT ATATAAATGG AGAAGGAGTA
GAAAAGGACC ATGTTAAAGC ATTTAAATGG TGTCAAAAAG CTGCGAAGCA AGGATACGCA
AGAGCACAAC ATAATTTAGC AGCAATGTAT ATAAATGGAG AAGGAGTAGA AAAGGACCAT
GCTAAAGCAT TTAAATGGTG TCAAAAAGCT GCGAAGCAAG GATACGCAAA AGCACAAGAT
AATTTAGCAG CAATGTATAT AAATGGAGAA GGAGTAGAAA AGGACCATGC TAAAGCATTT
AAATGGTGTC AAAAAGCTGC GGAGCAGGGT AATGTAAGTG CACAATACAA TAGAGCCGCT
GCGAAACAGA AAATTAATAA AACTATTGGA TTTTTGAGAG ATAAATTTAC TATCTACAAA
AAATCAACCT GCCTTTTTTA TACTTTAAAT GCCTATTTTT ATTTTTTGTG TTCTCTAAAT
TGTGAAAAAT ACTCCTCTGT TTTTTTATAA
 
Protein sequence
MSKLAGYYKN QGQESRATIM DSMSKDKEVT GSAEWHLGKL YENGWGITKD CKKAIAWYQS 
ASYQNHTEAQ CRLGRIYENG IINGMITEKD EQEARDWYEK AAERGSSVAR NALCSMYEKA
VRVRQEDMEA QYNLGVMYYK CWGVDKNYQE AKEWYEKAAE QGYAKAQHTL AAMYINGEGV
EKDHVKAFKW CQKAAKQGYA RAQHNLAAMY INGEGVEKDH AKAFKWCQKA AKQGYAKAQD
NLAAMYINGE GVEKDHAKAF KWCQKAAEQG NVSAQYNRAA AKQKINKTIG FLRDKFTIYK
KSTCLFYTLN AYFYFLCSLN CEKYSSVFL