Gene Aasi_0854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0854 
Symbol 
ID6377050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1084709 
End bp1086763 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content39% 
IMG OID642681992 
Producthypothetical protein 
Protein accessionYP_001957953 
Protein GI189502236 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00109274 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAGGC ATTATACTAT AAATCTAAAA CTCATAGCCT ATATTTTACT TATCAGCTTA 
TGCTTACAAA GCTGTGGTGG GTTAAATAAT TCAATTATTC CTATAGAAGA AGAAAAAGAT
CCACAAATAC AAACTGATAC CCAGCAACAA CTGATCCCAC ACACACAGAC GAGTATCCAG
TCTTTAGTTG AACAAACAGT GACTGTTCAA GGAGGCAATG CCGTTACTTT TTACGAGTAT
AAAGGAGAAT TGCAAGCCAG TGTAGAACCC TTATATGAGA AACATAAAGT CTACAATGGG
ATACCTGTAT ACATAGAAAA AGGAATAAAA ATAGAAACCT TATTTTGTCT GGACAAGAAA
ACACAAGAGC GACGTATACA TCTTCAAAAG GAAAAAGGAC GCCCGTCATA CGTTGCTATA
TACGAGCCAT GGTTAATGGG TGGAGGCAAT ATATTAGGCT ATCAGACAGA AGGACTGCCT
CAACAACTTC AAAAAGCTGA ACAGGGAGAT GCAAGAGCAC AATTTAACTT AGGAGTAATG
TACTTCAATG GAGAAGGAGT AGAAAAAGAT GCAAGGAAAG CGGTAGAATG GTTTCAAAAA
GCTGCTGAAC AGGGAGTTGC AGGGGCACAA TTTAACTTAG GACTAATGTA CTCTAAGGGA
AAAGGAGTAG AAAAAGATGC AAGGAAAGCA GTAGAATGGT ATGAGAAAGC AGCGGAGCAA
GGACATGCAG GGGCACAATT TAACTTAGGA CTAATGTACT CCAATGGAGA AGGAGTAGAG
AAAGATGCAA GGAAAGAATT AGGATGGTAT GAGAAAGCAG CCAACCAAGG AAATGTAGAC
GCACAATTTA ATTTAGGAGT AATGTATGCC AAGGGAGAAG GAGTAGAGAA AGATGCAAGG
AAAGCAGTAG AATGGTATCA AAAAGCAGCC AACCAAGGAA ATGCAAGAGC ACAATTTAAT
TTAGGAGTAA TGTATGCCAA GGGAGAAGGA GTAGAGAAAG ATGCAAGGAA AGCAGTAGAA
TGGTATCAAA AAGCAGCCAA CCAAGGAAAT GCAAGAGCAC AATTTAACTT AGGAGTAATG
TACTCCAAGG GAGAAGGAGT AGAGAAAGAT GCAAGGAAAG CAGTAGAATG GTATGAGAAA
GCAGCCAACC AAGGAAATGT AGAGGCACAA TTTAATTTAG GAGTAATGTA TGCCAATGGA
GAAGGAGTAG AGAAAGATGC AAGGAAAGCA GTAGAATGGT ATGAGAAAGC TGCTGAACAG
GGAGATGCAA CTGCGCAATT TAACTTAGGA CTAATGTACT CTAAGGGAAA AGGAGTAGAA
AAAGATGCAA GGAAAGCAGT AGAATGGTAT CAAAAAGCAG CCAACCAAGG AAATGCAAGA
GCACAATTTA ACTTAGGAGT AATGTACTCC AATGGAGAAG GAGTAGAGAA AGATGCAAGG
AAAGCAGTAG AATGGTATGA GAAAGCTGCT GAACAGGGAG ATGCAACTGC ACAATTTAAT
TTAGGAGTAA TGTATTCCAA TGGAGAAGGA GTAGAGAAAG ATGCAAAAAA AGAATTAGAA
TGGTATAAGA AAGCTGCTGA ACAGGGAGAT GCAACTGCAC AATTTAACTT AGGAGTAATG
TACTCTAAAG GATTAGGAGT AGAGAAAGAT GCAAAAAAAG AATTAGAATG GTATAAGAAA
GCTGCTGCAC AGGGAAACGC AAGTGCACAA TTTAATTTAG GAGTAAGATA TGGAGAAGGA
TTAGGAGTAG AAAAAGATGC AAAAAAAGAA TTAGAATGGT ATGAGAAAGC TGCAGAGCAA
GGACACGTGA AAGCACAACA TAATTTAGCA TGGATGTATG CAAATGGAGA AGGAACAGCC
CAAAACTATA CTAAAGCAAT AGAATGGTAT GGGAAAGCCG CTGAAAAAGA AGATGCAGAT
GCACAATTTA ATCTAGGGCA GATGTATGAG AAGGGAGAGG GAGTAGCTAA AGATTGTGCT
AAAGCGGCAG AATGGTATCA AAAGGCTGCT GAAAAGGGAG ATTTAGATGC ACAAGAGAGG
TTGAAAAATA TGTAG
 
Protein sequence
MKRHYTINLK LIAYILLISL CLQSCGGLNN SIIPIEEEKD PQIQTDTQQQ LIPHTQTSIQ 
SLVEQTVTVQ GGNAVTFYEY KGELQASVEP LYEKHKVYNG IPVYIEKGIK IETLFCLDKK
TQERRIHLQK EKGRPSYVAI YEPWLMGGGN ILGYQTEGLP QQLQKAEQGD ARAQFNLGVM
YFNGEGVEKD ARKAVEWFQK AAEQGVAGAQ FNLGLMYSKG KGVEKDARKA VEWYEKAAEQ
GHAGAQFNLG LMYSNGEGVE KDARKELGWY EKAANQGNVD AQFNLGVMYA KGEGVEKDAR
KAVEWYQKAA NQGNARAQFN LGVMYAKGEG VEKDARKAVE WYQKAANQGN ARAQFNLGVM
YSKGEGVEKD ARKAVEWYEK AANQGNVEAQ FNLGVMYANG EGVEKDARKA VEWYEKAAEQ
GDATAQFNLG LMYSKGKGVE KDARKAVEWY QKAANQGNAR AQFNLGVMYS NGEGVEKDAR
KAVEWYEKAA EQGDATAQFN LGVMYSNGEG VEKDAKKELE WYKKAAEQGD ATAQFNLGVM
YSKGLGVEKD AKKELEWYKK AAAQGNASAQ FNLGVRYGEG LGVEKDAKKE LEWYEKAAEQ
GHVKAQHNLA WMYANGEGTA QNYTKAIEWY GKAAEKEDAD AQFNLGQMYE KGEGVAKDCA
KAAEWYQKAA EKGDLDAQER LKNM