Gene Aasi_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0785 
Symbol 
ID6376759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp999833 
End bp1001170 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content36% 
IMG OID642681929 
Producthypothetical protein 
Protein accessionYP_001957892 
Protein GI189502175 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0760] Parvulin-like peptidyl-prolyl isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.954902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAG CAAGTAAAAT ATTATGGATT GCTATAGGAG TATTTTTTAC CCAACATACG 
CTAATGGCAC AAACTCCACA AGGTAGATTA TTGGATAAAG TAATTGCTAG TGTAGACGAC
CAACCCATAT TACAATCTGA GTTAGATGCC GAGTATCAAC TCTATCAAGC ACAAGATAAC
GCAAGCAAGA AACCAACCAA GTGCCAAGTA CTTGAAAATA TGGTGATTAA CAAAATATTG
CTAGCCAATG CAGCTAAAAA GGAAATAAAT GTAAAAAATG GAGAAGTAGA TAGATACTTG
AAGTATAGAA TGCAAGCCAT ATTAGAAGAA GTAGGTACAG AAGCTAGGTT AGAACAATAT
ATACGTAAGC CTATACATGT CTTTAAAGAA GAACTTAGAA AGTCAATTAG AGAGCAGCTT
ACAATAGAAA AAATGCGAGA TTCTATTATT GGCAATATTA CTATATCTCC AATAGAAGTA
CAATCTTATT TTGATCAGCT TCCAGCTAGT GACGTGCCAT TTTTTCCAGC TACTGTAGAA
GCATATCAGC TTGTACTTTT TCCTAGCATA GAAGAGCAAG AAAAAAAGTT AGTTATAGAG
AATTTAGCAT CCTTAAAGAC ACGTATACAA GCAGGCGAGA GTTTTGCAGT GCTTGCTAGA
CAATATTCAG AAGATATAGG CAGTGCTAGC AACGGTGGTG AACTAGGATT CTGGAGGATT
GGTGAGCTTG ACTCATCTTA TGAGAAAGCT GCATTAGCTT TAAACCCAGG AGAAATATCC
GAGCCAGTAG AAACGAGATT CGGATTCCAT ATCATCCAAC TGATTGAAAA GCAAAAAGAT
AAATATAATA CTAGGCATAT TTTATTGAAA CCCCTGGCTG CTAAAGTGAG CATAGAAGAA
GCTATAGAGC GGATAAATAA TATAAGAACT TCTATTTTGG AGAAACAAGT TACCTTTGAA
AAAGCAGCTA TGTCCTATTC ACAAGATATT GCTACAGCTC ACCAAGGAGG CCTTTTAACG
GGTAATAGTG AAGGTGTGCA AATGCCTGTA GATAAGTTGC CATCAGATGT GTTTTTTATT
TTGGATAAAA TGGCACCAGG GGATGTTTCA CGGCCGATTG TCTTTACGAT AGATGGCAAG
CAGGCTGCTT GTATCATATA TTTGAAAGAA AGGATTGCTT CTCACCAAGC TAACTTTGGA
CAAGATTATG AGAAGATTCA TAAATTAGCT TTAGCGTATA AAAAGCAACG TATATTAAAT
GAATGGATTG AGGCGGCTAA AGCCAAAGCT ATTATACAAT TAGAGCCTTC TTATGAGACA
TGCCATATAT TAAAATAA
 
Protein sequence
MNTASKILWI AIGVFFTQHT LMAQTPQGRL LDKVIASVDD QPILQSELDA EYQLYQAQDN 
ASKKPTKCQV LENMVINKIL LANAAKKEIN VKNGEVDRYL KYRMQAILEE VGTEARLEQY
IRKPIHVFKE ELRKSIREQL TIEKMRDSII GNITISPIEV QSYFDQLPAS DVPFFPATVE
AYQLVLFPSI EEQEKKLVIE NLASLKTRIQ AGESFAVLAR QYSEDIGSAS NGGELGFWRI
GELDSSYEKA ALALNPGEIS EPVETRFGFH IIQLIEKQKD KYNTRHILLK PLAAKVSIEE
AIERINNIRT SILEKQVTFE KAAMSYSQDI ATAHQGGLLT GNSEGVQMPV DKLPSDVFFI
LDKMAPGDVS RPIVFTIDGK QAACIIYLKE RIASHQANFG QDYEKIHKLA LAYKKQRILN
EWIEAAKAKA IIQLEPSYET CHILK