Gene Aasi_1251 details


Gene Information

Locus tag: Aasi_1251
Symbol:
ID: 6377361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1598933
End bp: 1601890
Gene Length: 2958 bp
Protein Length: 985 aa
Translation table: 11
GC content: 36%
IMG OID: 642682346
Product: hypothetical protein
Protein accession: YP_001958302
Protein GI: 189502585
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
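
The Start bp, End bp, Gene Length and Protein Length values in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1598933
END_BP = 1601890
REPORTED_GENE_LENGTH_BP = 2958
REPORTED_PROTEIN_LENGTH_AA = 985


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2958 985
```

Both computed values agree with the reported ones, which supports the inclusive-coordinate assumption for this record.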


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAGAA CAAGACATAT ACTTTATTTT ATAGGTAGCT TCTTATTAGT TTCCTGTGAT 
CGGGATATAC CTATCACAAC TTCAGCTCCA AAAGATATTC GTAAAACTAC AATTTTACAA
ACAGAAGAAG CTGGTAGCGA TTTGATTGCC TTAAATGAAA ATATGCTTGT GCAAGCTAAT
GAAAAGGAAG TAGCTAAAGC AAGCACACCT GTCGATCAGC TAAAATATAT AAGCCAACTT
CCGGAAGAGA CAGTGCTGCA AGACAAACAA GCACAAGGAA CTATAATCCA AGCGATTGCT
CAAGGCTGGC AGAACATTAA TCAGTATGTC AAAGAAGCCT ATATACCCTT TATCTTAGAG
CATGTTGAAA AGGAAGATCC AGAAGCATTG ATCGCCTTAT TATGGTTGAT GGAGAAATAC
ACAAAAAATC CTTTACTTAA ATCCTACAAG CTAAAATCAG AGGCTTATCA GAGGCTACTT
ATAAAGCTAG AAGAATCTAC AGATCATACA AGTATAGATG GTATGCATGC CTATTATATA
GGTGAAATAT ACAGTAGTAA GGCAATAAAA TACTTATCTC CAGATGCAAA CAAGGCCATA
GCATATTACC AAAAAGCAGT CAGGATGGGA AATGCTAATG CTGCTCATGC GCTAGGGTAC
ATTCATCATA AAGGAATAGA AGTGGAATTA GCACCTAATG CAGCAAAAGC TATAGAATAT
TATGAGAAAG CAATCGGGAT GGGAAATACT AAGGCTGTTC ATGCACTGGG CTTTCTTTAT
CATAATGGTA TGGAAGGTCA AATAGCACCT AATGCAGCTA AAGCTATAGA ATATTATGAA
AAAGCAATCG GGATGGAAAA TGCTGGAGCT GTTCATGCGC TGGGGTACCT TTATCATAAT
GGTATGGAAG GTCAAATAGT ACCTAATGTA GCAAAAGCTA TAGCATATTA TGAAAAAGCT
ATTGACATGG GATATGCAGA TGCTGCTCAT AACCTTGGCT TTCTTTACCA TAATGGTATA
GGAGATCAAT TAGAGCCTAG TGCAGCTAAA GCAATAGAAT ATTATGAGAA AGCTATTAGT
ATGGGAAATA CTGATGCTGC TCATAACCTT GGCATTGTTT ATGAGAGAGG AATAGAAGGC
CAATTAGTGC CTAATGCATC TAAGGCTATA GAATATTATG AGAGAGCTAT TAATATGGGA
GACGTTACGG CTGTTCATAA TCTTGGCATC CTTTATGCAA AAGGGATGAA TGGTCAGTTA
GCACCTAATG TAGCAAAAGC CATAGAAGCT TATGAGAAAA CTATCAAGCT AGGAAATGCT
GAGGCTGCTA CCGATTTAGG CATTCTTTAT GCAGAAGGGA TAAAAGGCCA ATTAGCGCCT
AATGCAGCAA AAGCCATAAA ATCATATGAG GAAGCTATTA AGCTAGGAGA TTTTAGGGCC
GCTACTAATC TTGGCTCTCT TTATCATCAT GGCATGGAGG GACAATTAAC GCCTAATCAA
GTAAAAGCTA TAGCATATTA CGAGAAAGCA GTTAGCATGG GAGATGCTGA AGGTGCTTAT
ACCCTTGGCG TTCTTTATGA GAAAGGAATG AAAGAACATT TAGCGCCTAA TGCAGTAAAA
GCTATAGAAT ATTATGAAAA AGCTATTAAA CTCGGAAGAA CTGATACTGC TAATAATTTA
GCCGTTCTTT ATCATAGGGG TATGCCAGGT CAATTAGCAT TCAATGCAGT TAAAGCTATA
GAATATTATA AGTTAGGTGT TGAGTTAGGT AATGCTGATG CTGCTACTAA TCTTGGCATC
CTTTATCATA ATGGTATGCC AGGTCAGTTA GTATCCAATT CAACTAAAGC TATAGCATAT
TATGAAAAAG CAGTTAGTAT GGGAGATGCT AAAGCTGCAT ATGGTCTTGG CATTCTTTAT
GATAATGGCA TAGAAGATCA ATTAACGCCT AATACAACAA AAGCGATAGC ATATTATGAA
AAAGCAGTTA GTATGGGATA TGGAGGCGCT GCTAATAGCC TTGGAGCTCT TTATGCGAGA
GGAATAAAAG GTCAATTAGC GCCTAATAGA GCAAAAGCTA TAGCATATTA TGAAAAAGCA
GTTAGTATGG GAAATGCTGA TGCTGCTCGT AACCTTATCA CCCTTTATGC GAAGGGTAAA
AGGGGGCAGT TAACTTCTAA TAAACAATTA ACTTTTAAAA CATATTATGA TACTTATCTA
GCAAATAATA AGTCTGATTA TGTTAAAGAG GGTTTGCTAA AATTTCTAGT AAGCAATCCA
TCAATAAAAG TTGATCCCTC TAATATAGCA GAGTCCCAGC AAGGGTTAGA GACTTTTAGG
GAAAATACAG AAACACTTAC CGGGCTTATT CTCCTCAAGC AAGAGGAAAA TAATGCCACT
TCTCAAGCTA TGCAATTTAA GGATTTCTAT ATTATTCCGG AGCTTTACCC TTGCTATAGT
GCACTTATAG AATATCTAGA TAAAGTTAGA AATATAATAC CTTATCTATC AAAATATGGG
GTTATGGTTG ATTGTATTAA AATTAAAAAG AATGGAAAAA AAAGAAAAGT TATAGCAGAT
GGTGATCATC TTCATACATA TTTGATAGGT GGTCAATCTT ACATATGTTT AGGAGAGAAT
AATGTGAAAG CAGGTAAAAT GTTAATGAGC CTTTTGGAAG AGGAAAAAGA TGTGAATCAA
ACTGTGGGTT CTATAAGAAA GATGATGATG CAGGGTTCTT CTACAGAATT AGGGCTACGT
TTATATAAGC AAGCATTAAA GAAATTACCC TCAGATTATA CAGGGCCAGT GGAAGAATAT
ATTGCTACTA GGACTACAGA GACACTTGGT ATGCTAGATA ATATGCAAGC TTTGTGCATA
CAATTAAAAG ATATAGTTAT AGCAACAACT TCTCTTAGAA ATAAAAGCTC TATGGAGCTT
TATCATTTCT TACAGTAG
 
Protein sequence
MHRTRHILYF IGSFLLVSCD RDIPITTSAP KDIRKTTILQ TEEAGSDLIA LNENMLVQAN 
EKEVAKASTP VDQLKYISQL PEETVLQDKQ AQGTIIQAIA QGWQNINQYV KEAYIPFILE
HVEKEDPEAL IALLWLMEKY TKNPLLKSYK LKSEAYQRLL IKLEESTDHT SIDGMHAYYI
GEIYSSKAIK YLSPDANKAI AYYQKAVRMG NANAAHALGY IHHKGIEVEL APNAAKAIEY
YEKAIGMGNT KAVHALGFLY HNGMEGQIAP NAAKAIEYYE KAIGMENAGA VHALGYLYHN
GMEGQIVPNV AKAIAYYEKA IDMGYADAAH NLGFLYHNGI GDQLEPSAAK AIEYYEKAIS
MGNTDAAHNL GIVYERGIEG QLVPNASKAI EYYERAINMG DVTAVHNLGI LYAKGMNGQL
APNVAKAIEA YEKTIKLGNA EAATDLGILY AEGIKGQLAP NAAKAIKSYE EAIKLGDFRA
ATNLGSLYHH GMEGQLTPNQ VKAIAYYEKA VSMGDAEGAY TLGVLYEKGM KEHLAPNAVK
AIEYYEKAIK LGRTDTANNL AVLYHRGMPG QLAFNAVKAI EYYKLGVELG NADAATNLGI
LYHNGMPGQL VSNSTKAIAY YEKAVSMGDA KAAYGLGILY DNGIEDQLTP NTTKAIAYYE
KAVSMGYGGA ANSLGALYAR GIKGQLAPNR AKAIAYYEKA VSMGNADAAR NLITLYAKGK
RGQLTSNKQL TFKTYYDTYL ANNKSDYVKE GLLKFLVSNP SIKVDPSNIA ESQQGLETFR
ENTETLTGLI LLKQEENNAT SQAMQFKDFY IIPELYPCYS ALIEYLDKVR NIIPYLSKYG
VMVDCIKIKK NGKKRKVIAD GDHLHTYLIG GQSYICLGEN NVKAGKMLMS LLEEEKDVNQ
TVGSIRKMMM QGSSTELGLR LYKQALKKLP SDYTGPVEEY IATRTTETLG MLDNMQALCI
QLKDIVIATT SLRNKSSMEL YHFLQ
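
The protein sequence above corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the snippet below builds the codon-to-amino-acid mapping from the compact NCBI-style string (the amino acid assignments of table 11 are the same as those of the standard code) and translates the first and last lines of the gene sequence. The `translate` helper is hypothetical, written for illustration; it ignores table 11's alternative start codons, which does not matter here because the gene starts with ATG.

```python
# Codon table in NCBI compact form: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGCATAGAA CAAGACATAT ACTTTATTTT ATAGGTAGCT TCTTATTAGT TTCCTGTGAT"
LAST_LINE = "TATCATTTCT TACAGTAG"

print(translate(FIRST_LINE))  # MHRTRHILYFIGSFLLVSCD
print(translate(LAST_LINE))   # YHFLQ
```

Each full line of the gene sequence holds 60 bases (20 codons), so every line begins in frame and can be translated on its own. The two results match the first 20 and last 5 residues of the protein sequence.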