Gene Aasi_0871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0871 
Symbol 
ID6377070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1104977 
End bp1106917 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content36% 
IMG OID642682008 
Producthypothetical protein 
Protein accessionYP_001957969 
Protein GI189502252 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.985956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGGA AATACAATTT AACGCATCAA TTAGTAATCT GCCTTTTGCT TACCAACTTA 
TTTTTGCAAA TGCAAAGTTG CGGCAATTCT CCTTTGCCTA ATCCTATGGA GAAAAATCAA
AATACTACAG TACAAAATAT AAATAGGCAA AGGGGTAAGG AGCAAACCCT GGTAGGAAGA
ACCACTAGTA GTTCATCTTC TGCACCAAGT ACTGAATATC AAGAGGTAGC TACAACTGTC
CCTACCTATG AGCATCCATC TGATCAGGAA ACTTTACAAG GAAATGGTTC GTCTAATAGC
GGTAATAGTA TGGGACTAAG GCGTAGAAGA AAAGATAAAA AAGGTAAAGA GAAAGTATTA
GAAGGAGAAG AAAATTCTAT GACTAATAAA AATAAAAGTC ATGGTGGTAA AGCTACAGAA
ACAAGACTCA GCAAAATCAT TGGTAAGTCA GCAAGTGAAG CAATTCAAAA AAGAAAGGAA
AAAGGAAATA ACATGTATTC CTTGCATGAG GCTATTGAAA GTATGGATAT AGAAAGTATT
CAAGCATTAA TAGAGGCAGG CACTAAAGTT AATTTTAAGG ACATAAATGG GAATACAGCT
TTGCACCTGG CTATCAAGCA TGTTGACATA TTTCTAAATA ATTATTTACA ACCTTTATCA
GAAACATATA CCACTCCTAT CTTTAAAAGT GTTGATCGTA CTAGTCTTGT ACATTGTTGT
TTGGCAGCTA TTAAGAAAAG TTATATAGAA GCAATAGTTA GACGATTGAT AGAACTAGGT
GCTGATATAA ATGCTAGAAA CAAGCAAGGA GAAACACCCT TGCATATAGC TGTACAAGTA
AGCAGTGAAG AAGGAATAAA GCTACTACGT GAAAAAAGCG CAGATATTAA AATCAAGGAT
ATACACGGCA ACTCTCCGCT GCATCATGCT GCTGTAGCTG GACAGTTAGA AATAGTTGAG
CTGTTGATAA AGCAATGGGG TTATGATATA GTAACTAGCA AGAACAACAA TAACGAGACA
GTATTACACT GGGCTGCTAA AGGAGGAAAT CCAGAAGTAG TTGAACTTTT AATAAGGCAA
GGTATTAATG CAGAAACTAA AGATAAGTCC GGTAATTCTC CGCTACATTA TGCTGCTGAA
GCAGGACAGC TAAAAGCAGT TAAATTACTG ATAAAAGAGT GGGGCAGCAT TATAAATGTT
AAAAACAATA ATAATGAGTC TGCATTACAT CATGCTGCTA AAAAAGGTCA CGTGGCAGTA
GCACGATTTT TAATAAAAAA GGGAATTACT ATAGACCGTC AGAATAAGCA TGGTTATAAT
CCATTAAGTT TGGCTGTTGA AAATCACCAT GCAGCAGTAA TCAATTTCTT AAAAGAGAAG
GGGGCAAATA TAGATACTGT AGATGATGAA GGTCGTACCC CCTTACATTG GGCTGCTTTA
CAAGGCCATA CAACATTAAT CAAGCAATTA AAAGAGCAAG GCGCAAATAT AGAAGCTAGA
GATCAAGATG GTTATACACC GCTACATCTT GCTAGTGGAA GAGCTCGGAT GGAAGCAATA
AAAATGTTAC AAAAACAAGA GGCTGATATA TTTGCAAGAG ACCATATTGG GTTTACTGCT
CAGCAATTAA TAGAACAGCG TCCAACACCG TGGGGTATTT CGTTTACTTA TATAATGTTG
GCAGGGTTGT TATACGAATT TTTAATTGCC TGTATGTTTC ATCCTACTGC AGCACTTGTA
TTTTTCTCAG TAATAATTTC TATATTAGCT GCTAACTATA ATCTTTATAG AAGGTATATA
AGATTTTATA TGAGTGCACG TAGCATGAAT ATTTTAGACC GTATTGCTGG AAATAGGCTT
GTACGATGGT GTAATTCACT ACCTGCTATA CTAGGTATAT TAATGCTTCT CTATACGCTA
ATAACACATT ATTTCGGTTA G
 
Protein sequence
MQRKYNLTHQ LVICLLLTNL FLQMQSCGNS PLPNPMEKNQ NTTVQNINRQ RGKEQTLVGR 
TTSSSSSAPS TEYQEVATTV PTYEHPSDQE TLQGNGSSNS GNSMGLRRRR KDKKGKEKVL
EGEENSMTNK NKSHGGKATE TRLSKIIGKS ASEAIQKRKE KGNNMYSLHE AIESMDIESI
QALIEAGTKV NFKDINGNTA LHLAIKHVDI FLNNYLQPLS ETYTTPIFKS VDRTSLVHCC
LAAIKKSYIE AIVRRLIELG ADINARNKQG ETPLHIAVQV SSEEGIKLLR EKSADIKIKD
IHGNSPLHHA AVAGQLEIVE LLIKQWGYDI VTSKNNNNET VLHWAAKGGN PEVVELLIRQ
GINAETKDKS GNSPLHYAAE AGQLKAVKLL IKEWGSIINV KNNNNESALH HAAKKGHVAV
ARFLIKKGIT IDRQNKHGYN PLSLAVENHH AAVINFLKEK GANIDTVDDE GRTPLHWAAL
QGHTTLIKQL KEQGANIEAR DQDGYTPLHL ASGRARMEAI KMLQKQEADI FARDHIGFTA
QQLIEQRPTP WGISFTYIML AGLLYEFLIA CMFHPTAALV FFSVIISILA ANYNLYRRYI
RFYMSARSMN ILDRIAGNRL VRWCNSLPAI LGILMLLYTL ITHYFG