Gene Aasi_1253 details


Gene Information

Locus tag: Aasi_1253
Symbol:
ID: 6377376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1605293
End bp: 1608163
Gene Length: 2871 bp
Protein Length: 956 aa
Translation table: 11
GC content: 33%
IMG OID: 642682348
Product: hypothetical protein
Protein accession: YP_001958304
Protein GI: 189502587
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
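
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions agree with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1605293, 1608163)
print(length)                     # 2871
print(protein_length_aa(length))  # 956
```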


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGTTT TTAAAAATCA ACATCTATAT ACAGCACTTG CTAGCTTGTT GATTATTCTT 
ATCACTATCT CTTGTAATCA CTGTGGTAAT ACTAATAATG CTAAAGAAGT AAAAACTTTA
GATTTATCAA TAAGCAAAGA TTTGCTGCAA GGAAGCGATG AGCTTAATTT TATGGTTAAG
ATTAGTAATC AGGATGGCAA AGAAGCACAT TTAAATAAAT TTAAACTAAA AATTAGTGTT
GAAGAGCCTC ATAATTTTCT TAACTACATA GATGCGAAAG GCACTACACA AATTAAGTCT
GTTATTGATG AAAATTTAAC ACATTTTACG GTCCAGCAGT TGTTGCAATC TGCAGATAAG
CCTATTAACT TAGAGTTTAT AATACAACCT CAATTAGCAT CTAAACAAGT TAACATAAAA
GCAGCCCTCT ATTATGAAGG TACAGAGCAA CCCATTGTAG AGAAATCTAT TACCTGGCAA
GAAACCGTAT CACCTTATCA GCTAAGCTTC AATAGCATAA GTGCTTCTGA TTTATTAGAA
GGTGGTGAAG AATATATCTT TAAAATAGAG AATCAAGATC CTGAATATGC CTTAGCTACA
GATGAAGTTA TTCTCTCGTT ACAAAGCAAA GCAGAATTTA CATTAAATGG CTGTTTAGCA
ACTGAGCAAG GAATCTTGCT TAAAACAGTT TTAGGAATAG AAAAAATTGA GCAGGCACAA
AATGTTAAAT ATAACCTTGC CAATATTAGG TTAAAAATAA AAGATCCAAA CGGGCAAACT
GATGCTACCT CTGTTGTACT AACATTAAAA GATAAATTTG GCAATAAGTT AGGAAGTGAC
AAAATAATAA CCTGGAAGCC TAAAAGTGAG ATATCATTTA TACTAAGCTT TGATAAGTTA
GAATTGCAAG GATCTGAACT CAGCAATAGA CAAATTAAAT TTACAGTAAA TCAGTCAGGG
GAATCTATTC TTAATAATGG TGAGTTAATC TTACAATTGA CGCCAGAACA AGGTAGTATG
GCTAGCATCT TAGGTGCTAA CCTAATAACA GATGTGTCTG GTAAAACAGT TTATATTTAT
AAGATTAAAA AAGAGGATAT AGGCAAACAA AGTGCAGCTT TAAGTATAGA CCCACAAGAA
AGTAAGGAAG CTAGCTTCAA GGCACAGCTT TTGTATAATG GAGCTCTCCT AGGAGTTACA
CAGAAATTAG TTTGGCAAGC TGGAGCAGAG TTGAGTTTTA GCTTAGAAGG ATTGGAAGAA
AAAGATAGAT GTATTTTATC AGGTACCCAA ACATTGAAGG GTACTGATAT ACTCCAAATT
GTAATTAAGA ATTTGAGTAG AGCATTGAAA AAAGAGGGGG AAGCTGTATT GTGTGTTGAG
CAAAACGAAC ACCCCAGTAA TGTAGCTTTT GAAGTATATT ACAATTATGT TGACAAGATT
GAACATGATA CCCCTAATAT ATCGCTAGGT AAGCATAAAG CTGTAACTAT AGACCTTTAT
CATCTTATAG CAAATAACAA TTTTGTTAAA AGAGAGGACG ATATCAAGGT TGCCTTACAA
TTGTTAAATC CTACATCCAA ACAACATGCT ACTGTTAGCT TCAAAATAAA GAATGCAAAT
AATAATAGCG ATATAACTAC ACCTATAACT ATCAATTGGC AAGCAGCTCT AGCTCCTGTA
ACACCAGTTA TAGATGAAAT GCTTGCTGTG GTTAAAAAAG CAACTTTATG TACAAGTTTA
TATAAAGTCT TAAAAATTAT TAAAAAAGGC AAGGGGATAC ACCCAAACGA TATCAATAAA
ATAGATGTAA AACATCCATA TGGTTACACG GCTTTACAAG AAGCTATATA TATGGGTCGA
TTGGACATTG TTACTTTATT ATTAGATAAA GGTGCTGATG TAAATATGAG AAATAAGCGT
GGGCAAGCAC CTATTGAATT AGCGCTTGGC AGATTTGATA TAGAGATGGT ACGTCTATTA
TTAAAGCAAC CAGATATACA ATCAAGGATA AGTATTTATA ATGGTGGAGA AAAACTATTA
TTAAACCTAG TCATAGAACG AGCTAATACG GTAGATAAGG AAAAATTTAC AGAACTTACC
GATCTCCTAT TAGATCACTT GAATACACCT GATGTGCTCA ATAAGCAAGA TAGTATTATA
AAACAAACTC CTCTCTTGTT GGCTATGCAT TATAACCAAC CTGAACTAGC AAAGAGGTTA
TTAGAAAAAG GAGTTAATCC AAATATAAAA GATAATCAAG GTAGAAATGC GCTTCATCTA
GCTGTTACTC ATAACCATAA AGAGTTAGCA GAACAACTGA TAGCAAAAAA TATCGAGTTA
GATATAAAAG ATGATAAAGG TGATACCCCT CTGCATATGG CCGTATCTCT ATCTAGCAGC
AAGGAGGTAG CTAACCTATT AATCAACAAA TTTAAAGAAA GTGGAATTAG CTTAGATATA
CTGGGGTACA AAGAGGTTAC GCCTTTGCAT AGAGCTGCCG CAGCACAAGG AGATAATGTG
GAAATTGTTA CAGCATTATT AGAAGCTGGT GCTCAGCTAG ACGTAATAGA TAAAGATCAG
CAAACACCTT TGCATTATGC TGCTCAAAAT AATAACATCA AGGTTATTGA AAAACTGACA
CAATACAACC CTAGTTTGAT AAATTTACAA GATAAAAATG GGAAAACCCC CTTGCATATG
GTAGTTTCTC AAAATTATAA TACCTCTAAT GTCAAAAAAC AAATAGCGCA AACTATTAAC
TTTCTAATAG ACAAAGGTGC TAGGTTAGAC ATCGAGGATA ACCAAGGGTA TACACCTTTA
AATATATTGG TTACTAGAAA TTACGCAGAT ATAGTACAAA AGGTATTATA A
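
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. A minimal sketch of that cleanup, together with a GC-content calculation of the kind behind the "GC content" field, might look like the following. The helper names are hypothetical, and how the database rounds its reported percentage is not known from the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGCAAGTTT TTAAAAATCA ACATCTATAT ACAGCACTTG CTAGCTTGTT GATTATTCTT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence block through `clean_sequence` before calling `gc_percent` gives the value for the whole gene.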
 
Protein sequence
MQVFKNQHLY TALASLLIIL ITISCNHCGN TNNAKEVKTL DLSISKDLLQ GSDELNFMVK 
ISNQDGKEAH LNKFKLKISV EEPHNFLNYI DAKGTTQIKS VIDENLTHFT VQQLLQSADK
PINLEFIIQP QLASKQVNIK AALYYEGTEQ PIVEKSITWQ ETVSPYQLSF NSISASDLLE
GGEEYIFKIE NQDPEYALAT DEVILSLQSK AEFTLNGCLA TEQGILLKTV LGIEKIEQAQ
NVKYNLANIR LKIKDPNGQT DATSVVLTLK DKFGNKLGSD KIITWKPKSE ISFILSFDKL
ELQGSELSNR QIKFTVNQSG ESILNNGELI LQLTPEQGSM ASILGANLIT DVSGKTVYIY
KIKKEDIGKQ SAALSIDPQE SKEASFKAQL LYNGALLGVT QKLVWQAGAE LSFSLEGLEE
KDRCILSGTQ TLKGTDILQI VIKNLSRALK KEGEAVLCVE QNEHPSNVAF EVYYNYVDKI
EHDTPNISLG KHKAVTIDLY HLIANNNFVK REDDIKVALQ LLNPTSKQHA TVSFKIKNAN
NNSDITTPIT INWQAALAPV TPVIDEMLAV VKKATLCTSL YKVLKIIKKG KGIHPNDINK
IDVKHPYGYT ALQEAIYMGR LDIVTLLLDK GADVNMRNKR GQAPIELALG RFDIEMVRLL
LKQPDIQSRI SIYNGGEKLL LNLVIERANT VDKEKFTELT DLLLDHLNTP DVLNKQDSII
KQTPLLLAMH YNQPELAKRL LEKGVNPNIK DNQGRNALHL AVTHNHKELA EQLIAKNIEL
DIKDDKGDTP LHMAVSLSSS KEVANLLINK FKESGISLDI LGYKEVTPLH RAAAAQGDNV
EIVTALLEAG AQLDVIDKDQ QTPLHYAAQN NNIKVIEKLT QYNPSLINLQ DKNGKTPLHM
VVSQNYNTSN VKKQIAQTIN FLIDKGARLD IEDNQGYTPL NILVTRNYAD IVQKVL
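
The protein sequence can be reproduced from the gene sequence with translation table 11, which assigns the same amino acids to codons as the standard genetic code. The sketch below builds the codon table from the usual TCAG ordering and translates until the first stop codon. It is a simplified illustration: it does not handle alternative start codons or ambiguous bases, and the function name is an assumption rather than part of any particular library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCAAGTTT TTAAAAATCA ACATCTATAT"))  # MQVFKNQHLY
```

The output matches the first ten residues of the protein sequence listed above.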