Gene Information |
Locus tag | Aasi_1253 |
Symbol | |
ID | 6377376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1605293 |
End bp | 1608163 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642682348 |
Product | hypothetical protein |
Protein accession | YP_001958304 |
Protein GI | 189502587 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
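The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are common conventions but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1605293
end_bp = 1608163
gene_length_bp = 2871
protein_length_aa = 956

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end base.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced open reading frame whose last
# codon is a stop codon, which is not translated into an amino acid.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print("span (bp):", span_bp)
print("matches Gene Length:", span_bp == gene_length_bp)
print("length divisible by 3:", span_bp % 3 == 0)
print("matches Protein Length:", expected_protein_length == protein_length_aa)
```

Under these assumptions the reported values are mutually consistent: 2871 bp gives 957 codons, and dropping the stop codon leaves 956 amino acids.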
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGTTT TTAAAAATCA ACATCTATAT ACAGCACTTG CTAGCTTGTT GATTATTCTT ATCACTATCT CTTGTAATCA CTGTGGTAAT ACTAATAATG CTAAAGAAGT AAAAACTTTA GATTTATCAA TAAGCAAAGA TTTGCTGCAA GGAAGCGATG AGCTTAATTT TATGGTTAAG ATTAGTAATC AGGATGGCAA AGAAGCACAT TTAAATAAAT TTAAACTAAA AATTAGTGTT GAAGAGCCTC ATAATTTTCT TAACTACATA GATGCGAAAG GCACTACACA AATTAAGTCT GTTATTGATG AAAATTTAAC ACATTTTACG GTCCAGCAGT TGTTGCAATC TGCAGATAAG CCTATTAACT TAGAGTTTAT AATACAACCT CAATTAGCAT CTAAACAAGT TAACATAAAA GCAGCCCTCT ATTATGAAGG TACAGAGCAA CCCATTGTAG AGAAATCTAT TACCTGGCAA GAAACCGTAT CACCTTATCA GCTAAGCTTC AATAGCATAA GTGCTTCTGA TTTATTAGAA GGTGGTGAAG AATATATCTT TAAAATAGAG AATCAAGATC CTGAATATGC CTTAGCTACA GATGAAGTTA TTCTCTCGTT ACAAAGCAAA GCAGAATTTA CATTAAATGG CTGTTTAGCA ACTGAGCAAG GAATCTTGCT TAAAACAGTT TTAGGAATAG AAAAAATTGA GCAGGCACAA AATGTTAAAT ATAACCTTGC CAATATTAGG TTAAAAATAA AAGATCCAAA CGGGCAAACT GATGCTACCT CTGTTGTACT AACATTAAAA GATAAATTTG GCAATAAGTT AGGAAGTGAC AAAATAATAA CCTGGAAGCC TAAAAGTGAG ATATCATTTA TACTAAGCTT TGATAAGTTA GAATTGCAAG GATCTGAACT CAGCAATAGA CAAATTAAAT TTACAGTAAA TCAGTCAGGG GAATCTATTC TTAATAATGG TGAGTTAATC TTACAATTGA CGCCAGAACA AGGTAGTATG GCTAGCATCT TAGGTGCTAA CCTAATAACA GATGTGTCTG GTAAAACAGT TTATATTTAT AAGATTAAAA AAGAGGATAT AGGCAAACAA AGTGCAGCTT TAAGTATAGA CCCACAAGAA AGTAAGGAAG CTAGCTTCAA GGCACAGCTT TTGTATAATG GAGCTCTCCT AGGAGTTACA CAGAAATTAG TTTGGCAAGC TGGAGCAGAG TTGAGTTTTA GCTTAGAAGG ATTGGAAGAA AAAGATAGAT GTATTTTATC AGGTACCCAA ACATTGAAGG GTACTGATAT ACTCCAAATT GTAATTAAGA ATTTGAGTAG AGCATTGAAA AAAGAGGGGG AAGCTGTATT GTGTGTTGAG CAAAACGAAC ACCCCAGTAA TGTAGCTTTT GAAGTATATT ACAATTATGT TGACAAGATT GAACATGATA CCCCTAATAT ATCGCTAGGT AAGCATAAAG CTGTAACTAT AGACCTTTAT CATCTTATAG CAAATAACAA TTTTGTTAAA AGAGAGGACG ATATCAAGGT TGCCTTACAA TTGTTAAATC CTACATCCAA ACAACATGCT ACTGTTAGCT TCAAAATAAA GAATGCAAAT AATAATAGCG ATATAACTAC ACCTATAACT ATCAATTGGC AAGCAGCTCT AGCTCCTGTA ACACCAGTTA TAGATGAAAT GCTTGCTGTG GTTAAAAAAG CAACTTTATG TACAAGTTTA TATAAAGTCT TAAAAATTAT TAAAAAAGGC AAGGGGATAC ACCCAAACGA TATCAATAAA 
ATAGATGTAA AACATCCATA TGGTTACACG GCTTTACAAG AAGCTATATA TATGGGTCGA TTGGACATTG TTACTTTATT ATTAGATAAA GGTGCTGATG TAAATATGAG AAATAAGCGT GGGCAAGCAC CTATTGAATT AGCGCTTGGC AGATTTGATA TAGAGATGGT ACGTCTATTA TTAAAGCAAC CAGATATACA ATCAAGGATA AGTATTTATA ATGGTGGAGA AAAACTATTA TTAAACCTAG TCATAGAACG AGCTAATACG GTAGATAAGG AAAAATTTAC AGAACTTACC GATCTCCTAT TAGATCACTT GAATACACCT GATGTGCTCA ATAAGCAAGA TAGTATTATA AAACAAACTC CTCTCTTGTT GGCTATGCAT TATAACCAAC CTGAACTAGC AAAGAGGTTA TTAGAAAAAG GAGTTAATCC AAATATAAAA GATAATCAAG GTAGAAATGC GCTTCATCTA GCTGTTACTC ATAACCATAA AGAGTTAGCA GAACAACTGA TAGCAAAAAA TATCGAGTTA GATATAAAAG ATGATAAAGG TGATACCCCT CTGCATATGG CCGTATCTCT ATCTAGCAGC AAGGAGGTAG CTAACCTATT AATCAACAAA TTTAAAGAAA GTGGAATTAG CTTAGATATA CTGGGGTACA AAGAGGTTAC GCCTTTGCAT AGAGCTGCCG CAGCACAAGG AGATAATGTG GAAATTGTTA CAGCATTATT AGAAGCTGGT GCTCAGCTAG ACGTAATAGA TAAAGATCAG CAAACACCTT TGCATTATGC TGCTCAAAAT AATAACATCA AGGTTATTGA AAAACTGACA CAATACAACC CTAGTTTGAT AAATTTACAA GATAAAAATG GGAAAACCCC CTTGCATATG GTAGTTTCTC AAAATTATAA TACCTCTAAT GTCAAAAAAC AAATAGCGCA AACTATTAAC TTTCTAATAG ACAAAGGTGC TAGGTTAGAC ATCGAGGATA ACCAAGGGTA TACACCTTTA AATATATTGG TTACTAGAAA TTACGCAGAT ATAGTACAAA AGGTATTATA A
|
Protein sequence | MQVFKNQHLY TALASLLIIL ITISCNHCGN TNNAKEVKTL DLSISKDLLQ GSDELNFMVK ISNQDGKEAH LNKFKLKISV EEPHNFLNYI DAKGTTQIKS VIDENLTHFT VQQLLQSADK PINLEFIIQP QLASKQVNIK AALYYEGTEQ PIVEKSITWQ ETVSPYQLSF NSISASDLLE GGEEYIFKIE NQDPEYALAT DEVILSLQSK AEFTLNGCLA TEQGILLKTV LGIEKIEQAQ NVKYNLANIR LKIKDPNGQT DATSVVLTLK DKFGNKLGSD KIITWKPKSE ISFILSFDKL ELQGSELSNR QIKFTVNQSG ESILNNGELI LQLTPEQGSM ASILGANLIT DVSGKTVYIY KIKKEDIGKQ SAALSIDPQE SKEASFKAQL LYNGALLGVT QKLVWQAGAE LSFSLEGLEE KDRCILSGTQ TLKGTDILQI VIKNLSRALK KEGEAVLCVE QNEHPSNVAF EVYYNYVDKI EHDTPNISLG KHKAVTIDLY HLIANNNFVK REDDIKVALQ LLNPTSKQHA TVSFKIKNAN NNSDITTPIT INWQAALAPV TPVIDEMLAV VKKATLCTSL YKVLKIIKKG KGIHPNDINK IDVKHPYGYT ALQEAIYMGR LDIVTLLLDK GADVNMRNKR GQAPIELALG RFDIEMVRLL LKQPDIQSRI SIYNGGEKLL LNLVIERANT VDKEKFTELT DLLLDHLNTP DVLNKQDSII KQTPLLLAMH YNQPELAKRL LEKGVNPNIK DNQGRNALHL AVTHNHKELA EQLIAKNIEL DIKDDKGDTP LHMAVSLSSS KEVANLLINK FKESGISLDI LGYKEVTPLH RAAAAQGDNV EIVTALLEAG AQLDVIDKDQ QTPLHYAAQN NNIKVIEKLT QYNPSLINLQ DKNGKTPLHM VVSQNYNTSN VKKQIAQTIN FLIDKGARLD IEDNQGYTPL NILVTRNYAD IVQKVL
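The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before the values can be compared with the Gene Length, Protein Length, or GC content fields. The following is a minimal sketch of that clean-up. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three 10-base blocks of the gene sequence shown above
# (a fragment for illustration, not the full 2871 bp gene).
fragment = "ATGCAAGTTT TTAAAAATCA ACATCTATAT"

dna = clean_sequence(fragment)
print(len(dna))          # number of bases in the fragment
print(gc_fraction(dna))  # GC fraction of the fragment only
```

Applied to the complete gene sequence, the same two helpers would give a length and GC fraction that can be compared with the Gene Length and GC content rows. The fragment's GC fraction will not necessarily match the whole-gene figure.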