Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0871 |
Symbol | |
ID | 6377070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1104977 |
End bp | 1106917 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642682008 |
Product | hypothetical protein |
Protein accession | YP_001957969 |
Protein GI | 189502252 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.985956 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAGGA AATACAATTT AACGCATCAA TTAGTAATCT GCCTTTTGCT TACCAACTTA TTTTTGCAAA TGCAAAGTTG CGGCAATTCT CCTTTGCCTA ATCCTATGGA GAAAAATCAA AATACTACAG TACAAAATAT AAATAGGCAA AGGGGTAAGG AGCAAACCCT GGTAGGAAGA ACCACTAGTA GTTCATCTTC TGCACCAAGT ACTGAATATC AAGAGGTAGC TACAACTGTC CCTACCTATG AGCATCCATC TGATCAGGAA ACTTTACAAG GAAATGGTTC GTCTAATAGC GGTAATAGTA TGGGACTAAG GCGTAGAAGA AAAGATAAAA AAGGTAAAGA GAAAGTATTA GAAGGAGAAG AAAATTCTAT GACTAATAAA AATAAAAGTC ATGGTGGTAA AGCTACAGAA ACAAGACTCA GCAAAATCAT TGGTAAGTCA GCAAGTGAAG CAATTCAAAA AAGAAAGGAA AAAGGAAATA ACATGTATTC CTTGCATGAG GCTATTGAAA GTATGGATAT AGAAAGTATT CAAGCATTAA TAGAGGCAGG CACTAAAGTT AATTTTAAGG ACATAAATGG GAATACAGCT TTGCACCTGG CTATCAAGCA TGTTGACATA TTTCTAAATA ATTATTTACA ACCTTTATCA GAAACATATA CCACTCCTAT CTTTAAAAGT GTTGATCGTA CTAGTCTTGT ACATTGTTGT TTGGCAGCTA TTAAGAAAAG TTATATAGAA GCAATAGTTA GACGATTGAT AGAACTAGGT GCTGATATAA ATGCTAGAAA CAAGCAAGGA GAAACACCCT TGCATATAGC TGTACAAGTA AGCAGTGAAG AAGGAATAAA GCTACTACGT GAAAAAAGCG CAGATATTAA AATCAAGGAT ATACACGGCA ACTCTCCGCT GCATCATGCT GCTGTAGCTG GACAGTTAGA AATAGTTGAG CTGTTGATAA AGCAATGGGG TTATGATATA GTAACTAGCA AGAACAACAA TAACGAGACA GTATTACACT GGGCTGCTAA AGGAGGAAAT CCAGAAGTAG TTGAACTTTT AATAAGGCAA GGTATTAATG CAGAAACTAA AGATAAGTCC GGTAATTCTC CGCTACATTA TGCTGCTGAA GCAGGACAGC TAAAAGCAGT TAAATTACTG ATAAAAGAGT GGGGCAGCAT TATAAATGTT AAAAACAATA ATAATGAGTC TGCATTACAT CATGCTGCTA AAAAAGGTCA CGTGGCAGTA GCACGATTTT TAATAAAAAA GGGAATTACT ATAGACCGTC AGAATAAGCA TGGTTATAAT CCATTAAGTT TGGCTGTTGA AAATCACCAT GCAGCAGTAA TCAATTTCTT AAAAGAGAAG GGGGCAAATA TAGATACTGT AGATGATGAA GGTCGTACCC CCTTACATTG GGCTGCTTTA CAAGGCCATA CAACATTAAT CAAGCAATTA AAAGAGCAAG GCGCAAATAT AGAAGCTAGA GATCAAGATG GTTATACACC GCTACATCTT GCTAGTGGAA GAGCTCGGAT GGAAGCAATA AAAATGTTAC AAAAACAAGA GGCTGATATA TTTGCAAGAG ACCATATTGG GTTTACTGCT CAGCAATTAA TAGAACAGCG TCCAACACCG TGGGGTATTT CGTTTACTTA TATAATGTTG GCAGGGTTGT TATACGAATT TTTAATTGCC TGTATGTTTC ATCCTACTGC AGCACTTGTA TTTTTCTCAG TAATAATTTC TATATTAGCT GCTAACTATA ATCTTTATAG AAGGTATATA AGATTTTATA TGAGTGCACG TAGCATGAAT ATTTTAGACC GTATTGCTGG AAATAGGCTT GTACGATGGT GTAATTCACT ACCTGCTATA CTAGGTATAT TAATGCTTCT CTATACGCTA ATAACACATT ATTTCGGTTA G
|
Protein sequence | MQRKYNLTHQ LVICLLLTNL FLQMQSCGNS PLPNPMEKNQ NTTVQNINRQ RGKEQTLVGR TTSSSSSAPS TEYQEVATTV PTYEHPSDQE TLQGNGSSNS GNSMGLRRRR KDKKGKEKVL EGEENSMTNK NKSHGGKATE TRLSKIIGKS ASEAIQKRKE KGNNMYSLHE AIESMDIESI QALIEAGTKV NFKDINGNTA LHLAIKHVDI FLNNYLQPLS ETYTTPIFKS VDRTSLVHCC LAAIKKSYIE AIVRRLIELG ADINARNKQG ETPLHIAVQV SSEEGIKLLR EKSADIKIKD IHGNSPLHHA AVAGQLEIVE LLIKQWGYDI VTSKNNNNET VLHWAAKGGN PEVVELLIRQ GINAETKDKS GNSPLHYAAE AGQLKAVKLL IKEWGSIINV KNNNNESALH HAAKKGHVAV ARFLIKKGIT IDRQNKHGYN PLSLAVENHH AAVINFLKEK GANIDTVDDE GRTPLHWAAL QGHTTLIKQL KEQGANIEAR DQDGYTPLHL ASGRARMEAI KMLQKQEADI FARDHIGFTA QQLIEQRPTP WGISFTYIML AGLLYEFLIA CMFHPTAALV FFSVIISILA ANYNLYRRYI RFYMSARSMN ILDRIAGNRL VRWCNSLPAI LGILMLLYTL ITHYFG
|
| |