Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0738 |
Symbol | |
ID | 6376765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 946872 |
End bp | 949889 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642681884 |
Product | hypothetical protein |
Protein accession | YP_001957850 |
Protein GI | 189502133 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGAC TGATACTGTC TCACCTAGTA ATAAGTTTCT TTCTATTATT AACATTAGGA TGTGGTGGCA ATAACCCTAA CCCTGCAATT ACAAATGATT CGTCAGAATT AGTGAATGAC GATAGTTTAT CACTGCCAAA TACCCTACCT ATATCGCCCA TTAATAACCC AGTAATAGCT AGTTCACCAA GCTTACCCGT AGAGACTGAT GCCGTGGCTG TACCAAGTAA TCAAAATACG ATCGAAGAAG ATGTTTCAAA TGGTTCTAAT ACAATGACAG TTGATGAACG GAAGGCAAAT TTAGGCTTAC TTACCCCACA ACCTAGGGAT CAACTAATCC ACTTTCTATC TCCTAGAGAG AAGATAAATT TAGGTTTAAC TATCCCACTC TTGGCCGAGA TCATCCCCTA TATGTCTCCT GGTCTGATAA ATAAACTCAA AAGAGTAAAT GACTTTTTAA ATATGCATAT AACCGAGCTA ATAAAAGAAA AAATAATTAA GGATCTATAT GAAAAATATG ATGAAAATAA AGATACTAAC GCAACCTTTT TACAGCTAGC TGTAAGAAAA GGAAATATAG AAGCAGCTAA ATTCTTAATA GGTAAAAATA GTCTAAATAA TAGAGATGAA TATCATAAAA CTCTTCTACA TGAAGCTGTT ACGAACGAAC ATATAAATAT GGTCGTATTT TTAATAGCAA AAGAAGCTGA TATAAACACT AAGGATAAAG ACGGCAATAC TCCTCTCGAT TTAGCCTTTG AGCATAAGAA TATAGAAATA ATGAAATTAC TCTTAAAAAA AGAAGGTAAA TTTCGAGATG ATGCTGATGA CAAGAAAAGA AGCCATTTGT TGAAAATTTT AAATAATGAT AATAGGCCAC TTGTAGTAAT GGGGCTAACC TTACTGCACT TATTTAATCA TAATAAGGAA TACACCTCAA AGACGAATGC CTCACAGGAT GCTATTGATA CAGGAAATAG CAACCATGTA AACACATCTC CATATATAAA CGCAAGTGCT TTGCACCTTG CTATATTAGA AGGTAATTTA GAAACAATTA AGTTACTAAT AAATCAAAAA GCAGACATAA ATTCAAAAAT CGGAGAGAAC TATACACCTT TACATGTAGC TGCTTACATA GGAAGAAAAG ATATAATAAA ATTATTAATA GATAGCAATG CTAATATCCA TGCTAAGTGT AATGATGGTA ATACCCCCTT ACATTATGCT ACTATGCTCA GTCATATAGA AGCAGCTAAC TTATTATTAG AACAGGAAGC CGAGATTGAG ATGCCAAATG ATTTATGGGA AACACCACTA CATATAGCTG CTGAACAAGG CCACTTAGGA ATGGTTAAGT TATTAATAGA AAAAGGAGCT GACTTTAACA CGCAAGACAA AGAGGAAGAA ACACCTTTGT ATAAGGCTGT TAAAGGTGGA AAGATAGAAG TAATTAAATT TTTATTATTT GAAGGAGCAG ATATAAATAC AAAAAATATA CATGGTTATA CACTCGTGCA TATAGCAGCC GAAAAGGGGC ACTCAGATAT ATTGATGTTT TTGTTAAAAA ACGAGAATAT ACATGTACAA GTTAGAGATA ATCGTAATCA AACTCCATTG CATGTAGCTA TTGGTAGTGG CAATTTAGGA GTAGCAGGAC TGTTACTAAA TTATGGTGCT AGCATGTGTG ATAGAGATGA TCAGGGAGCT ATTCCTTTAC ATTTAGCTGC TTTAAATGGC AACATGGAAG CAGTTAAGTT GCTAACAAGC ATAGGCCCCT TACCCCAACA TATAATTGAA AATGAAGAAT CAACCACACT AATTATACAA ACAAGGTTAG GCATAAATAC GAACAATGAG CTTGGATGTA CTCCCTTGCA CCATGCTGCT AGCAATGGCT ATATAGAAAT AGTCCAATTA TTACTAAAAA AAGGAGCAGA TATAAATATT AAGAATAAGG AAGGGTTTAC TCCCTTATAC TTGGCAGTCA TGAATAATAA TGATATACAT TTGATAACAA CTTTAATAAA GACAGGAGCT GATATTAACA TTCAAGATAA CCAAGGTAAT ACCGCTTTGC ATTTTATAGT TCAAAAAGAG CGTTTTGAAT TAATTAGATA TTTTCTAAGT AATGACCCTA ATGTTAATAT TAAAAATACA AAAGGGCAAA CTCTTTTGCA TATAGCTACC CAGCTGGGCA ATATAGAAAT GGTTAAAAAA TTAATAGATA AAGGGGCTGA TATTAGTATT CAAGATAACC AAGGTAATAC TGCTTTGCAT TTTATGTTTC AAAAAGAGCG TTTTGAATTA ATTAGATGTT TTCTAGATAA TGCACCTAAT GTTAATATTA AAAATACAAA AGGGCAAACT CTCTTGCATA TAGCTACCCA GCTGGGCAAT ATAGAAATGG TTAAAAAATT AATAGAAAAG GGAGCCAATG TAAATATTAG CATAAACCAC CATGGGCAAA CCCCTTTACA TCTAGCTCTT GAAAAAGGAT ATACAGGAAT AGCTAGACTT TTAATAGAAA ATGGCGCTAA TCTAAATGCC AGGTATAAAT ATTTTAATAC ACCAGTCCGT TTAATTCTTA AAAAAGGATA CACAGAATTA GCTGGTCTTT TACTAGAATC GGCAGATAAG CAACGTAATA GCCCCCTACA TCTGGCTGCT CAAGGAGGTT ATACAAGAAT GGTGCAACAT TTAATAGATG CAGGCGCAAA GATTAATTTA GATATTGATT TTACGAATCG AGATGGCAGA ACACCATTGC ACTTATCTGC AAAACATGGC CATAGAGCTA TAGTCCAATT ATTACTAGAT GCAAATACTA ACATTGATGA ACAAGATTGT TTTGGGCTTA GTCCTTTACA TCTAGCTGCT CGAGAAGGCC ATCAAGAAAT TGTTGAATTA CTAATAAGAG TAGAGGCAGA TCTTAACCTA CAAAATAATG CTGACCATAC AGCCAGAGAT TTAGCTATTC AAAAAGGGCA TACGGCTATA GCAGGCTTAT TGCCTTAA
|
Protein sequence | MQRLILSHLV ISFFLLLTLG CGGNNPNPAI TNDSSELVND DSLSLPNTLP ISPINNPVIA SSPSLPVETD AVAVPSNQNT IEEDVSNGSN TMTVDERKAN LGLLTPQPRD QLIHFLSPRE KINLGLTIPL LAEIIPYMSP GLINKLKRVN DFLNMHITEL IKEKIIKDLY EKYDENKDTN ATFLQLAVRK GNIEAAKFLI GKNSLNNRDE YHKTLLHEAV TNEHINMVVF LIAKEADINT KDKDGNTPLD LAFEHKNIEI MKLLLKKEGK FRDDADDKKR SHLLKILNND NRPLVVMGLT LLHLFNHNKE YTSKTNASQD AIDTGNSNHV NTSPYINASA LHLAILEGNL ETIKLLINQK ADINSKIGEN YTPLHVAAYI GRKDIIKLLI DSNANIHAKC NDGNTPLHYA TMLSHIEAAN LLLEQEAEIE MPNDLWETPL HIAAEQGHLG MVKLLIEKGA DFNTQDKEEE TPLYKAVKGG KIEVIKFLLF EGADINTKNI HGYTLVHIAA EKGHSDILMF LLKNENIHVQ VRDNRNQTPL HVAIGSGNLG VAGLLLNYGA SMCDRDDQGA IPLHLAALNG NMEAVKLLTS IGPLPQHIIE NEESTTLIIQ TRLGINTNNE LGCTPLHHAA SNGYIEIVQL LLKKGADINI KNKEGFTPLY LAVMNNNDIH LITTLIKTGA DINIQDNQGN TALHFIVQKE RFELIRYFLS NDPNVNIKNT KGQTLLHIAT QLGNIEMVKK LIDKGADISI QDNQGNTALH FMFQKERFEL IRCFLDNAPN VNIKNTKGQT LLHIATQLGN IEMVKKLIEK GANVNISINH HGQTPLHLAL EKGYTGIARL LIENGANLNA RYKYFNTPVR LILKKGYTEL AGLLLESADK QRNSPLHLAA QGGYTRMVQH LIDAGAKINL DIDFTNRDGR TPLHLSAKHG HRAIVQLLLD ANTNIDEQDC FGLSPLHLAA REGHQEIVEL LIRVEADLNL QNNADHTARD LAIQKGHTAI AGLLP
|
| |