Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1878 |
Symbol | |
ID | 6377375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1602558 |
End bp | 1604978 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003573238 |
Protein GI | 294661362 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTCG TTGCTGTTTC TTGTAAAGAC TGCAATAATG GCAATCCTGC ACATACTCCT AATCCTGATA TAGAAGATAC AAAACCTGTT TACCAATTGA TTTTAGATGG TATAGGTGCT TCAGCTTCTT TAGAAGGCAT TGAAACATAC TCCTTTTTTA TAGTAAATAA CGATAACCAA AATACAGTTC CTTCAGATGA AATCTTACTC TCATTAGAAA GTGAGATAGA ATTTACACTA AATAATTATT CAGTAGATAG TCAAGGGCTG ACACTCAAAA AAATTCTAGA AGTAGACAAA ATTAAACCAA GTGCTCCTCA AGAAGTTATT TTAAAATTAA AGAATGCTAA AGATAGGAAC GTTGCTGCAT CCATTAAGTT ACAATTAAAA GACAAGACAG GTAATAAAAT AGGAGAGGAG AAAAGCTTAG TATGGAGGCC TAAAGTAGGA ATTCAGTTAG ACCTACAGAT AACTAATAAA AACTTGAAAG GTAATGAGAA GGAAAATAAA AAAATTGAAT TTAAGGTAGC TTCATTGGGT ACAATCATGC CTGATAAAGA TGCAATAAGT TTAAATTTAA TACCTGAAGA TGGTATTACT GCTACCATTG TAGGTGCTAG CCAAGCAACT GTGAATGGTA AAATAATTTA TACTTACCAA GTTAAAAAAG AAGATATAAA TAAAACAATT ACTTCATTGA GTATTGACCC ACAAGGAAGT AAACAAGCTA GCTTTAAAGT ACAGCTTGTA TATGATGGTA ATCTGGTTGG GCCTGAACAA ACTTTGACAT GGCAAGCTGA CGAACCATTA GCATTTGCCT TTGAAGCATC AGAAGAAAAA TACAAGGATA TTGTAACAGG AATGAAATCA TTAATAGGTA CAGATATAGT GGAGATATCT ATTAAAAACT TAGGGAAAGA TACAGAAGAC GACGAAGTTC TGTTATGTGT TGAACAGGAT AGCCCAGTAG AGGAGATAGC GTTTGAAGTA TATTATAACT ATCAAAATGT TGACCAAATA GGTGCCGGAA CCCCTTCCAT AGCTTTTAAA AGTCAGAAAA AAGTAGATGT GGAATTGTTA AATCTTTCTA ACGATGGTAC TGCTATTAAA AAAGGAGATA CTATTAAAAT TGCATTGCAA CTAATAGACC CTAAAGTTAA GCAACAGGCT AGTATTACTT TTAAAATGAA AAGTAAGAAA GATAATCAAG ACATAGCTAC CCCAGTAACT ATTAATTGGA GGGCAGTTAC AACTACTTCG AAAAGTATAG AAGTCACTCA GCAGATGCTT AATGAAATCT TTTCCAAACC TGGATGCAGA AGGTTGTACG ATGTGCTAAA AGATATTAAA GATGGTAAAG TTGGACAAGA ACTTGATATT AATAAAACAG ATTCTCACCA TAGATTAGGT TATACAGCTC TACTAGAGGC TATTAATATA GGACGAGAAG ATATAGTAAC TTTATTATTA GACAAAGGTG CTGATGTGAA TACACCAAGT ATACAAGGTG AAACGCCAAT TCAAGTAGCT ATTAGAAAGC TTGATGTAGA AATTGTGAAC TTATTATTAG AACAGAAGAA GCTTGATTCA AATATAACTT ATGAGAAAGG TAAGAAATTG CTACTGAATT TAGCTATAGA AAGCAAGAAT GTAACAGGAG ATATAGAAGA AGTTACTAAG ATAGCTGATA TGTTATTGGA TAAATTAGAT ATAGATACTA TCATTCGAAT TAATGGAAGT GGGCAAGAAT CTCCAATTTT ACTGGCTATC CAATGTAGGC GTACCCAGTT GGTAAAAAAA CTTTTAGCAA AAGGCTTTAC TCCTGATTTA AAAAATAAAC AAGGGGAAAC TGCAATCCAT TTGGTTGCTC GGTATAATCA GAGAGAGTTA GCTGAACAAT TGATAGCAAG AAATGTAGAG CTAGACGTGC GGGATAATAT AGGAAATACT CCTCTCCACA TTGCTGCAGC CCTACCTAAA AATAAAGAGA TAGCTAAGCT ATTGATTGAT AAATTTCAAG AGAAAGGAAT TAGTTTAGAT TTAGTTAATC AGCTTGGCCA AACACCTCTG CACAAAATAG CTGGCAATTC TAATGCAGAA AATATAGAAA TTATGGAAAA CTTATTGCAA GCAGGTGCCC AGCCAAATGT ACAGGATAAA AATGGCAGTA CTCCACTCCA TTATGCTATA GGTGCTAAAT ATAGGAATAT TATTGAGGAA TTAATAAGAG CAGGTACTCA GATGGATATA CAGGATAATC AAGGAAATAC ATCTTTACAC TTACTAGTTG CTAACAATTA TGTGGACATA GTAAGAAGTG TAATAGCTAA GAGTCCTAAT CTTAAGAATA TAAAAAACAA GGCTGACAAA TTGCCTAAAG ACTTGGCTAC AACTCCTGAA ATGAAGGCCT TATTTAATTA A
|
Protein sequence | MSLVAVSCKD CNNGNPAHTP NPDIEDTKPV YQLILDGIGA SASLEGIETY SFFIVNNDNQ NTVPSDEILL SLESEIEFTL NNYSVDSQGL TLKKILEVDK IKPSAPQEVI LKLKNAKDRN VAASIKLQLK DKTGNKIGEE KSLVWRPKVG IQLDLQITNK NLKGNEKENK KIEFKVASLG TIMPDKDAIS LNLIPEDGIT ATIVGASQAT VNGKIIYTYQ VKKEDINKTI TSLSIDPQGS KQASFKVQLV YDGNLVGPEQ TLTWQADEPL AFAFEASEEK YKDIVTGMKS LIGTDIVEIS IKNLGKDTED DEVLLCVEQD SPVEEIAFEV YYNYQNVDQI GAGTPSIAFK SQKKVDVELL NLSNDGTAIK KGDTIKIALQ LIDPKVKQQA SITFKMKSKK DNQDIATPVT INWRAVTTTS KSIEVTQQML NEIFSKPGCR RLYDVLKDIK DGKVGQELDI NKTDSHHRLG YTALLEAINI GREDIVTLLL DKGADVNTPS IQGETPIQVA IRKLDVEIVN LLLEQKKLDS NITYEKGKKL LLNLAIESKN VTGDIEEVTK IADMLLDKLD IDTIIRINGS GQESPILLAI QCRRTQLVKK LLAKGFTPDL KNKQGETAIH LVARYNQREL AEQLIARNVE LDVRDNIGNT PLHIAAALPK NKEIAKLLID KFQEKGISLD LVNQLGQTPL HKIAGNSNAE NIEIMENLLQ AGAQPNVQDK NGSTPLHYAI GAKYRNIIEE LIRAGTQMDI QDNQGNTSLH LLVANNYVDI VRSVIAKSPN LKNIKNKADK LPKDLATTPE MKALFN
|
| |