Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1844 |
Symbol | |
ID | 6377414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1503015 |
End bp | 1506032 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003573216 |
Protein GI | 294661340 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0832694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTTTATT TTAGCAAGCA TATTATGAAA ATATTTCGTA CTATCAATTT ATGGATTGGC TGGGGACTCT TTTTTCTAGC CATGCTCGTC TATACACTAA CTATAGAGCC TACAGCTAGT TTCTGGGATT GCTCTGAATA TATTGCTGCT GCTTATAAAT TACAGGTAAC GCACCCACCT GGTGCTCCCC TATTTCTTCT AATTGGCAGG ATGTTCTCTT TTTTAGCTGG TAATAACACA GAGAAAGTAG CTTTTTGGAT CAATATGAGT TCGGTAATAA CTAGTTCGGC TACTGTAATG GTAGTATTCT GGATTATTTC TTTACTAGCT AGACGAATTA TAGGTAAGAC AACACAAGAT TTACAACTTT ATGAAGCAGC ATCCATATGG GGTGCTGGCA TAATAGGTGT GCTGTCGCTA ACTTTCTGCA GTACCTTTTG GTCCAATGCT ACAGAGGCTG AGACATATGC CTGCTCTACA CTCTTAATGT CCCTTACAGT ATGGGCTATG TTAAATTGGG AGTATACAAC ACCCAGACCA CGAAGCTATC AATGGCTTTT ATTAGTCGCT TACTTGATAG GATTAAGTTT AGGAATACGG ATGTTTAGTG TATTGACAAT ACCTGCTTTG TGCCTCATAT TTTATTTTAA GCGAGTATCA AAAATCACTT TGCTGGGTAC TACTATAACA CTTTTAATTG GTGGAATACT TTTAGCCTTT ATATATACTG GCATTACCTT GAGCTTACCT ACCTGTGCCA TGCAGTTAGA ATTACTTTGC GTTAATCAGT TAGGTTTGCC TTTTAAAAGT GGCATCATTA TACTAAGTAT TACACTAATA GCTAGTTTAA CTTATGGTAT CATTTATACT ATACAAAAGC AGCATACCAC AATACATATA GGATTGCTAT GTTTAGGATT TATTTTAATA GGATACTCTT CTTATGGACT AGTGCCCATT CGTGCTCATG CCAACCCTCC TATCAATGAA GGGCATCCAA GCGATATTAT TAGTTTTATT AACTATCTCA AGAGAGAACA ATATGGCCAT AGACCTTTGG TATACGGACC ACATTTTGCT GCCCAGGTCA TAAGTGCTAA AAAAGGTGAC CCTATTTATA GAAATACTGG GAAAAAATAT GAGATTATTG ACTATAAGCA TATCCCTATT TACGATGCTG GAGCCTATAC GCTTTTGCCT AGGACTTGGA GTCAGCAAAA TTCTATGCAT ATAACAGCTT ATAGGAAGAT TCTTAATCTT AAACCTTGGC AAAAACCTAG TTTGGGAGAT CAGCTATATT TTCTCATAAG GCACCAGCTA GGACATTTCT ATTTACGTTA TTTCTTATGG AATTTTGCAG GACGTGCAAG CGACATGCAG GGTGCTTCAT GGCTTACACC ACTAGATGCT TTTGAGAAAT TGCCGCCTAG CTTAACACAA ATACCTGGAA GAAGTAATTA CTTATTCCTT CCATTCCTAT TAGGCCTAAT AGGAATGCTT TTCCAGTATA GGCATGATAG ACGTTATTTC TGGGTAATAA CTATTTTATT TGTGATGCTA GGAGCAGCAT TAGTAACTTT TTTAAATCCT CCTCCTATTG AACCACGTGA AAGAGACTAT ATTTATGTAG GTTCATTCCT GTTTTTTACA ATCTGGATAG GCCTAGGTAC ATTAGCTGTT GTAAACTATT TCAGGAAACT ATTTACACAA TATAAAATAG CTGTTACAAT AGGTATTATT AGTTGCCTAG CAGTACCTAG CATTATGGCT ACGCAAGCTT GGCAAACACA TAATCGTTCT CAGCGTTACT TTTCAGTAGA AAGCGCCAAA AATTTACTAG CCTCCTGTGC CCCTAATGCT ATACTCTTTA CAGCAGGTGA TAATGATACT TTCCCGCTAT GGTATGTACA GGAAGTAGAG GGCTTTAGAA CAGATGTACG AGTAGTTATC CTTAGCTATG CTAATGCAGC CTGGTATATT AAGCAACTCA CACGCCCAGT AAACAATTCA GCACCACTAC CTTTATCTCT TCCATTTGAA ATTTACCAGC AATATGGGCT TAATGATATT TTACCGTATG TACCACAACC CAATATACAA GAATTAGATA TCATACAATA TCTCCAACTT ATCCGTGAAT CACATCCAGC CTTGCAAATA CAGAACATAT TAAGAGAAAC TACCAATACA TTACCTTGCA AGAATATGTG TTTCCATATC GATAAAACAG GAATAGCTGC TAAAGAAATT GTACCAACAC AATATGAATA TTTAATTCCT GAAAAAATGA GCTGGTCTAT AAAAGGTAGA GGATTAGATA AAAGAGACTT GCTTATACTA GATTTACTAG CGACTAACAA CTGGGAAAGG CCTATTTACT TTAATCATAG TTCATTACAT ACCTTAAATA TAGACCTAAG TACCCATGTA ATGGTGGAAG GCTTAACACT CCGTTTAATG CCTATACAGA ACAATATAGG TCACGAGCTA GTCAATACCG AAACAATGTA TAATAATATG GTGAAAAACT TTTATTGGAA AGGAATGGAT AAGCCAGGAG TATATTATGA TGAAAATTAT AGACTAGTAT TTATCCGTAA CCAACGTATG AGTTTTTGTA CGTTAGCTAA AGCATGTTTA CATGAAGGAA AATTGCAACA AGCCAAAGAA GTACTATTAT ATGGCTTATC GGTAATACCT GATGAAGTGG TACCATATGA TATAGCCAAC GTGTATATGA TACATTTGCT CTTTGAAGTA GGAGAAAATG AACATGCCTT AAATATGATA AAAATTATAG GCAACAGAGC TGAAGAAATA CTAACCTACA AAACAAGAAA AAGTAGTTTT ATAGATAGAG AAGTACAGGA ACAGATGGGG ACATTATATG AAATAGCTAG AAGCCTAAGA GCAATAGATT ATCAAGAGTT AGCACAAGAA TATGAAGACC TTTTAAACAA ATACCAAATT TTACTTGATG TACCTGATGA TAATAATAAT GATATAGCTA GACGCTAA
|
Protein sequence | MFYFSKHIMK IFRTINLWIG WGLFFLAMLV YTLTIEPTAS FWDCSEYIAA AYKLQVTHPP GAPLFLLIGR MFSFLAGNNT EKVAFWINMS SVITSSATVM VVFWIISLLA RRIIGKTTQD LQLYEAASIW GAGIIGVLSL TFCSTFWSNA TEAETYACST LLMSLTVWAM LNWEYTTPRP RSYQWLLLVA YLIGLSLGIR MFSVLTIPAL CLIFYFKRVS KITLLGTTIT LLIGGILLAF IYTGITLSLP TCAMQLELLC VNQLGLPFKS GIIILSITLI ASLTYGIIYT IQKQHTTIHI GLLCLGFILI GYSSYGLVPI RAHANPPINE GHPSDIISFI NYLKREQYGH RPLVYGPHFA AQVISAKKGD PIYRNTGKKY EIIDYKHIPI YDAGAYTLLP RTWSQQNSMH ITAYRKILNL KPWQKPSLGD QLYFLIRHQL GHFYLRYFLW NFAGRASDMQ GASWLTPLDA FEKLPPSLTQ IPGRSNYLFL PFLLGLIGML FQYRHDRRYF WVITILFVML GAALVTFLNP PPIEPRERDY IYVGSFLFFT IWIGLGTLAV VNYFRKLFTQ YKIAVTIGII SCLAVPSIMA TQAWQTHNRS QRYFSVESAK NLLASCAPNA ILFTAGDNDT FPLWYVQEVE GFRTDVRVVI LSYANAAWYI KQLTRPVNNS APLPLSLPFE IYQQYGLNDI LPYVPQPNIQ ELDIIQYLQL IRESHPALQI QNILRETTNT LPCKNMCFHI DKTGIAAKEI VPTQYEYLIP EKMSWSIKGR GLDKRDLLIL DLLATNNWER PIYFNHSSLH TLNIDLSTHV MVEGLTLRLM PIQNNIGHEL VNTETMYNNM VKNFYWKGMD KPGVYYDENY RLVFIRNQRM SFCTLAKACL HEGKLQQAKE VLLYGLSVIP DEVVPYDIAN VYMIHLLFEV GENEHALNMI KIIGNRAEEI LTYKTRKSSF IDREVQEQMG TLYEIARSLR AIDYQELAQE YEDLLNKYQI LLDVPDDNNN DIARR
|
| |