Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1767 |
Symbol | |
ID | 6376961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1255049 |
End bp | 1258162 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003573165 |
Protein GI | 294661289 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.636968 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTTAT TAGTAATTGG AAACGCCATG AAAATGTTTA AAATTAATCC TTCACTATTT CTAAGTAGAA AAACCATTCT TTTTATACTT AGCTTACAGC TTATTGGTTG TGGCCATAAC TCACCTGATA ATCAGAAAGA AGCACGTTTG CTCATGGAGA TAAATCCCCA CCATTTAATA GGTGATCAGA AAGCCATTGA AGCCTGTTTT TCATTAGCTG AGCATAAGCA ACATGTTATG CTTAATAAGT ATAGGTTAAA AATTAGCTTA AGTGGTAGCG CGAACCAATT GCTGTTTGAA AAAGACACCG AAACTACAAG AAGTTTCCAG AATACGACCC AGAATTTAAC TTATTTTACA TCCCAAAGAG AGCTAAACCT GGAAGATGAA CTATTAATTG TCCCCTTCAC CTTATTGCCA GCTGCAACTA CTGATAATGT AATCATTAAG TTTGAACTTT TAGATGAAGA TGGTAGCATC ATACAAAAGC ATAGGGTTGA TTGGAGTAGC CATACCAAAC AGACGAGCTT ATTACTTCCA AGCAGTGATC TACCAGAAGG GGAAATTAAA ATTTTAACTA GTACTTTAAG TAAACGTCCT TTGTTGCCTA CTTTACCATA TCCAGTTGAA AAGACTAAGG AAATGCAGGA AGAAGAATAC CATGATCTGA TATGTAGTTC TGGTTCAGAA ATAATAGATA GATCCTTACA ATATCCAGCC CAAAAGAAAT ATAAGAAAAG CTTACCAAAC CCTAAAGAAT CGACAGAATC TTATGATACT TTGAGTATAC GTGCATTAGC TGAGCGAGCC GATCATAACG ATACTCAAGC ACAAGAAGAA ATTATTAGGC GTTGCTTACA AGTGAGTATA AATCCGCTTA TAAAAAAAAT ACTCCATCCT TTTAGCTGGC CAGGTATACA AGAAAAAGCA AAACAGAAGC AAGAATATGC ATACTTACTG CTATGCTTTT CAAAACAAGC AACAGATTAT ACGATTTATC AAACATTGAT GGAACATGTA AAAGAACAGG CACAAGCAGG AGATCCTTTA GCTCAAACCA ATTTAGGGTA TATGTATAGT GAAGGATTAG GTTTTCCAGT TGATGCTCGG AAAGCTATTG AATGGTATAC TAAGGCTGCT CATCAAGAGT TTGCCATAGC ACAATGTCTT TTAGGGGATA TATATTATTT TGGAAAAATA GTTTCATGTA ACTATCAAAA TGCATTGAAA TGGTATAAGA AAGCTGCTGG AAAGGGGTAT GCTAAGGCAC AAAATGCTTT AGCATATATG TACGAGGAGG GATTAGGCAT CCAAAATAAG AGTGAAAGAG CTGTCGAGTG GTATACCAAG GCAGCTATGC AAGGAAATAT AACTGCTCAG TATAACCTGG GTAGAATATA TTACAATGGT AAAGGTGTAA GGCGGGCCTA TAACAAAGCA TTTAAGTGGT ACCATAAGGC TGCTAACCAA GGCAATATAA AAGCACAAAC TAAATTAGGA TATATGTATG CTAAGGGCTT GGGGATTGAG CAAAATCTTG GAAACTCAGT AAAATGGTAT AACAAAGCGG CTAATAAAGG GAATATAACC GCTCAATTTA AGTTAGGCCT TCTATACAAA AAAGGAGAAG GGGTTGCTCA GGATTACCAT AAAGCATCTG AGTGGTTTAC TAAAGCGGCT AACCAAGGGC TTGTAAAAGC TCAATATAGC TTAGGATGTC TCTACTATAA TCTAGGTGAA AGCATTGAGC ATAACTATCA ACAAGCTTTT AAATGGCTTA GTAAAGCTGC GAATGAAGGT CATGCGGAAG CTCAATTCAG CCTAGCACGT CTCTTTGAAG ATGGATTAGG GGTCGAACAA GATAAACAGG AGGCTATAGA GTGGTTTACT AAGGCAGCTA ACCAGGGTCT TGTAAAAGCT CAATATAGTC TAGGTCTTCT CTATGAAACA GATGAAGACA TTGGACATGA TTACCACAAG GCATTCGAAT GGTACAGTAA AGCAGCAAAT CAAAATGACG CAGTGGCACA ATCTAGTTTA GCATTTCTCT TTATAGATGG CTTGGGGGTT GAGCGAAATG TGCAGCAAGC CATAGAATGG TTTACTAAGG CTGCTCAACA GGGGGTTGTA GAGGCACAGT ATAATTTAGG GATTATCTAT AAAAGGGGGG AGGATATTGA GCGTAACTAT CAAAAATCAT TCGAGTGGTT CACTAAAGCT GCTAGTCAAG GCAGTGTAGC TGCACAAAAT AAGCTGGGAA GTATTTACAA AAAGGGTTTA GGAAGAGAGA AAGATCTGAG CCAAGCCATC TTCTGGTGGA TGAAGACACG AAATACAGAC AAGCTGATGC ATATCTTTAA TGTTAATGCT ACACTTCTTC CTTTAGTAGC TACTCTGCCA GATAATCAAA CTACTAGCGT TGAAGCAGAA ATGCAGTCTG ACGAAGCAGC TGAACTTTTG AGTACTTGTC AGGCAATGTT AATACAGAAT AAGTATGATT TAGCTGGCAA GCAAGCTAGT TTAAGACTTG GTTATTGTGA AGAACTCGAA AAAATAATTT TACAGCTTAT CGATTGGAAA CAGCAACTAT CTATACAAAC TGGCTTAATG GTTAGCTGTT TATCCTTGCA AAAGTCAGAG GATACAACGG CTATAGAGAA CTATCAACAG CGGACAGGCA TTGTGCCTTT TGTTAAGCAA CATATCTTAG CTGAAAATAA AACTTGTCTT TCTTTTGGCC AGGCGAATGT AGAATTAGCT GACCAAATTA TTACAGAGCT ACAGCATAAG CGTACTTATA GAAGTTTTTA TGGGTTAATT AACCAACTTA AAGAGTCTTA TGAGTTGGCT CGTCAAAAAG TTACAGTTAA GGTAAGTGCT ATACAAGAGC AACTACAGAA GTCTGTTGTA GAAGAATCAG AAAAAGCAGT GCTTCTAGCC ACATTAGCTA GAAAAATGAA ATTAGTAGAT ACATTTAATA CTACTTTACA AACATTAGAA GAAATACCTA TACAGTTTAA TACTTACTAT AACTTGCTAT TAGAAGAAAT TGATAAAGGA CAGGCTATCC GTAATCAAAA GTTTAAAGAA GAATATATTT ATCTTTTTCA ATAG
|
Protein sequence | MHLLVIGNAM KMFKINPSLF LSRKTILFIL SLQLIGCGHN SPDNQKEARL LMEINPHHLI GDQKAIEACF SLAEHKQHVM LNKYRLKISL SGSANQLLFE KDTETTRSFQ NTTQNLTYFT SQRELNLEDE LLIVPFTLLP AATTDNVIIK FELLDEDGSI IQKHRVDWSS HTKQTSLLLP SSDLPEGEIK ILTSTLSKRP LLPTLPYPVE KTKEMQEEEY HDLICSSGSE IIDRSLQYPA QKKYKKSLPN PKESTESYDT LSIRALAERA DHNDTQAQEE IIRRCLQVSI NPLIKKILHP FSWPGIQEKA KQKQEYAYLL LCFSKQATDY TIYQTLMEHV KEQAQAGDPL AQTNLGYMYS EGLGFPVDAR KAIEWYTKAA HQEFAIAQCL LGDIYYFGKI VSCNYQNALK WYKKAAGKGY AKAQNALAYM YEEGLGIQNK SERAVEWYTK AAMQGNITAQ YNLGRIYYNG KGVRRAYNKA FKWYHKAANQ GNIKAQTKLG YMYAKGLGIE QNLGNSVKWY NKAANKGNIT AQFKLGLLYK KGEGVAQDYH KASEWFTKAA NQGLVKAQYS LGCLYYNLGE SIEHNYQQAF KWLSKAANEG HAEAQFSLAR LFEDGLGVEQ DKQEAIEWFT KAANQGLVKA QYSLGLLYET DEDIGHDYHK AFEWYSKAAN QNDAVAQSSL AFLFIDGLGV ERNVQQAIEW FTKAAQQGVV EAQYNLGIIY KRGEDIERNY QKSFEWFTKA ASQGSVAAQN KLGSIYKKGL GREKDLSQAI FWWMKTRNTD KLMHIFNVNA TLLPLVATLP DNQTTSVEAE MQSDEAAELL STCQAMLIQN KYDLAGKQAS LRLGYCEELE KIILQLIDWK QQLSIQTGLM VSCLSLQKSE DTTAIENYQQ RTGIVPFVKQ HILAENKTCL SFGQANVELA DQIITELQHK RTYRSFYGLI NQLKESYELA RQKVTVKVSA IQEQLQKSVV EESEKAVLLA TLARKMKLVD TFNTTLQTLE EIPIQFNTYY NLLLEEIDKG QAIRNQKFKE EYIYLFQ
|
| |