Gene Information |
Locus tag | Aasi_1058 |
Symbol | |
ID | 6376870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1368585 |
End bp | 1370825 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642682172 |
Product | hypothetical protein |
Protein accession | YP_001958133 |
Protein GI | 189502416 |
COG category | [S] Function unknown |
COG ID | [COG2268] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
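The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values taken from the record for Aasi_1058.
length = gene_length(1368585, 1370825)
print(length)                  # 2241
print(protein_length(length))  # 746
```

Both results agree with the Gene Length (2241 bp) and Protein Length (746 aa) fields listed in the record.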
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.506927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGAGC ATACATACTT CCTAAAATTG ATAGCATATA TTTTATTCAT CTGCTCATTT TTTCAGAATT GTAATTCTCC GGTTACTTTA CCTCTGGCTA TAAAGGATAA AGGGCGAAAC CAGCAGCGAG ACATACAGTC TATACTAGAT AAAATATTTA CAACAGAAGA AGGCCACGGC ATTAGTTTTT ATGGATCATC AAGTACATTA CAAGCCAGAG TGCAAGCACT CGATAAAATA TATGACGGCA TACCTGTAAA TGTAGATGAA GGGGTAAAGC TAACCGAAAT AGCTAGCTTA GAAGAGAAAA TACAACAAAA ACGCATTCAA ATAACATTTG AACAAGAAAA GCCTATAAGT GTCACCGTGC GTAAAGCCTG GCTGCTTGAG GAAAGCAGAC CTAATGAGGC GGTTATTTTC TGCGGTAATC CTGGTGTTGG CAAGAGCGCA CTCTGTAATT CTATTTTTCA AGAAGCGAAA TTTCCTTCAG GCACAAGTCG TGGACAGGGG CTAACTACTC ATCATCAAAC CCATATGTAT AAAAATAGAT TATATATTGA TACCCCAGGT TTGGAAGATA TAAATATGAG AGAACAAGCG GCTGCTGAAA TAGAACAAGC ACTAAAAAAA GGAGGTAATT ATAAGATGGT CTTTGTGATA GTGTTAGATG ATGGTAGGCT AAGACAAGCT GATTTAAATG CGATCAATAT AATTTGTGAT GCTATCAAGA CTCCCTTTGA ATATGGACTA GTTATCAATA AGTCTATGCA GGAAACTATT GACATAATTA TTGAAGAAGG AGGATTAGGG CCATACTTGA TAACCTTACA CAAACAGCCC TTTCAAACCA TTATGCTCAC AAGAGAGGAG CGGATGGCTG CAAAGGCTAA CGTATATTTT GAAGTGAATG AAAATAGAAG AAAATTGTTA GACCTTATCA ATAACTTGCC CGCCAGTAAC ATACAACTTG GTAAGATAGA TATCAGGAAC TTTGAGGAGA AGATCCAACA GATGGAAGAA AAGTATATAA AAGCCATGGC TGAGCTGAAG GCTCAACACA AGCAGGAAAG AGAGGAACAG GAAGTGCGTA TTACAAAGGA AAGAGAAGAG CATGAAGCGC AAGTGAGACA TCAGGGAAAA GAGTACGAGG CTAAAGTAAA AGAAATACAA GCACAGATGC AAGAGCAACA GCAGGCAGCA GAAATTAAAA GGAAAAGGGA AAGAGAAGAA GCAGATGCTC AAATTAAAAC ATTAAAAAAA GAGGTGGAAG TAAAACAGCG AGCAGCAGAA GCTCAAAGAC AGAAAGAGAA AGAAGAACAC GATGCTCAGA TTAAGAGGTT GGAAGAACAG CTGGAAGAAA AACAACGTAC TGAAGAAGCT AAAATGCAGA AAGACAAAGA AGAACAAGAA GTTCGGATGA AAAGAATGCA AGAACAGCTG GAAGAAAAAC AACGCGCTGA AGAAGCTAAG AGGCAAAAAG ACAAAGAAGA ACAAGAAGCT AAAATGGCTG AAATGAGAAG AATACAACAA GAGTTAGAAA TGCAAACTCT TGCTTTGAAT GCGCCCCATA AGGAAGCTAT TGAAACAGCA TTGGCAAAAC AGCAGAGGTA TTGTTGCTTA AATATAAGAC CATTTCTCAA GGATTCTGGC AAATTAAATA TTAATGATGT TAAGTTCTTA ATTAACCATC CACTATTTAA AGATCAGGGG AATTATCCTT ATGGTGAGGT TGATTTAAAC GATATAATAG AAGCTGACGC AGTGGTAGAG CTAACTAAAG GACTAAGAGG CACACGTGTG 
GAAAGACTTA ATTTAATTTC TAATCAAATA GGGCCTGCTG TAGCTGCTGA ACTTGCTAAA GCCTTACAAG GAACAAATGT GGGGGATGTT GATTTAAGTT GGACTCTAAT AGGAGATTTA GGTGCAATAG AATTTGTTAA AGGTTTAAAA GGAACAAGCG TACATACGGT TAATTTAAGC TACAATAAAA TAGACGACCA AGGGGCTATT GAATTTGCTC GAAATTTAAA GGGTACACAA ATATATTCAG TTGATCTAAG AGCTAATAAA ATAGGTGATA CAGGAGCCAT AGAATTGCTT AAAAGCTTAA AGGATACTCA GGTGAAAATT GTTTGGTTAA TTCGTAATAA AATAACAAGA GAAACAAAAC AATTAATAAT AAATCAATAT CCACATATTG AAACCTACTT GGAAAACGTA GGAATGGTTA TTTATGAATA A
|
Protein sequence | MVEHTYFLKL IAYILFICSF FQNCNSPVTL PLAIKDKGRN QQRDIQSILD KIFTTEEGHG ISFYGSSSTL QARVQALDKI YDGIPVNVDE GVKLTEIASL EEKIQQKRIQ ITFEQEKPIS VTVRKAWLLE ESRPNEAVIF CGNPGVGKSA LCNSIFQEAK FPSGTSRGQG LTTHHQTHMY KNRLYIDTPG LEDINMREQA AAEIEQALKK GGNYKMVFVI VLDDGRLRQA DLNAINIICD AIKTPFEYGL VINKSMQETI DIIIEEGGLG PYLITLHKQP FQTIMLTREE RMAAKANVYF EVNENRRKLL DLINNLPASN IQLGKIDIRN FEEKIQQMEE KYIKAMAELK AQHKQEREEQ EVRITKEREE HEAQVRHQGK EYEAKVKEIQ AQMQEQQQAA EIKRKREREE ADAQIKTLKK EVEVKQRAAE AQRQKEKEEH DAQIKRLEEQ LEEKQRTEEA KMQKDKEEQE VRMKRMQEQL EEKQRAEEAK RQKDKEEQEA KMAEMRRIQQ ELEMQTLALN APHKEAIETA LAKQQRYCCL NIRPFLKDSG KLNINDVKFL INHPLFKDQG NYPYGEVDLN DIIEADAVVE LTKGLRGTRV ERLNLISNQI GPAVAAELAK ALQGTNVGDV DLSWTLIGDL GAIEFVKGLK GTSVHTVNLS YNKIDDQGAI EFARNLKGTQ IYSVDLRANK IGDTGAIELL KSLKDTQVKI VWLIRNKITR ETKQLIINQY PHIETYLENV GMVIYE
|
| |
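The sequences above are printed in space-separated blocks of ten characters, so they need the whitespace removed before any further analysis. The following is a rough sketch of that cleanup together with a GC-fraction calculation of the kind summarized in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full 2241 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown in the record.
first_blocks = clean_sequence("ATGGTGGAGC ATACATACTT")
print(len(first_blocks))                  # 20
print(f"{gc_fraction(first_blocks):.0%}")  # 40%
```

Passing the complete gene sequence to the same functions would give a whole-gene value that can be compared with the reported GC content.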