Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1439 |
Symbol | |
ID | 6377499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1852982 |
End bp | 1856005 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642682509 |
Product | hypothetical protein |
Protein accession | YP_001958458 |
Protein GI | 189502741 |
COG category | [G] Carbohydrate transport and metabolism [V] Defense mechanisms |
COG ID | [COG1472] Beta-glucosidase-related glycosidases [COG1680] Beta-lactamase class C and other penicillin binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00938781 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACGCT TCGCTATACC CTTTATAACT GCTATATTGT TTTTCTCCCT TACCTTTATT CCTACCCAAT CAATTCCATC AGATAAAGCA GGTTGGATAG AATATCAGTT TCAACGGCTC ACTTTGGAAG AACGTATTGG ACAGCTTTTT ATGGTAGCTG CTTATTCTAA CCAAGGCGAA AAGCATCATG AATTTATAGA AAACTTAATA CAACGATATA ATATTGGAGG ACTTATTTTT TTTCAAGGTG ATCCTATCAG CCAAGCAAAA CTTACCAATC AATACCAACT AAAAGCAAAA ACACCTTTAC TTTTAGCTAT AGATGCAGAA TGGGGGCTTG GTATGCGTCT TACTAACACA ATAAGCTACC CTAGGCAGAT GACCTTAGGT GCTATACAAG ATCATCAGCT TATTTATGAT ATGGGTGCTG AAATTGCACG CCAGCTTAAA CTACTAGGTA TACATGTTAA CTTTGCTCCT GTAATTGATA TCAATAACAA TCCAGACAAC CCAGGTATAG GTAATAGAGC TTTTGGAGAT GGTAAGGGGA GTGTAATTAG TAAAGGACTA GCCTATATTC AAGGATTACA AGATAACGGT ATATTAGCAG TAGCTAAACA TTTTCCTGGT ATAGGAGATG CTAGTAAAGA TCCGCATCAT GAACTGCCTA CTATTCCATA TGATATTACT CGGTTAGAAT CTATAGAGCT TTATCCATTT AGAAAAGCAA TACGTGCTAA TGTAGGAGGT ATTATGGTTT CTCATATCTA TTTACCAGCT TATGAAAAAA CACCTAACCG AGCTGCATCT CTCTCATCTC ATATTGTGAC CCAGCTCTTA AAAAACAAAT TGGGTTTTAA AGGACTTATT TTTACAGATG CGCTTAATAT GAAGGCTGTT AGTAAGTACT ATCAGCCTGG TGAAGTAGAC TTACTAGCTT TGCAAGCAGG GAATGATATT TTACTTTTCC CGGAAGATGT GCCTAAAGCT ATTGCACTCA TTAAATCTGC TATTGAGCAA GGTAAATTAG CTAAGGAAGT AGTAGAAGAA AAGGTTAAGA AAATTCTAGC AGTCAAATAC CAGATGGACT TACATCAGTG GAAGTCTATA GAAATAGATG GGTTGTACGA GCAACTCAAT ACACCTCAGG CTCAAGTGCT AAAGCAGAAA CTATTTGAAC AAGCCATTAC ATTAGTAGCC AACCAAGACG ATTTGATTCC CATTACCAAA TTAAATAAAC ATAAGATTGC TTCACTTTCT ATTATTAAGC AGCCTATTAC TGCAGAAGCA CAGAAATCAA TCAACCAACA GAATACAGTT GCTACTAATA AGCCATCTAC TATTTTTGGT CAGTTTTTAT CGCAATATGC TCCTGTTGCT CACTATACAC TCAATAGGAC ATCACTAGAC GTTAATATGC TACAGCAACT GGCAGATGAG CTAGAGAATT ATTCGTTGGT TATCGTAGGC TTACACGACT TAGCTGGTAA CAGAGCTAAT AAATTTGGTT TACAGCCAGA ATTATTAAGC TTTTTAACTA AACTACAGCA CGCCAATACA AAAGTTTTAA TAGTGGTTTT TGGAAGTGTT TATAGTTTAG AATTATTCCA AAACATGCAA CATCTTATAG CAGCTTACCA AGATGATCCT ATAGCAGAAC AAGTGGTGCC TCAGATTATT TTTGGAGCCT TGCCAGCTGT GGGTAATTTG CCTGTTAGTA TACCTAATGC TTGGAAGTCG GAATGGGGTA TTAGAACAAA AAGTATAAAA AGGCTTGGTT ATGCTTTGCC AGAAGCTGTG CAAATGGATA GTCGCATATT ACAAGGTATT GATAAAATTG TAGAGGATGC AATTTTAGAA GAGGTGATGC CAGGCTGCCA AGTGTTGATA GCTAGAAATG GGAAAATAGT TTTTGAGAAA GCTTATGGTT ATCATACGTA CGCAAAAAAG AATCCCGTTA CTAATACAAC ACTTTATGAT ATTGCTTCCA TAACCAAGGT AGTAGGGCCT TTGCAGGCAA TCATGTATTT AGTAAGTCAG AATAAGTTAG ATATAACACA AAAAGTTTCT ACCTATTTAC CAGAACTGTC TGCCACCAAT AAAAAAAATA TAACTATCAA GTCAATTTTG GCTCACCAAG CTGGATTACT AGATTATGGT ATAACAAGAA GCATTTTGTT TCAAAAAGAT AGTAAATTAA GCAAGAAACT GTTTAGCAAC TATCCATCGG CAAGCTATCC CAATAGAATA GGTACGGAGC TATATGCCCC TCATTTATTA AAAGAGCTTA TGTGGGATCT ATATATCAAC TCTCCAATAA AAGAAAAAGA TAAAACAAAA AAGCCATCTA AAACACATGG TTACCATTAT AATGACTTAA GCTTCCATAT TATGCATAGG CTAATAGAGA AATTGTTACA ACAGCCTATG GAGATTTTTC TTGCTAATAA ATTCTATCAA TCTTTAGGAG CTGCTCTAGT TGGTTATAAC CCATTAGAAA GAATTAGTTT ACAGCAAATA GCGCCTACAG CAGAGTGTGA CTTTTTTAGA ACTACCCCCA TTCATGGTAT TGTTCATGAC CCACAAGCTG CAATCTGTGG GGGTGTAGCA GGAAACGCTG GGCTCTTTAG CAATGCTCAT GATTTGGCTG TTATTCTGCA AATGAATTTG CAAGGTGGCT ATTATGGAGG AAAAAGATAT TTAAAAAAGA AAGTTATAAA ACAGTTTACT AGCCACGCTT TTAAAAATAA TCGGCGGGGA CTTGGATGGG ATAAACCAGA ATTGCCTACC AAGCTAGACA GCAAGCATAA GTCTAACACA TCTTTATATG CTTCTGCTGA TACTTATGGG CATTTGGGCT TTACAGGCAC GGCTGCTTGG GTAGATCCAA AGTATAATCT TGTATATATT ATACTATCTA ATAGAACCTA TCCTACCCAA GAAAATAACA AGCTGGCTGA GCAAAATATA CGTATTAAGC TACAAGATAT TGTTTACCAA GCTTTGCAGA ATATGGAACA ATAA
|
Protein sequence | MKRFAIPFIT AILFFSLTFI PTQSIPSDKA GWIEYQFQRL TLEERIGQLF MVAAYSNQGE KHHEFIENLI QRYNIGGLIF FQGDPISQAK LTNQYQLKAK TPLLLAIDAE WGLGMRLTNT ISYPRQMTLG AIQDHQLIYD MGAEIARQLK LLGIHVNFAP VIDINNNPDN PGIGNRAFGD GKGSVISKGL AYIQGLQDNG ILAVAKHFPG IGDASKDPHH ELPTIPYDIT RLESIELYPF RKAIRANVGG IMVSHIYLPA YEKTPNRAAS LSSHIVTQLL KNKLGFKGLI FTDALNMKAV SKYYQPGEVD LLALQAGNDI LLFPEDVPKA IALIKSAIEQ GKLAKEVVEE KVKKILAVKY QMDLHQWKSI EIDGLYEQLN TPQAQVLKQK LFEQAITLVA NQDDLIPITK LNKHKIASLS IIKQPITAEA QKSINQQNTV ATNKPSTIFG QFLSQYAPVA HYTLNRTSLD VNMLQQLADE LENYSLVIVG LHDLAGNRAN KFGLQPELLS FLTKLQHANT KVLIVVFGSV YSLELFQNMQ HLIAAYQDDP IAEQVVPQII FGALPAVGNL PVSIPNAWKS EWGIRTKSIK RLGYALPEAV QMDSRILQGI DKIVEDAILE EVMPGCQVLI ARNGKIVFEK AYGYHTYAKK NPVTNTTLYD IASITKVVGP LQAIMYLVSQ NKLDITQKVS TYLPELSATN KKNITIKSIL AHQAGLLDYG ITRSILFQKD SKLSKKLFSN YPSASYPNRI GTELYAPHLL KELMWDLYIN SPIKEKDKTK KPSKTHGYHY NDLSFHIMHR LIEKLLQQPM EIFLANKFYQ SLGAALVGYN PLERISLQQI APTAECDFFR TTPIHGIVHD PQAAICGGVA GNAGLFSNAH DLAVILQMNL QGGYYGGKRY LKKKVIKQFT SHAFKNNRRG LGWDKPELPT KLDSKHKSNT SLYASADTYG HLGFTGTAAW VDPKYNLVYI ILSNRTYPTQ ENNKLAEQNI RIKLQDIVYQ ALQNMEQ
|
| |