Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1501 |
Symbol | |
ID | 6376524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 224938 |
End bp | 228195 |
Gene Length | 3258 bp |
Protein Length | 1085 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003572993 |
Protein GI | 294661118 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGGC GGCGTGTAAG AAGTTTACGA GATTTCTCTA TAGGGAATAA AGATTTTACT ACAGCCACAG TAACTTCCAC CATTGTAGCT ACCTGGATTG GAGGTGGATT TATGTTTTAT GGCCTACAGA ATATTTATAA GGATGGCCTA CAATTTGTTA TACCCCTTTT AGGATCAACT TTATGTTTAC TTTTTACCGG ACAAGTCCTC GCTATTAGAA TGGGAGAATT CTTAAATAAC TTGTCAGTAG CTGAAGCCAT GGGGGATCTC TATGGGCCCA TAGTACGAGT TATTACTGCT ATTAGTGGGA TATTAAAGTC CATAGGTTCA ATAGCTATTC AATTTCAGGT GATTGCTAAA ATGTTGAATC TTCTTTTTGG CATTGAAGAG ACTTGGGCTA TTATTGCAGC TGCGTCCATT GTTATTCTTT ATTCAGCTTT TGGCGGTATC CGTTCAGTTA TTATTACGGA CTTATTTCAG TTTATTGCCT TTGTTATTTT CATTCCTATA TTAGGCCTGA TAGTTTGGAA TCATATTAAA AATCCTGGTC AAGTTCTTTC TACTCTAACT ACCAACCCTA TTTTTAGTTT AGAACAAACA ATAGGATGGA ACCCCAAATT TATAAGTTCA ATAGGGCTAA TGCTTTATTT TTTAATTCCT GGCATGGCCC CTGCTATCTT TCAGCGGGTA ACAATTGCTA GAGATCTTGA ACAAGTAAAA AGATCTTTTA CCTACGCAGC TGGAAGCACT ATCCTAGTAA ATATTGCAGT AGCTTGGGTT GCTATTTTAC TTTTATCAGA TAGCCCTAAT CTTGAACCGA GTAAACTAGT TAATTATATT ATTACAACGT ATGCTTATCC AGGATTAAAA GGGCTTATTG CTGTTGGTAT TACAGCCATG GCTATGTCTA CAGCAGATTC CTATTTAAAT TCCTCTGCAG TATTAGCCGT TAATGATATT ATTAAGCTAT TTAAACCTTC TTGGAAAGAA TCTATTATTG TTATTAGATC TTTCTCATTG ATTTTAGGAG TTTTCGGATT ATTACTAGCC CTCCATGCCA AAGATCTGCT TCAGCTGCTG CTCCTATCTG GCAGCTTTTA TATGCCTATT GTTACTGTAC CACTTCTACT AGCTATTTTT GGTTTTAGAA GTACTACTAG AGCAGTCCTA ATAGGAATGG CAGCAGGCTT TATAACAGTA GTAGGCTGGA AAAAGTTTTT TGGATATACA GGTATGGATA GCCTTATTCC TGGTATAATA GCTAACTTAG TTGTCTATAT AAGTAGCCAT TATATTTTAA GAGAACAAGG AGGATGGGTA GGGATTAAGG AAAAAGGTCC GCTTTTAGCA GCCAGACAAA GCCGTAGAGA AGCATGGAAC AAGTTACTAT ATACCATTAA GCATCCCCAT ATTTATGCTT ACTTACAAAA AAACCTCCCG GCTTATGAAG TTGTTTATTC GCTCTTTGCC GTTTATGTGA TAGGTGCTAC CTATGCTTCT TTTTATACCA TATCAGAAAC AGTAGTTTCT TCTTATCAAG GGCTTTATAA CTTTGCAGCG CATTCTGTTT TAATTGCAAC AGCTGGGTTC TTAACATACC CTGCTTGGCC TCCCACTTTT AAATCTAAAA AGTTTATCAC CTTTGCTTGG CCTCTTGGGA TATTCTATAT TTTATTTGTT GTGGGCACTA TTTTAGTGCT GATGAGCGGT TTCCACGAAG TACAAGTAAT GATCTTTATG CTTAACTTAA TCATGGCTGC TTTCCTACTT TCTTGGCCTT TAATGCTCTT CCTTGCTACA TACGGTATCC TAATAGGGTG TTTAATTGTA TATATGTACT GTGGCAATAT ACATTGTAGT GGAACAGATG GCACAGCTGA ATTCAAAGTT ATTTATAGCA TTCTTTTATT AAGTAGTTTT CTAATTACAA TATTTAGGTT TAAGCAAGAA AAAAAGTCCT TGCAAAACAA GAATATTTAC CTAGAAGGTT TATATGAAGA AAAAAACAAT GAGCTAGCAC AAATTTTAGC TTATGGAGGT GAGGTTTTAA AGGAACTCAA TGCTGACGAG AAAGCATTAA CGGCGGCTTA TATAGAGCAG ATTATCTACC GTATGACGGA TTACATCCGA TTGGAAGTAG CTCAGATAAA ATTAGACCAG CTTTTATTAG AGGTAAAAGA AACCATTAAA CTCATGAACT TATTATCTCT TCCCCAGCTG ATAACGAGTG TAGATACTCG CCAAGAAATT ATTGATGCAG ATAGAGTAAA ATTAAAACAA CTGCTGGTAA ATGGTATCCT ACATGTACAC CAACATAATG CAAACAACCA GCCTATTCAT GTAGTAGTAG AAGACGTTAA GCTAGGCTAT AAGATAGATT ATATTAAAGA TTACACCAGG CAATTAGCAG CCTTAAGATT TACCATTACC ATAGAAAAAG ACACTGCTAA CAAAAAAGAT ATTTACTTAC TTGAGCAGCT GCCTTTGATG AGTCAACATA CAAAAAAAGG TAAACTAATA GAAAATGCTC GTATTATTCA TGCTCATTAT GGATACGCAG AATTGGATAA CGAGCAAACC CAAGTATATG TACTTCCAGC CAACGTAAGA GAAGTAAGAG GTAAAGTGAT GGAGTTATTA AGGGAACCTG TAGCAGTCGA TGAAGCGGAA ATGAAACATC CGCTCGCTAT AAAACTAGAA AATGAGCTAT TAGATAAGAT CAAGGCTATC AAAATAGATA CAAAAACTAT AGCTAAAGCA TTAAAAACTA TTAAAAGATA CCACGCCGGT GTTAAGCGTA AATCAGGCGA GCCTTTTTTT ACCCATCCCA TAGCTGTCGC TCTAATCTTA TTAGAATATT GTAAGGATCA AGATGCAGTG GTAGCAGCGC TACTTCATGA CACAGTTGAA GATACAAGTC TTTCGCTGGT GCAGATTGAA GCTATATTTG GAGAACAGGT AGCATTTATA GTAAAAAAAG TAACTAACCT AGAAGATAAT TTACGCAGGA TAAGCTTAGC AGATCATGAG AATGTTTATA GACTCATGGA GTATGAAGAT GAACGTGCTG CCTACGTAAA ACTAGCAGAT AGGCTACATA ACATGCGCAC CATCAGTGGC CACTCTTCCC TGGCTAAGCA GAAGCATATA GCAAATGAAA CATTAAATTT CTTTGTAGGA CTTGCTGAAA AGTTAGGTTT GGAACCTATT GCAAGTGAGC TTAAAAAACT TAGCTTAGAA GTCTTGGCTA AGAGATAA
|
Protein sequence | MAGRRVRSLR DFSIGNKDFT TATVTSTIVA TWIGGGFMFY GLQNIYKDGL QFVIPLLGST LCLLFTGQVL AIRMGEFLNN LSVAEAMGDL YGPIVRVITA ISGILKSIGS IAIQFQVIAK MLNLLFGIEE TWAIIAAASI VILYSAFGGI RSVIITDLFQ FIAFVIFIPI LGLIVWNHIK NPGQVLSTLT TNPIFSLEQT IGWNPKFISS IGLMLYFLIP GMAPAIFQRV TIARDLEQVK RSFTYAAGST ILVNIAVAWV AILLLSDSPN LEPSKLVNYI ITTYAYPGLK GLIAVGITAM AMSTADSYLN SSAVLAVNDI IKLFKPSWKE SIIVIRSFSL ILGVFGLLLA LHAKDLLQLL LLSGSFYMPI VTVPLLLAIF GFRSTTRAVL IGMAAGFITV VGWKKFFGYT GMDSLIPGII ANLVVYISSH YILREQGGWV GIKEKGPLLA ARQSRREAWN KLLYTIKHPH IYAYLQKNLP AYEVVYSLFA VYVIGATYAS FYTISETVVS SYQGLYNFAA HSVLIATAGF LTYPAWPPTF KSKKFITFAW PLGIFYILFV VGTILVLMSG FHEVQVMIFM LNLIMAAFLL SWPLMLFLAT YGILIGCLIV YMYCGNIHCS GTDGTAEFKV IYSILLLSSF LITIFRFKQE KKSLQNKNIY LEGLYEEKNN ELAQILAYGG EVLKELNADE KALTAAYIEQ IIYRMTDYIR LEVAQIKLDQ LLLEVKETIK LMNLLSLPQL ITSVDTRQEI IDADRVKLKQ LLVNGILHVH QHNANNQPIH VVVEDVKLGY KIDYIKDYTR QLAALRFTIT IEKDTANKKD IYLLEQLPLM SQHTKKGKLI ENARIIHAHY GYAELDNEQT QVYVLPANVR EVRGKVMELL REPVAVDEAE MKHPLAIKLE NELLDKIKAI KIDTKTIAKA LKTIKRYHAG VKRKSGEPFF THPIAVALIL LEYCKDQDAV VAALLHDTVE DTSLSLVQIE AIFGEQVAFI VKKVTNLEDN LRRISLADHE NVYRLMEYED ERAAYVKLAD RLHNMRTISG HSSLAKQKHI ANETLNFFVG LAEKLGLEPI ASELKKLSLE VLAKR
|
| |