Gene Information |
Locus tag | Aasi_1484 |
Symbol | |
ID | 6376500 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 142254 |
End bp | 144626 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003572981 |
Protein GI | 294661106 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
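The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that does not count toward the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 142254
END_BP = 144626
REPORTED_GENE_LENGTH_BP = 2373
REPORTED_PROTEIN_LENGTH_AA = 790


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)

# Both derived values agree with the reported table entries.
assert length_bp == REPORTED_GENE_LENGTH_BP
assert length_aa == REPORTED_PROTEIN_LENGTH_AA
print(length_bp, length_aa)  # 2373 790
```

Under these assumptions the reported gene length (2373 bp) and protein length (790 aa) are consistent with the listed coordinates.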
Plasmid Coverage Information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCCTAT TACTACTTTT CAGTAATGTT TCTTGTAAAT GTGGCAATTT TAAGCAAGGT AAACCAGCTA AACAAGGTAG GCCCATAAAT AAGAAAGGGA GTCTTGCCCT AGAAAAAAAT AGTCTTAGTA TTAAAGCTTA TCCAGATAAA TTAATAGGAG ATTCTAAAAA GACAAAACTT GCCATACAAC TAATAGATAT TAGTAAAGAA GTTCAATTAG ATGAAATTAT ACTAAAAAGT ACACTTATAC ATCAAGATGG AAATGGTAGC CAGCTAAACT ATACTGATGC TGCAGGGAGA ATACATAAAA TGAGCAACCT AGCAAGTCAA TTAGCAGTAT TTAATAAAGG TACTATATTG CATGCTAAAG CTCGGTTGTT AGAAGTAGAA GTGGAAATAC TGCCTGGACC AGCTGTTAAA GAAGTAACAT ACCAATTTGA ATTATTAAAT AATGTAGGCA AGTACATAAA TAGTTGTGAA GCAAACTGGA AAGAGCAGGA AGCTATAATT CAAGATGTAA TTTATGATCA GACTACCCAA GAACTGGTAT GTATCTTTAA AAATATAGGT TTAAAAGCAC TAGAAAACAT ACAATTAAAT TATACTAGTC AAACAGATGA ACTTAAATTA GGCGAAGAAA TTTTAGACAA GGGTACAACA AAGACAAAGA ATATAGCTGC TCTGCCTATA GGCACTATGG CATTATCCTT AGGTAAATTA AAATTAGACA CTCAACAACT TGCAAAGATT GAGGTCTCCC TTATATCATC AGAAAATAAG ATTTTGTTTC AGCCTAAGAC TTATGCTTTC GTTAACCCTG GCATAAAATT AGATTTTAAA AATTTATATT ATAATGCTTC AGCAAAGAGT ATTGTTTATC AGGTGTCTAA TTTAGGGACG TTGCCTGGAC ATAAAATTCA GGTTAAGTAT AAGAATATAA GCAGATCTAT AGAGGAGCAA CAAGTGAACT TAAATGGAGA GAAAGAGCAA ACAGTAGATA TAGAATATTT GGATACAGGT ACACATACAG CCTTTTATGA GTTACCGATC AATTTCAAAG GGCAAAAACA TGCTGTGTTT TCATTTAATA TATTATATGC AGGTGTGTCT ATGGTGCATA AAGAGTTGAT ATGTGAAAAT GAATTGGCTG ATAATGCTAT ATACCAAGCA ATAGAGAAAG AGAATTTTGA TGAGGTATTA AAGCTTATTG AAAATGCTCC GTTTGATGCG ATCAACTATC AAGATCCTGT TACTAAAGAT ACTCCTTTAT TATTAGCAGC ACAGTTAGGC CACCAAAGCA TAATAGAAGC ACTACTTAAG GCAGGAGTAA ACGTAAATAC TCAGAATAAG TACGGAACCA CAGCATTGCT ACGTGCAGCT ATAGATGGGA AAAAAGATAT TGTAGAGGCA CTAATTAAAA GTGGGGTTGA CTTGGATACT CAGCATGGAG GTAAGGCATT ACTTCATGCA GTATATAGTG GATACAAAGA TATAGTAAAA GCTTTGCTTG ATGCAGGAGT AAATGTAAAT ACTCAAGGTG GTGACGGAAG AACAGCATTG ATGGAAGCAG TATCAAAGTC TTGGAATTTA GAGGGGGAAG AGATAATAGC GCTTTTACTT AACAATGGGG CTAATATTAA TGTGCAAGAT CAGGAAGGGA ACACAGCTTT GATGTATGCT ACTTTTGGAA AGGACCAAGC AATAGTAGAG GCATTGCTCA ACAGAGGGGC AAGAATAGAC CTGAAAAATA AGTTTGGGGA AACAGCGTTA TGGGGAGCAT TTGATAAAGA AAATATATCA 
ATGTTAAAGT TATTGTTCAA AAGGTTAGAC AAGCATATTC AAATTAACCA GTTAGGTAAA TTTATACAAC ATGCACTTCC ATGGGCAATT AGAAGGGGAG CTAAAGAAAT AGTAGAAGAA TTACTAAATA GAGGTGCTGA GCTTAACAGA TCAGATGAAT GGAACACTAA TCTTATAGAA GCTATCATAA ACAAACAACC AGAGATTATA AAATTATTGC TTGAAAAAGG TGCTAAAGTG GATGGCCAGA ATAATAAAGG TGAGACAGCT TTAATGTGCG CAGTTCAGAA AGGAGATACA GATACAGTCT CTACGTTATT AGAGCAAGGA GCTGATGTTA ACAAAAGAGA TTTGCAAGGC TTTACTGCAT TAATGTATAT AGTTAAGCGT ATGCATGGAA TTGAAAGCCT AGCGAAGAAA TTAACGAACG AAGATATATT AGAAAAATTA CTAGAAGCAG ATGCAGATGT TAATATAAGT AATGGTGATG GAGAAACAGC TTTGGATTTA GCTAATGATA AACCAGAAAT TAAGAATCTA TTAGAGAGTC ATTTAGCTAG AGCGAAATTT TAA
|
Protein sequence | MSLLLLFSNV SCKCGNFKQG KPAKQGRPIN KKGSLALEKN SLSIKAYPDK LIGDSKKTKL AIQLIDISKE VQLDEIILKS TLIHQDGNGS QLNYTDAAGR IHKMSNLASQ LAVFNKGTIL HAKARLLEVE VEILPGPAVK EVTYQFELLN NVGKYINSCE ANWKEQEAII QDVIYDQTTQ ELVCIFKNIG LKALENIQLN YTSQTDELKL GEEILDKGTT KTKNIAALPI GTMALSLGKL KLDTQQLAKI EVSLISSENK ILFQPKTYAF VNPGIKLDFK NLYYNASAKS IVYQVSNLGT LPGHKIQVKY KNISRSIEEQ QVNLNGEKEQ TVDIEYLDTG THTAFYELPI NFKGQKHAVF SFNILYAGVS MVHKELICEN ELADNAIYQA IEKENFDEVL KLIENAPFDA INYQDPVTKD TPLLLAAQLG HQSIIEALLK AGVNVNTQNK YGTTALLRAA IDGKKDIVEA LIKSGVDLDT QHGGKALLHA VYSGYKDIVK ALLDAGVNVN TQGGDGRTAL MEAVSKSWNL EGEEIIALLL NNGANINVQD QEGNTALMYA TFGKDQAIVE ALLNRGARID LKNKFGETAL WGAFDKENIS MLKLLFKRLD KHIQINQLGK FIQHALPWAI RRGAKEIVEE LLNRGAELNR SDEWNTNLIE AIINKQPEII KLLLEKGAKV DGQNNKGETA LMCAVQKGDT DTVSTLLEQG ADVNKRDLQG FTALMYIVKR MHGIESLAKK LTNEDILEKL LEADADVNIS NGDGETALDL ANDKPEIKNL LESHLARAKF
|
| |
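The gene sequence above begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG can act as an initiation codon and is then read as methionine. The following minimal sketch illustrates this on the first 30 bases of the record. It assumes the gene sequence is already shown in coding orientation, uses the standard amino acid assignments (which table 11 shares), and takes the set of initiation codons from the NCBI description of table 11. All function names are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons of translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and first 10 residues, copied from the record.
GENE_PREFIX = "TTGTCCCTAT TACTACTTTT CAGTAATGTT"
PROTEIN_PREFIX = "MSLLLLFSNV"

print(translate(GENE_PREFIX))  # MSLLLLFSNV
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content of 34%.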