Gene Information |
Locus tag | Aasi_1962 |
Symbol | |
ID | 6377487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1873800 |
End bp | 1876895 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003573286 |
Protein GI | 294661410 |
COG category | |
COG ID | |
TIGRFAM ID | |
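The coordinate and length fields above agree with one another, which can be checked with simple arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon. The record states neither convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1873800
END_BP = 1876895
GENE_LENGTH_BP = 3096
PROTEIN_LENGTH_AA = 1031


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


computed_gene_len = gene_length(START_BP, END_BP)
computed_protein_len = protein_length(computed_gene_len)

# Compare the computed values with the values listed in the record.
print(computed_gene_len == GENE_LENGTH_BP)
print(computed_protein_len == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for this record: 1876895 - 1873800 + 1 = 3096 bp, and 3096 / 3 - 1 = 1031 aa.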
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAACT TACTACGTTA TACTGCTATA CTTATGATAT GGTCTATAAG TAATCTAGCC ATATATGGAC AAGAACTGCT AGAAAAGCCC ATATCTATTT CAGATTTTTA TAAAGGGTTA GAATTATATG AAAAACAACA ATATGAGGCA GCACAGCACT ATATGGATAG GTATATAACC GAACATACAG CCTACATAGG AAATGAATAC GTTATAGAAG CAACTTACTA TGCAGCATTT TGTGCTATTA AACTAGATAG AATAGATGGA GAAGTACGTC TTCAGCAATT TGTAGAAAAA TATCCTTATC ATCCGAAAGC TGCTCTAGCA TATTATGAAT TAGGTAATCT ACGTTGTTAT CAACAAGACT ATGCTAAAGG TATTACCTAT TACTTATCAG TCAACAAAGA ACAGTTGGCT AACACTTTAC ATACTGAATT GCAATACAGG TTAGCGTATG CATACCTAAA TGAAAGAGAC TTTGGCCAAG CTTTGTCTTA TTTTAATGCT ATTAAAAATC ATGATACTCC CTACACACCT GCCAGCAACT ACTATGCTGG ATATCTTGCC TTAAAAAAAG GGGATTATGA GAGTGCATTA ATAGACCTAA GAAAAGCTGG GAACCATGAG GCATATGAAG CTGTAGTCCC TTATATGATA ATGGAAGTAC TTTACCAAGC GAAACGTTTC CAAGCAGCCA TTAATTATAT TAAAGACGTA CAGACTAAAC AACCAACATT AAAAAATTAC GAAGATATAG AATTACTTAC TGCCGAATCT TATTTCTTCT TAAAGGATTA TGCATCAGCT ACCCGACATT ACGAAAATTA TATTCACCTT CAACCTTCAG AGGTAACGCA TGAAGTCTTT TATCGGCTAG CTTACTCTTT ATATAAATCA GGAGAAAACT ACAAAGCGCT CAAATATTTA AAAGAGTTAG CATTGCAAGA TGATTATCTG GCTCAGCTAG CGAGCTACTA TATGGGGTTG ATATATATCA AAACGAGCCA AAAGAATTTA GCATTAGCAG CATTTGACCA AGCAAGGCAG ATGAACTTTA TTAATGAAAT ACAAACAGAA GCATCTTTCC AATATGCACA ATTAAGCTAT GAGTTAGGAA AACTTACCAT ATCCATTGAC GCATTACAAA AATTTAAAAG ATCTTACCCT AACAGCCCGC ATATAACTAC AGTTGATCAG TTACTAAGTC AAGTGTACTT TCATACCAAC CACTATGATT TAGCAATTGC TCATATTGAG AGCTTACAAG AAAAGCCAGA AACTGTTTTA CAAGTGTATC AAAAAGCAAC TTTTTATAAG GGCAATGCTT ACTTCAACCA GGAAGCTTAT GATAAAGCCA TTACTTGGCT ACAAAAATCT TTATATTATC CTTTAGATAC AGATATAACA CTACAAACAC ATTTATGGCT TGCAGAAAGT TATGTAGCAC AGCAAGCTTA CGAGCAAGCG ACTACGCATT ACCAAACAGT ACTTGCAGCA ACCGACAAAA AGAATACAAA TTATTACCAA GATGCACTTT ACGGGCTTGG CTACGTTTTA TTTAATACAG AAAAATACAA GGCAGCATTA CCATTATTTT TACAATATAT CAATATACCT AATATAACTA ACGATAATAA TTGGCGTTTA GATGTATTAG TTAGAACAGC AGATTGCTAT TATGCTATTA AAGATTACCA TAAGGCGCTA GATTTATATA CTAAAACTGA AGATAATTAT CCTGCACATA ACCGTTATCA GAAAGCTCTT ATTTATGGGT TGCTTGGCAA GTTCGTAGAA 
GCCAAACAAA ACTTAGAAAG TATTATTAAT ACCTGTCCAC ATACTGCCTA TTATGAAAAA GCATTATTTG AATATGCATA CCTAGCGTTG CAACATCAGG AGTATGATCT AGCAATCAAA AGTTTTACCA ACTTTATTCA AAAGAAACCT TATAGTACTC TTGTGCCAGA TGCTTTGCTT CATAGAGCAG TTGCTAAGGT AAACTTAAAA CAATATGCAG AAGCCGGAAA AGATTATGAA ACATTACTTA AAGACTATCC AACTCATCCA AATGCACAGA GTGCATTATT AGAGCTTCCC AATTTAGTTG TACAAGAAGG AAAACCAGAG AAGCTACAAC AATATTTAGC TAGCTACAAG GCTGCTAATC CAAGCAGTGA GACACTAGCA GCCATAAGCT TTGAAGCTGC TAAAAATTTA TTTTATAGTC AGAACTACAC ACCAGCAGTT CAACAACTTA AAGAGTTTAT AACCAGTTAT CCTAATAGTA CATTGATAGA TGAAGCTAAC TTTTTAATAG CGGAAGCTTA TTATAGATTA GCAGAAGATG AGCAAGCGCT TATACAATAT CATATTACTA GTAAAAATAA ACAGACACCC TTTTATAATA GAATCTTATT ACGTATTGCA TCGCTTGCTT ATAAGCACAA GGATTTTAAT ACAGCACTTA CACATTACAA GCAGCTTAAA GAAAGTGCTA GCAATAAAAA AGAAACTTAT TATGCCTTAG AGGGAATCAT GAAGACTAGT GATGCGCTAC AACAATATGA AGAAGTCAAC AAAGCAGCTT CACAAATTAT TAATCAAGGT AACATAACAA TTAATGCTGT AAGCCAAGCA GCTTTATATC TAGGAAAAAC TGCCCTAAAG CAAGCTAAGT ACCAAGAAGC TCATGAACAT TTTAAACAAA TTGTAAAAAA TGGACAAGAT ATGTATGCAG CAGAAGCTCA ATATCTAATA GCATACACTT ATTACCAACT AAGAGAATTT AAACAATCAT TAGAAGCATT GTTTATCCTT AACAAACAGT TTGCTGAATA TACTGAATGG ACTAACCAAG GGTTCTTGTT GATGGCAGAT AATTATATAG CCTTACAAGA ATTTTTCCAA GCAAGAGCTA CACTTCAGTC TATTATAGAA AATGCTACTG ATTCAAGCTT TGTTAACACA GCACAACAAA AACTACAGCA ACTTATACAG CAAATAGAAG CAGATAGCCT TGAACAAGCA CAGGCAAAAA CTACAACCAC GCAACCATTG CAGGATGAGG ATAATGAATT TAAAACATTA GAATAA
|
Protein sequence | MTNLLRYTAI LMIWSISNLA IYGQELLEKP ISISDFYKGL ELYEKQQYEA AQHYMDRYIT EHTAYIGNEY VIEATYYAAF CAIKLDRIDG EVRLQQFVEK YPYHPKAALA YYELGNLRCY QQDYAKGITY YLSVNKEQLA NTLHTELQYR LAYAYLNERD FGQALSYFNA IKNHDTPYTP ASNYYAGYLA LKKGDYESAL IDLRKAGNHE AYEAVVPYMI MEVLYQAKRF QAAINYIKDV QTKQPTLKNY EDIELLTAES YFFLKDYASA TRHYENYIHL QPSEVTHEVF YRLAYSLYKS GENYKALKYL KELALQDDYL AQLASYYMGL IYIKTSQKNL ALAAFDQARQ MNFINEIQTE ASFQYAQLSY ELGKLTISID ALQKFKRSYP NSPHITTVDQ LLSQVYFHTN HYDLAIAHIE SLQEKPETVL QVYQKATFYK GNAYFNQEAY DKAITWLQKS LYYPLDTDIT LQTHLWLAES YVAQQAYEQA TTHYQTVLAA TDKKNTNYYQ DALYGLGYVL FNTEKYKAAL PLFLQYINIP NITNDNNWRL DVLVRTADCY YAIKDYHKAL DLYTKTEDNY PAHNRYQKAL IYGLLGKFVE AKQNLESIIN TCPHTAYYEK ALFEYAYLAL QHQEYDLAIK SFTNFIQKKP YSTLVPDALL HRAVAKVNLK QYAEAGKDYE TLLKDYPTHP NAQSALLELP NLVVQEGKPE KLQQYLASYK AANPSSETLA AISFEAAKNL FYSQNYTPAV QQLKEFITSY PNSTLIDEAN FLIAEAYYRL AEDEQALIQY HITSKNKQTP FYNRILLRIA SLAYKHKDFN TALTHYKQLK ESASNKKETY YALEGIMKTS DALQQYEEVN KAASQIINQG NITINAVSQA ALYLGKTALK QAKYQEAHEH FKQIVKNGQD MYAAEAQYLI AYTYYQLREF KQSLEALFIL NKQFAEYTEW TNQGFLLMAD NYIALQEFFQ ARATLQSIIE NATDSSFVNT AQQKLQQLIQ QIEADSLEQA QAKTTTTQPL QDEDNEFKTL E
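The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch, the helper functions below (hypothetical names, not part of any database tooling) strip that whitespace and compute a GC percentage that could be compared with the GC content field. The sketch assumes the pasted nucleotide sequence contains only the letters A, C, G and T plus whitespace.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as a short demo;
# paste the full Gene sequence field here to check the whole CDS.
example = "ATGACAAACT TACTACGTTA"
print(len(clean_sequence(example)), gc_percent(example))
```

The same `clean_sequence` helper can be applied to the protein sequence to count residues, since it only removes whitespace.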