Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0449 |
Symbol | |
ID | 6377245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 527486 |
End bp | 530866 |
Gene Length | 3381 bp |
Protein Length | 1126 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642681610 |
Product | hypothetical protein |
Protein accession | YP_001957589 |
Protein GI | 189501872 |
COG category | [E] Amino acid transport and metabolism [K] Transcription [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.812136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCCT ATTACATAAC AAATACTATT ACTATGAATT ACTTCAGCAT AGATCTTCTT ATTGTATATG CGTTTTTAGC TACTACCCTG ATTATAGGCA TTCGGGCAGG TAGAGGCATT AAGGATATTC GTGAGTATGC TATTGGAAAT AAAATGTATG GGGCAGCTAT ACTTGTTTTT ACTTTTTTAG CCACTAATAT AGGTGGAGCT AGCACGCTTA ATGCTGCGGC GGATGTTTTC TCAAATGGTA TTATTAGAAC TCTTGCTACC CTAGGAGTTA TCATACAAAT TTTCATATTT GCTATAATTA TTGTGCCCCA TATGAAACAC TTTACTAACC ATTTGACTAT GGGTGATGTA ATGGGTAGCT TATATGGGAG ATATAGCCAG ATCCTTACAG GTATATTAGG AACGCTGTAC TCTTTTTGTA TGATTGGTAT GGAACTGTTT ATGCTTGGTA TTGTCTGCCA ATCATTGTTA GGCATACCAG CCAGTTGGGG AATTATAGTG GGTGGACTTT TATTAACAAT TTATTCGGCT TATGGAGGTA TTAAAGCAGT AACTGCTACT GATGTATTTC AATTTCTTAT TTTATTTATA GTTATTCCAT TGATAGCTAG TATAGCCATT AACAAGGTAG GGGGTATTAA GGAAGTGTTC CTTCAAGTTC CAGCAGATAG GTTTAAGGTA TTTACACATG AAAAATTCTC TTTTTATCTA ACCTTATTCT TGATTTGGAG TGTTTTACCC TTAGGGCTTG TAAGTCCTCC TATTTTTCAA CGACTTTTGA TGGCCAAACA AACAGAACAG CTACGTAAAC AATATTTAAT AGTGGCTGGT TTAGATCCTT TGTTACGCAT AACCATTATG TTGATTGGAC TAGCTGGACT AGTGCTTTAT CCATACATTC AGGCAGCTGA TGTTATGCCC CATATTATAA AAGAACTCTT GCCAATAGGT ATTAGAGGAT TGGCTATAGC TGGCATGTTG GCTGTGGTTA TATCTACGGC TGATTCTTAT TTACATACTG CTGGTTTATT GCTTGTTCAT GATGTTATTA AGCCTATTTT TGGACAAAAG AAGATTTTTT TTAATGAATT GCATTGGACT AAATATTGTA CTTTTCTCAT AGGTACAATA AGCATTATAA TAGGTTTAAA ATCGACTAAT CCTCTTAGCT TAAGTTTTGG AGCCATGCGC ATAGCAGGCC CTGTTTTATT GTTTCCACTA CTAGCCGGAA TTATAGGGCT TAAACCAGAT AAGAAGCCTT TTTATGTGTC CATGGTAATT ACCGTGTTTA CCTTTATTAT AACCACTTTC TACTTGCCTA AAAGTCAAGC GCATTTCGGA GTGCCTATTA GTATTGTTGT TAATGCGATC AGCTTTTTTA TTGTTCATCT TATACAAAAT CGAGGCGTTG CTATGGTTGA TAGAAATTAT AGTGAGTCTT CCAGAATAAT ACCAAGCGCA GGTAGTAAAA CCATCCAAGA TCAGCTCAAG TCCTATATCC CTACATTATC TAATATTATT CGATATTCAC AACAACGTGT TCGACAATAT GGGGCTCCTT ATATCTTATT TGGGATATTT TTTACCATTA ACTTTACTTA TCCATATTTC ATGTGGAGTT CTAGCGGACT GCAAGCCCCT AATCTAATGC TTGCATTACG ATTGGTAGGA GCGTTCGCTT GTGGGCTTTT AATTGTGCAA TCGAAATGGC CTAAATCTTT ACTTCCTTAT ATGCCTACCT ATTGGCACTT AACCATACTT TATTGTTTGC CTTTTATGAG TACTATGATG TTTCTACTTA CCCAAGGTAG CACAGAGTGG CTCATTAATA TTGCCATTGT GATCATCCTG CTTTTTATAC TGGTAGATTG GGTTACTGCT ATGATACTAG GCATCCTTGG TGTAAGCTTG GCAGCTATAT TTTATAAATT ATTTGTCGGA GCAATACATT TCTCGTTAGA TTTTTCTTCT AAATACTTGC TGCTATATCA AGGTATTTTT GGGTTGTTTA TTGGCCTTAT TTTTGCCCGT AGAAAAGAAC AAAGGTTCGA CTTTCTTCTA CAACGCAACC AACAACTCAC CGAAGTCCAG CAAAAAAACC GTGCAGAGCT GGCAGAAACT CTTGCCTACA GAGAACAACT GTTTCAAGAG CTTAATCCAG ATGAAGCAGC CCTTTTTGAT GAGGTAACTA CGGCTTATAT CAAGCAAGCG ATTTATCGTA TGACTGATTA TATGCGCTTA GATGTAACGT CTATAAGTCT AGATGAACTG CAAAAAGCAT TATCAGACGC CTATAAGCTG CAAGGTATTG AACAATCTGA GCTTCTGTTT CATAAAGATA CAAAACAGAT GGCTCTCCAA GCGGACGTTG CCAAGTTAAA GCAACTTTTA CTTAATGCGA TCAACTACAC ACAACAATAT AACACTGATA ATAACCCTAT TACCATATCC ATAGAGGATG CCTTACTAGG ACATGATATA GCTCACATGC AAAACTATAC GCGTAAATTG GAAGGGTTAA AAATAACTAT TACCACAAAA CAAGCTTTAC CTCTAACCCA GCCTATTTAT AAAATAGATC CTGCTAAATC TAGCACCTGG GTACCTCAGC ATGAAGATGA ATTCTTATTG GTAGAAAATG CCCGGATTAT CGATGCGCAT TATGGATATA TGTATGCCAA ATCAAGGCAC ACACAAGTGT ATGTATTCCC TGTTAAGCTA AGGGAAATTC GTGGCAAGGT GATGGAGCTT ATCAAAGAGT CAGCAGCTGC AGACCCAGGA GAATTGAGCC ATCCGCTAGC CATACAACTC GAGCAAGAAC TTTTAAAGAA GCTTAAAGGG ACACAAGTAG ATATAGTCCT TATTCAGAAG GCGCTAGATA TCATCAAGAG ATACCATGGA GGTGTAAAAA GAAAATCAGG AGAACCCTTT TTTACTCATC CTATAGCTGT AGCACTAATT TTATTAGAAT ATTCACAAGA TCAAGATGCT ATTTTAGGAG CTTTGTTGCA TGATACAGTA GAAGATACTA GCCTATCACT CGCGCATATT CGTATGCTTT TTGGAGAAAC AGTGGCATTT TTAGTAGCCA AAGCAACTAA TCTAGAAGAT CGTGAGCGAC GAATAAGCTT AACCGATAAA GAAAATCTAG CTCGAATTCT AAACTATGAA GATCCTAGGG CACCTCTGAT AAAATTATCA GATCGGTTGC ATAACATGCG TACCATCCAG TTTCACTCTT CGGTAGCTAA ACGTAAATAT ATTTCTCAAG AGACATTAGA TTATTTTGTG CCATTAGCAA GAAAATTAGG TTTAGAAAAA ATGTCTGTTG AACTGGAGCA GCTAAGCCGA GCAATAGTAT TTAATAATTA A
|
Protein sequence | MAPYYITNTI TMNYFSIDLL IVYAFLATTL IIGIRAGRGI KDIREYAIGN KMYGAAILVF TFLATNIGGA STLNAAADVF SNGIIRTLAT LGVIIQIFIF AIIIVPHMKH FTNHLTMGDV MGSLYGRYSQ ILTGILGTLY SFCMIGMELF MLGIVCQSLL GIPASWGIIV GGLLLTIYSA YGGIKAVTAT DVFQFLILFI VIPLIASIAI NKVGGIKEVF LQVPADRFKV FTHEKFSFYL TLFLIWSVLP LGLVSPPIFQ RLLMAKQTEQ LRKQYLIVAG LDPLLRITIM LIGLAGLVLY PYIQAADVMP HIIKELLPIG IRGLAIAGML AVVISTADSY LHTAGLLLVH DVIKPIFGQK KIFFNELHWT KYCTFLIGTI SIIIGLKSTN PLSLSFGAMR IAGPVLLFPL LAGIIGLKPD KKPFYVSMVI TVFTFIITTF YLPKSQAHFG VPISIVVNAI SFFIVHLIQN RGVAMVDRNY SESSRIIPSA GSKTIQDQLK SYIPTLSNII RYSQQRVRQY GAPYILFGIF FTINFTYPYF MWSSSGLQAP NLMLALRLVG AFACGLLIVQ SKWPKSLLPY MPTYWHLTIL YCLPFMSTMM FLLTQGSTEW LINIAIVIIL LFILVDWVTA MILGILGVSL AAIFYKLFVG AIHFSLDFSS KYLLLYQGIF GLFIGLIFAR RKEQRFDFLL QRNQQLTEVQ QKNRAELAET LAYREQLFQE LNPDEAALFD EVTTAYIKQA IYRMTDYMRL DVTSISLDEL QKALSDAYKL QGIEQSELLF HKDTKQMALQ ADVAKLKQLL LNAINYTQQY NTDNNPITIS IEDALLGHDI AHMQNYTRKL EGLKITITTK QALPLTQPIY KIDPAKSSTW VPQHEDEFLL VENARIIDAH YGYMYAKSRH TQVYVFPVKL REIRGKVMEL IKESAAADPG ELSHPLAIQL EQELLKKLKG TQVDIVLIQK ALDIIKRYHG GVKRKSGEPF FTHPIAVALI LLEYSQDQDA ILGALLHDTV EDTSLSLAHI RMLFGETVAF LVAKATNLED RERRISLTDK ENLARILNYE DPRAPLIKLS DRLHNMRTIQ FHSSVAKRKY ISQETLDYFV PLARKLGLEK MSVELEQLSR AIVFNN
|
| |