Gene Information |
Locus tag | Aasi_0563 |
Symbol | |
ID | 6376418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 722840 |
End bp | 726205 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642681718 |
Product | hypothetical protein |
Protein accession | YP_001957693 |
Protein GI | 189501976 |
COG category | [E] Amino acid transport and metabolism; [K] Transcription; [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases; [COG0591] Na+/proline symporter |
TIGRFAM ID | |
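
The coordinate and length fields in this record are mutually consistent, which can be verified with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a complete CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(722840, 726205)
print(length_bp)                  # 3366
print(protein_length(length_bp))  # 1121
```

Both values match the Gene Length (3366 bp) and Protein Length (1121 aa) fields above.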
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0552395 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACCTTA TGCTAAAAGA TTACTTAGAT ATCATACTAT TCACTACATT TTTACTTGTT AACTTAATAA TAGGCCTGAT CGCTGGTCGA CGTGTAAGAA GCTTGCGAGA TTTCTCCATA GGTAATAAAG ATTTTACTAC AGCTACGGTA ACTTCTACCA TTGTAGCTAC CTGGTTTGGA GGTGGATTTA TATTTTATGG ATTACAAAAT GCTTATACAA GTGGACTGCA ATATATTATA CCACTTCTGG GATCAAGTTT ATGTTTGCTA TTTACTGGTC AAGTACTTGC TGTACGTATG GGCGAATTTT TAAATAATCT TTCTGTTGCC GAGGCAATGG GAAATCTATT TAGCCGATTG GTACGTATAA TTACTGCTAT TAGCGGAATA CTTGGTGGAA TTGGTTATAT AGCCATTCAG TTTCAGGTGA TTGCTAAAAT GTTAAACTTC CTCTTAGGAT TTGAGGGGCC TTGGGTTACT GTTGCAGCAG CTTCTATCGT CATTATTTAT TCAGCTTTCG GTGGTATCCG ATCTGTTATT ATTACAGATT TATTGCAGTT TATTGTCTTT AGTATTTTTA TCCCTATTCT TGCCTTGGTG GTTTGGAACC ACCTAAAAAA ACCTGGACAA GTAGCACATA CCTTAGTTAC CAATCCGGTA TTTAACTTCA AGGAAATGCT AGGTTGGAAC CCTAAATTTT TATCTGCACT AGGCTTAGTG CTTTATTTTG CTATTCCGGG CATAACTCCT GCTATATTCC AGCGCGTAAC TATATCTAAA GATCTTAGAC AAGTTAAAGA TTCTTTTACC TATGCAGCTG CAAGTTCGTT ACTGATGGTT GCTTCATTAG CTTGGATTGC TATTTTATTA CTGTCAGATA ATCCTAATCT AGAGTCTGGT AGTCTTGTAA ACTATATTAT TACAACCTAT GCTTATCCAG GACTGAAAGG ACTTATTGCT ATTGGTATTA CAGCCATGGC TATGTCTACA GCAGACTCTT ATCTTAATTC ATCTGCTGTG TTAGCAGTTA ATGATATTCT GAAACCACTG CAGCTTTATT GGAAAGATTC TATTAAGATC GTTAGAGTTT TTTCCTTTGT TTTAGGAATT TTTGCATTGT TATTAGCGCT TCGCAGTACA GACTTACTAC AATTGATGCT ACTGTCTGGC AGCTTTTATA TGCCTATTGT TACTGTACCA CTCTTATTAG CCATTTTTGG TTTTCGAAGT AGTAGCAGAT CAGTTCTTAT AGGAATGTCA GCAGGCTGTA TAACTGTGAT ATTATGGAAT AAGTTTTTAA CTCATACAGG CATGCAAAGT CTTATCCCTG GTATGATCGC TAACTTAGTA TTCTATGTAG GCAGTCATTA TATACTTAGG CAACAAGGGG GCTGGATAGG CATACGAGAT AGAGAACCCC TTTTGGCAGC TCGGCAAGGC CGTAGAGAGG CATGGCAGAA ATTTATTTAT AAGGTTAGAC ATCCACAAAT CTATACTTAC TTACAAAAAA ATCTGCCGGC TTATGAAGTT GTTTATACGC TTTTTGCCAT TTATGTGATT GGTGCCACGT ATGCTTCTTT TTACACCATA CCAGAGTCAA CAGTTGCCAA TTATCAAAAG CTCTATGATA TTGCTGCACA TTCTGTTTTA GTTATGACGG CTGGGTTTCT TACCTATCCT GCTTGGCCCC CTACTTTTAA AGCCAAATGG TTCATCACCT TTGCTTGGCC AGCGGGTATC CTGTATGTAC TCTTTATAGT AGGGACTATC TTAGTACTTA TGAGTGGCTT TCATGAAGTA 
CAAGTCATGA TTTTTCTAAT TAATCTAATC ATGACTGCAT TTTTACTTTC CTGGCCCCTA ATGTTATTTG TTTCTTTGGT AGGTATAATT ATTGGCTGCC TAGTTTTTTA CATGTATTGT GGGGATTTAT ATGCTTGTAC CAGTTCAACT GGTTCAGGGC AATTTAAAGT TATTTATGGC ATTCTTTTGT TTAGTAGCTT TCTTATTGCT CTATTCAGAT TTAAACAAAG CCAAACAAGG CTCGAGAATA AGAATGTTTA TTTAGAAAAT ATATATAAAC AATTAAAAGA TGAACTATCT GAAATTTTAG GCTATAAAGA GACGCTTGTT AAAGAACTAA AAGAAGATGA GTTGGCTTTG TTTGGAACAA CCGCAGCTGC TTACATGCAA CAAGCCATTT ATCGTATTAC CGATTATGTA AGATTAGAAG TAAGCCAGAT CAAACTAGAA GATCTTATAG TAGAAATTAA AGATATACTC AAGTTAAAAG ACTTTGATGG ACAGTCCCCT CAACTAAGCA GTAAAAAATA TACTAAAGAG GAGTCAATCT ATGCGGATGC TGATAAGATC AAACAAGTGC TTGCTGACAG TATCTGTTAT ATCCATAAAC ATAATCTATC CAATAAAACT ATTGTTATTG CACTTGAAGA TGCCATGCTA GGCCATAGTG TAGAACATAT GAAAGATTAT ACACGAAAAC TAAAGGCTAT TAAAATTACC ATTACAACCC AGGAACTATT ATCTAATACC TCTAATCGTG TATATATGTT TGATCAAGCA CCTTCTATTA GTCGGATGGC TGGGGATGGA GATAGAAAAG CTTTGTTAAA TAATGCACGT ATTATCGATG CTCATTATGG TTATGCAGAG CTAGATAAGG AAGATACCCA TATATATGTT CTTCCTATCA ATGTACGTGA AGTAAGGGGA AAGGTTATGG AGCTTTTAAG GGCGCCTGCA GTGGCCAGTG AGGAAGAAGT AAAGCATCCT TTAGCCATAC AGCTGGAGGA AGAGCTACTT AATAAAACAA AAACTGCTAA ACTTGATACA AATGTTATAG AAAAATCACT TAATACCATT AAGAGGTATC ATGCAGGAGT TAAAAGAAAA TCAGGAGAAC CTTTCTTCAC TCACCCAATA GCTGTAGCAT TGATCTTGCT AGATTATTGC AAAGATCAGG ATGCCGTAAT AGCAGCACTA CTACATGATA CAGTGGAAGA TACTAGCTTA TCTATAACCC ATATTAAGGC TATGTTTGGA GAAAAAGTAG CTTTTATAGT AGGAAAGGTA ACCAACCTAG AGGATAAAAT AAGAAGGCTA AGCCTTGAGG AACATGAAAA TATTAAAAGG TTAATTAACT ATGAAGATGA GCGGGCGGCT TTTGTAAAAT TAGCTGACAG GCTTCATAAC ATGCGTACCA TTGAAGGACA TTTGTCCTTA TCTAAACAAA AGCATATAGC CAATGAGACC TTGCTTTTCT TTGTACCACT TGCCAACCAA TTAGGGGTAG ACCATGTGGC ACAGGAATTA GAAAAACTTA GTTTGGAAGT ATTGGCTAAG AAATGA
Protein sequence | MDLMLKDYLD IILFTTFLLV NLIIGLIAGR RVRSLRDFSI GNKDFTTATV TSTIVATWFG GGFIFYGLQN AYTSGLQYII PLLGSSLCLL FTGQVLAVRM GEFLNNLSVA EAMGNLFSRL VRIITAISGI LGGIGYIAIQ FQVIAKMLNF LLGFEGPWVT VAAASIVIIY SAFGGIRSVI ITDLLQFIVF SIFIPILALV VWNHLKKPGQ VAHTLVTNPV FNFKEMLGWN PKFLSALGLV LYFAIPGITP AIFQRVTISK DLRQVKDSFT YAAASSLLMV ASLAWIAILL LSDNPNLESG SLVNYIITTY AYPGLKGLIA IGITAMAMST ADSYLNSSAV LAVNDILKPL QLYWKDSIKI VRVFSFVLGI FALLLALRST DLLQLMLLSG SFYMPIVTVP LLLAIFGFRS SSRSVLIGMS AGCITVILWN KFLTHTGMQS LIPGMIANLV FYVGSHYILR QQGGWIGIRD REPLLAARQG RREAWQKFIY KVRHPQIYTY LQKNLPAYEV VYTLFAIYVI GATYASFYTI PESTVANYQK LYDIAAHSVL VMTAGFLTYP AWPPTFKAKW FITFAWPAGI LYVLFIVGTI LVLMSGFHEV QVMIFLINLI MTAFLLSWPL MLFVSLVGII IGCLVFYMYC GDLYACTSST GSGQFKVIYG ILLFSSFLIA LFRFKQSQTR LENKNVYLEN IYKQLKDELS EILGYKETLV KELKEDELAL FGTTAAAYMQ QAIYRITDYV RLEVSQIKLE DLIVEIKDIL KLKDFDGQSP QLSSKKYTKE ESIYADADKI KQVLADSICY IHKHNLSNKT IVIALEDAML GHSVEHMKDY TRKLKAIKIT ITTQELLSNT SNRVYMFDQA PSISRMAGDG DRKALLNNAR IIDAHYGYAE LDKEDTHIYV LPINVREVRG KVMELLRAPA VASEEEVKHP LAIQLEEELL NKTKTAKLDT NVIEKSLNTI KRYHAGVKRK SGEPFFTHPI AVALILLDYC KDQDAVIAAL LHDTVEDTSL SITHIKAMFG EKVAFIVGKV TNLEDKIRRL SLEEHENIKR LINYEDERAA FVKLADRLHN MRTIEGHLSL SKQKHIANET LLFFVPLANQ LGVDHVAQEL EKLSLEVLAK K
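
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted in as 10-base groups separated by whitespace, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence above.
first_groups = "ATGGACCTTA TGCTAAAAGA TTACTTAGAT"
print(translate(first_groups))  # MDLMLKDYLD
```

The output matches the first 10-residue group of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.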