Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0816 |
Symbol | |
ID | 6376949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1033488 |
End bp | 1034840 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642681957 |
Product | hypothetical protein |
Protein accession | YP_001957919 |
Protein GI | 189502202 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase |
TIGRFAM ID | [TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.646747 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT TAGTCGTTTT AGGAGCAGGG GAAAGTGGAA CAGGAGCTGC TCTATTGGCA CAAGCAAAAG GCTATCAAGT ATTTGTCTCA GATAAAAACA TAATTACAAA AGGCTATAAA CAGCAGCTAT TAGACCATAA AATTGAATTT GAGGAGTGCA AACATACCTG GGATAAGATA AAAATTGCCG ATGAAGTTAT TAAAAGCCCT GGTATCCCTA ATCATATAGC TATCATACAA GCACTTCAAG AAGCAAGTAT ACCTATCATA GACGAAGTAG AGTTTGCGAG TCGCTATACA AAGGCTTCGC TTATTGCCAT AACTGGCTCA AACGGTAAAT CTACAACCAC CCATTTAGCT TATCATTTAC TGAAAGCAGG TGGACTAAAT GTAGGTATAG CAGGAAATAT AGGGACAAGT TTTGCAAGAA AAGTTTTATT AGAAGAGCAT AATTATTATG TACTAGAGCT CAGCAGTTTT CAATTAGAAC ATCTACAGAC CTTTAAAGCA GATATAGCTT GTATACTCAA TATTACACCT GATCATTTAG ATCGTTATGA CAATCAACTA AATAATTATA TAGCCGCTAA GTTTAGGATA TTACGTAATA TGATTAAAGA GGGGTATTTT ATTTATAATC AAGATGATAC TAACATACGT GAGTACCTTG TTAAACAAAC TACTAATTTA CCTCAACTCT ATCCTATTTC TTTGACCCAA TCGACCCATC ACGGTGCTTA TGTAGAAAAT AACCATCTTC ATTTTGTAGG AGACAACTAC AATTTTCAAC TACCTACCCA AACCTTACCC CTGCTAGGCA AGCATAATAT ATATAATACT ATGGCTGCTA TCTCGATAGC TAGTCTATTA GGCCTCTCTT ACGCTTCTAT CTTAGATGGG CTAAGAACTT TTAAAGGGCT ACCTCACCGC ATGGAATGGA TTGCTAACGT TAATCAAGTA AGTTTTTATA ATGATTCTAA AGCCACCAAT GTAGAATCAG CTGCCGTAGC TCTAGAAAGC TTTACAAGTC CGATTATCTG GATAGCAGGT GGCTATGATA AAGGGAATGA TTATAATGTA CTAAAACCAA TAGTAAAAAA TTCTGTAAAA GCACTCATTT GCTTAGGAAA AGATAATCAA ACTATCTGCA AAGCTTTTAA GGAATCCAAT ATTCCTATTT ATGAAACACA AGTTATGCAG GAAGCAGTAA GCCTAGCATA TACCCTAGCA AAACCACAAG ATATAGTGCT ATTGTCACCT GCTTGTGCTA GCTTTGATTT ATTTAAAAAT TTTGAAGATA GAGGAGAACA GTTTAGGTAT GCAGTACAAC AACTTTTAAC TTCAATAAAT TAA
|
Protein sequence | MKKLVVLGAG ESGTGAALLA QAKGYQVFVS DKNIITKGYK QQLLDHKIEF EECKHTWDKI KIADEVIKSP GIPNHIAIIQ ALQEASIPII DEVEFASRYT KASLIAITGS NGKSTTTHLA YHLLKAGGLN VGIAGNIGTS FARKVLLEEH NYYVLELSSF QLEHLQTFKA DIACILNITP DHLDRYDNQL NNYIAAKFRI LRNMIKEGYF IYNQDDTNIR EYLVKQTTNL PQLYPISLTQ STHHGAYVEN NHLHFVGDNY NFQLPTQTLP LLGKHNIYNT MAAISIASLL GLSYASILDG LRTFKGLPHR MEWIANVNQV SFYNDSKATN VESAAVALES FTSPIIWIAG GYDKGNDYNV LKPIVKNSVK ALICLGKDNQ TICKAFKESN IPIYETQVMQ EAVSLAYTLA KPQDIVLLSP ACASFDLFKN FEDRGEQFRY AVQQLLTSIN
|
| |