Gene Information |
Locus tag | Aasi_0974 |
Symbol | |
ID | 6377117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1272089 |
End bp | 1273198 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642682098 |
Product | hypothetical protein |
Protein accession | YP_001958059 |
Protein GI | 189502342 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2971] Predicted N-acetylglucosamine kinase |
TIGRFAM ID | |
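The coordinates, gene length, and protein length listed above agree with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the gene length includes one terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1272089
end_bp = 1273198
gene_length_bp = 1110
protein_length_aa = 369

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon that is not translated.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp, codons, residues)
```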
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.131863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAGT ACCTCAAAAT AAAAATAGAC ATGGTAAACA GGATGTTTCT AGTAGCAGTT TCCCTTTTTT CTATTCTTGG CCAACAGGTT ATTGCCGATG ACCTAATAGT AGTACCTCAT GCTGACTACA TTTTATGTAT TGATGGACGT GGTTCTAAAA CTTCTTTACA AGTTGTTACT ACCCAAGGAG CAGTTATTCC TTTACAAGGA CCCGCGGGCA TAGTACAAGA AATTTATACC GAAGGAAGTA ACGTTGCAAG TTTGGGTTGG GATTTAGTAC AGAAACGACT AGAAAAACTT TTAAACCAAG TTAAGTTCCC TCCAGGTAAC AATCCTTTAC AGAATAAAAG CTCTGTTGCA GTTGTTGCAG GTTTTGCAGG CATTGGGCTC CCAGAAATAC GCCAAAAATT TATCGATTTA TTTCAACAAT GGGGTTTGAA CCCAGATAAA ATTGTTGTAA CCACAGATAT TAACTTAGCT AAAGAGCTTT TAAGCCAAAA AGATGGGGCT GTATTGATTG CCGGATTAGG CTCTGTTGCT TTTGTCAAGC ATCAGGGACA TTGCTTGCGC TTTGGGGGGC TTGGATGGTA CTTAGGAGAT GAAGGGAGTG GTTTCTCTGT AGGAAAAAAG GCTATAGCTG CAGCTATAGC TGAAGATAAG GGTTTTGGTA TGAAAACAGC TTTGACTCCT ATTTTAAAAG AAATGTTTCA AAAACAAGAA CTATATCGTC TAATTCCACT TTTGCAGGAT GGCACCATCA GTTCTGAACA GGTAGCAGCG ATTGCTCCAA TAGTTTTTGA ATGCGCTTAT AGTAAAAAAG ACCCGGTAGC ACACCTTATT GTCAAGCTGG CAGCACAGGA GTTAGCTAGT TTAATTCGCC AAGGGATAGA GATGATTCAG AAAGAGTTAA AGCCTTTACC TGCCAACTGG CCGATCTACT TGATTGGTGG CCAATTTAAA GGTCCCTATG CACAGGCTTG GACCCAAGAA CTCTGGTCAT TTTTACCACA AAGGGGAAAA ATGGTTCCAC ACAATCTAGC TAAGTCTAAT ACTACTACGG TGGTAGTGCA GCAAAAGTTA GCTGCAAGAC GAAACAAAAG GGGTTGGTAA
|
Protein sequence | MFKYLKIKID MVNRMFLVAV SLFSILGQQV IADDLIVVPH ADYILCIDGR GSKTSLQVVT TQGAVIPLQG PAGIVQEIYT EGSNVASLGW DLVQKRLEKL LNQVKFPPGN NPLQNKSSVA VVAGFAGIGL PEIRQKFIDL FQQWGLNPDK IVVTTDINLA KELLSQKDGA VLIAGLGSVA FVKHQGHCLR FGGLGWYLGD EGSGFSVGKK AIAAAIAEDK GFGMKTALTP ILKEMFQKQE LYRLIPLLQD GTISSEQVAA IAPIVFECAY SKKDPVAHLI VKLAAQELAS LIRQGIEMIQ KELKPLPANW PIYLIGGQFK GPYAQAWTQE LWSFLPQRGK MVPHNLAKSN TTTVVVQQKL AARRNKRGW
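The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the spaces between 10-base blocks are display formatting only. The helper names `clean` and `translate` are hypothetical. Translation table 11 shares its amino-acid assignments with the standard genetic code and differs only in the permitted start codons, so the standard assignments are used here.

```python
from itertools import product

# Amino-acid assignments in NCBI codon order (bases T, C, A, G; third base varies fastest).
# Table 11 uses these same assignments; only its set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base display blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        # Codons containing characters other than A, C, G, T map to "X".
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
gene_prefix = "ATGTTTAAGT ACCTCAAAAT AAAAATAGAC"
print(translate(gene_prefix))  # MFKYLKIKID
```

Passing the full gene sequence to `translate` in place of `gene_prefix` allows the result to be compared against the listed protein sequence once its block spacing is removed with `clean`.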