Gene Information |
Locus tag | Aasi_0590 |
Symbol | |
ID | 6376392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 759643 |
End bp | 760701 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642681746 |
Product | hypothetical protein |
Protein accession | YP_001957720 |
Protein GI | 189502003 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
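The coordinate and length fields in the table above are related by simple arithmetic, and a short check can make that explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one trailing stop codon; the function names `span_length` and `coding_codons` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 759643
END_BP = 760701
GENE_LENGTH_BP = 1059
PROTEIN_LENGTH_AA = 352


def span_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming a single trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 760701 - 759643 + 1 gives 1059 bp, and 1059 / 3 - 1 gives 352 aa, which agrees with the reported gene and protein lengths.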
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTTA ATCTACCATT ATTTAAAAAA GTCAGTGAAA CACCCGGTGC TCCCGGATTT GAGCAGCGTA TTCGCCAACT AATTATAGAG GAAATAAGAA CTTTTGTAGA CCATGTAGAG GTTGACCATA TGGGTAATCT TATTGCTGTA AAATATGGTG TTCAGCAACC AAGCCATGAA AAGAAAGTAA TGGTAGCAGC ACATATGGAT GAGTTGGGGC TGATAGTTAA ATACATAGAC CAAGAAGGTT TTATTAGGTT TCATACATTA GGTGGATTTG ATCCTAGAAG TCTTATTGGT CAGCGGGTGA TCATACATGG TAAGCAAGAT TTGGTAGGCG TCATCGGCAT AAAAGCCATA CATTTTATGA CAGAAGAAGA AAGGAAACGC CCACTCGAAA TTAGTGATTT ATACATTGAC ATAGGTAGAA CGCAACAACA GGCTGCGACC TATATATCTA TAGGTGATTC TATTACACGC GAAAGAAGCT TGATAGAGTT AGGTAATTGT ATTACTGGCA AGTCGCTAGA TAATCGAACA GGTGTATTTG TGCTTATAGA AGCACTACGT ACCCTACAAG AAGTACCTTA TGATGTTTAT GCTGTTTTTA CAGTGCAAGA GGAAGTGGGC CTACGAGGTG CACAAGTGGC TGCACATCAT ATTGAGCCTT ATTTTAGCTT GGCATTAGAT ACCAGTACTT CATTAGATGT GCCTAATGTG CAACCCCATG ATAGGGTCGC TAGATTAGGT GATGGGGCAG GAATTAAAAT TATGGATGGC CATACCATAT GTGACTGTCG AATGGTAGAC TATCTGAAGA CAATAGCAAC TCAACATAAT ATTGCTTGGC AAACAGATAT TAAAGCAGTA GGAGGAACTG ATACTGCTCC CTTGCAACGT ATGCCTAAAA AAGGATCTAT TGCAGGAGCC TTAAGTATTC CTATACGCTA TGCTCACCAA GTGGTAGAAG TAGTACATCA AGCCGATGCA ATATCTGCTA TACAGCTTTT ACAACAAGCT TTAGTTGGCT TAGATACTTA TAGTTGGGAG CAAGTTTAG
|
Protein sequence | MQLNLPLFKK VSETPGAPGF EQRIRQLIIE EIRTFVDHVE VDHMGNLIAV KYGVQQPSHE KKVMVAAHMD ELGLIVKYID QEGFIRFHTL GGFDPRSLIG QRVIIHGKQD LVGVIGIKAI HFMTEEERKR PLEISDLYID IGRTQQQAAT YISIGDSITR ERSLIELGNC ITGKSLDNRT GVFVLIEALR TLQEVPYDVY AVFTVQEEVG LRGAQVAAHH IEPYFSLALD TSTSLDVPNV QPHDRVARLG DGAGIKIMDG HTICDCRMVD YLKTIATQHN IAWQTDIKAV GGTDTAPLQR MPKKGSIAGA LSIPIRYAHQ VVEVVHQADA ISAIQLLQQA LVGLDTYSWE QV
|
| |
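The protein sequence can be related to the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the tool that produced this record: it builds the codon table from the standard NCBI amino-acid ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not handle), strips the 10-base grouping spaces, and translates until the first stop codon. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard NCBI ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is translated with the ordinary table, so an
    alternative start codon (e.g. GTG or TTG) would not be rendered as M.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein
# sequence, copied from the record.
gene_prefix = "ATGCAACTTA ATCTACCATT ATTTAAAAAA"
protein_prefix = "MQLNLPLFKK"
print(translate(gene_prefix) == protein_prefix)
```

The same function can be applied to the full gene sequence and the result compared against `clean()` of the full protein sequence.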