Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0819 |
Symbol | |
ID | 8413684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 900739 |
End bp | 901950 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645022401 |
Product | Galactokinase |
Protein accession | YP_003179839 |
Protein GI | 257784622 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.847114 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0773074 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCACA CGTTTAACGA GAAAACCACA GCTCAGCTTA ATCGCGCAAA AGAGCACTTT GAAAAAATGT TCGGAAAAGC TAATCGCTAT CTTGCAGTAC ACGCGCCTGG TCGCTCCGAA ATTGCAGGAA ACCACACTGA TCACGAGGGT GGCCATGTTA TTGCTGGAGC CTTAGACGTT GCAATTAATG CTATTTGTGC CCCAAATAAC CTGGGTGTTA TTCGTGTAGC AAGTGTTGGT TATGATCCTT TTGAGATAGA TTACACAAAC TTAGAACCTT CAGAGGCTGA ATACCTAACT ACTCAAGCCA TTGTTCGCGG CATGGCAGCC AACCTGGTCA AGCTTGGTTT TAAGCCAACC GGTTTTGATA TGGCAGTAAT AAGCGACGTA CCTGGAGGCG GTGGACTCTC CTCCTCTGCT GCTTTTGAAG CTGCAACAGG CCGCGCAATG GAGGCGCTCT GGAAAGGTGG CAGTGAGATT TCTGCTGTTA AACTTGCTCA AATGAGTCAG AATACAGAAA ACGTCTTCTT TGGTAAGCCT TGCGGTCTTA TGGACCAGCT TGCTGTTTGC CTTGGTGGAC TTGCCTTTAT GAACTTTGAG GATACAGCTC AACCGCAAGC AGAAAAGCTG GACCTCAACT TTGAAGATTA CGGCTATGCG CTCTGCCTTG TTGACGTTGG CTGCGACCAC GTTGCTTTCA CTGATGAGTA TGCTGCCGTT CCTATTGAAA TGCAGAAAGT TGCAGCAGCT TTTGGCAAAA CTCGCCTATC TGAAGTTCCC GTTGAAGAAT TCCAGGCTCA CGTTAATGAG TTGCGAGAAG ATCTTGGAGA CCGCGCTCTT CTCCGTGCTA TTCACTACTG GTATGAGAAT GACCTGGTAG ACAAGCGTTG GGAAAACCTT CAAAACTTTG ATATTAAGTC CTTTATTGCA CTCACCAATG CTTCTGGTGC AAGTTCAGGT ATGTATCTGC AGAATGTTTC TACTTCCGGC TCTTACCAAC CGGCAATGCT TGCTCTTGGT TTAGCCGAAA GCATCTTAAA AGGTTCTGGC GCCGTTCGTA TTCATGGCGG TGGTTTTGGT GGCTCTATCC AGTGCTTTGT TCCTCTCGCC CTTGTCGAGA CCTTCATTGC ACAGATGAAC CAGTGGTTTG GCGAAGGTGC TTGTCGTCAC TACGCCATCT CTGACCAGGG AGCTTGCGCA CAATGGCTGT AG
|
Protein sequence | MAHTFNEKTT AQLNRAKEHF EKMFGKANRY LAVHAPGRSE IAGNHTDHEG GHVIAGALDV AINAICAPNN LGVIRVASVG YDPFEIDYTN LEPSEAEYLT TQAIVRGMAA NLVKLGFKPT GFDMAVISDV PGGGGLSSSA AFEAATGRAM EALWKGGSEI SAVKLAQMSQ NTENVFFGKP CGLMDQLAVC LGGLAFMNFE DTAQPQAEKL DLNFEDYGYA LCLVDVGCDH VAFTDEYAAV PIEMQKVAAA FGKTRLSEVP VEEFQAHVNE LREDLGDRAL LRAIHYWYEN DLVDKRWENL QNFDIKSFIA LTNASGASSG MYLQNVSTSG SYQPAMLALG LAESILKGSG AVRIHGGGFG GSIQCFVPLA LVETFIAQMN QWFGEGACRH YAISDQGACA QWL
|
| |