Gene Apar_0819 details

Gene Information

Locus tag: Apar_0819
Symbol:
ID: 8413684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 900739
End bp: 901950
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 48%
IMG OID: 645022401
Product: Galactokinase
Protein accession: YP_003179839
Protein GI: 257784622
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID:
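
The start, end, gene length and protein length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon, and the function names are hypothetical, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(900739, 901950)
print(length, protein_length_aa(length))  # prints: 1212 403
```

Under these assumptions the computed values agree with the listed Gene Length (1212 bp) and Protein Length (403 aa).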


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.847114
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0773074
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCACA CGTTTAACGA GAAAACCACA GCTCAGCTTA ATCGCGCAAA AGAGCACTTT 
GAAAAAATGT TCGGAAAAGC TAATCGCTAT CTTGCAGTAC ACGCGCCTGG TCGCTCCGAA
ATTGCAGGAA ACCACACTGA TCACGAGGGT GGCCATGTTA TTGCTGGAGC CTTAGACGTT
GCAATTAATG CTATTTGTGC CCCAAATAAC CTGGGTGTTA TTCGTGTAGC AAGTGTTGGT
TATGATCCTT TTGAGATAGA TTACACAAAC TTAGAACCTT CAGAGGCTGA ATACCTAACT
ACTCAAGCCA TTGTTCGCGG CATGGCAGCC AACCTGGTCA AGCTTGGTTT TAAGCCAACC
GGTTTTGATA TGGCAGTAAT AAGCGACGTA CCTGGAGGCG GTGGACTCTC CTCCTCTGCT
GCTTTTGAAG CTGCAACAGG CCGCGCAATG GAGGCGCTCT GGAAAGGTGG CAGTGAGATT
TCTGCTGTTA AACTTGCTCA AATGAGTCAG AATACAGAAA ACGTCTTCTT TGGTAAGCCT
TGCGGTCTTA TGGACCAGCT TGCTGTTTGC CTTGGTGGAC TTGCCTTTAT GAACTTTGAG
GATACAGCTC AACCGCAAGC AGAAAAGCTG GACCTCAACT TTGAAGATTA CGGCTATGCG
CTCTGCCTTG TTGACGTTGG CTGCGACCAC GTTGCTTTCA CTGATGAGTA TGCTGCCGTT
CCTATTGAAA TGCAGAAAGT TGCAGCAGCT TTTGGCAAAA CTCGCCTATC TGAAGTTCCC
GTTGAAGAAT TCCAGGCTCA CGTTAATGAG TTGCGAGAAG ATCTTGGAGA CCGCGCTCTT
CTCCGTGCTA TTCACTACTG GTATGAGAAT GACCTGGTAG ACAAGCGTTG GGAAAACCTT
CAAAACTTTG ATATTAAGTC CTTTATTGCA CTCACCAATG CTTCTGGTGC AAGTTCAGGT
ATGTATCTGC AGAATGTTTC TACTTCCGGC TCTTACCAAC CGGCAATGCT TGCTCTTGGT
TTAGCCGAAA GCATCTTAAA AGGTTCTGGC GCCGTTCGTA TTCATGGCGG TGGTTTTGGT
GGCTCTATCC AGTGCTTTGT TCCTCTCGCC CTTGTCGAGA CCTTCATTGC ACAGATGAAC
CAGTGGTTTG GCGAAGGTGC TTGTCGTCAC TACGCCATCT CTGACCAGGG AGCTTGCGCA
CAATGGCTGT AG
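
The gene sequence above is printed in blocks of 10 bases, so the whitespace has to be removed before any computation such as the GC content reported in the record. A minimal sketch, assuming the sequence is pasted in as a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = clean_sequence(
    "ATGGCTCACA CGTTTAACGA GAAAACCACA GCTCAGCTTA ATCGCGCAAA AGAGCACTTT"
)
print(len(first_line), round(gc_fraction(first_line), 2))
```

Running `gc_fraction` on the full cleaned sequence would be the way to compare against the listed GC content of 48%.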
 
Protein sequence
MAHTFNEKTT AQLNRAKEHF EKMFGKANRY LAVHAPGRSE IAGNHTDHEG GHVIAGALDV 
AINAICAPNN LGVIRVASVG YDPFEIDYTN LEPSEAEYLT TQAIVRGMAA NLVKLGFKPT
GFDMAVISDV PGGGGLSSSA AFEAATGRAM EALWKGGSEI SAVKLAQMSQ NTENVFFGKP
CGLMDQLAVC LGGLAFMNFE DTAQPQAEKL DLNFEDYGYA LCLVDVGCDH VAFTDEYAAV
PIEMQKVAAA FGKTRLSEVP VEEFQAHVNE LREDLGDRAL LRAIHYWYEN DLVDKRWENL
QNFDIKSFIA LTNASGASSG MYLQNVSTSG SYQPAMLALG LAESILKGSG AVRIHGGGFG
GSIQCFVPLA LVETFIAQMN QWFGEGACRH YAISDQGACA QWL
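
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own pipeline: it builds the codon table by hand in the usual NCBI base order (T, C, A, G). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted; since this gene starts with ATG, no special start-codon handling is included. The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGGCTCACA CGTTTAACGA GAAAACCACA"))  # prints: MAHTFNEKTT
print(translate("CAATGGCTGT AG"))                      # prints: QWL
```

Both fragments match the corresponding start and end of the listed protein sequence.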