Gene Apar_0818 details

Gene Information

Locus tag: Apar_0818
Symbol:
ID: 8413683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 899639
End bp: 900697
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 42%
IMG OID: 645022400
Product: Aldose 1-epimerase
Protein accession: YP_003179838
Protein GI: 257784621
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2017] Galactose mutarotase and related enzymes
TIGRFAM ID:
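
The coordinate and length fields in this record agree with one another, and the arithmetic can be checked directly. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 899639
end_bp = 900697
reported_gene_length = 1059    # bp
reported_protein_length = 352  # aa


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)        # True
print(computed_protein_length == reported_protein_length)  # True
```

Under those assumptions, 900697 - 899639 + 1 gives 1059 bp, and 1059 / 3 - 1 gives 352 aa, matching the reported values.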


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.376675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.120929
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGATG TTAGGCCTTT TGATAGTTTT GTTAACTCTC CCGCTGCTCT TACTTATACT 
ATTCGAGGCG CTCATTCAGC AATTAAATTA AGTAATTTTG GCGCCACTAT TCTTGACATT
TCAGTACCAG ATGTCTATGG AACACAAGCA GATGTAGTTC TTGGATATGG CCTGTTTGAC
CTGTATTTAG ATAATCCTGC TTGCTTTGGT GCCTCCATTG GACCTTCTGC AAATCGTGCG
GATAAAGCAG AGATTCCACT TAATGGAGTT GTCTATCATC TTCCTAAAAA CAATGGTCCA
AATAATCAAA ATAATCTTCA CACTGATTTA GTTGATGGTA TTCATAAGCG CATTTGGCAA
GCAGAAATCG ATGAATCACA TAATACCGTG ACATTCAGCA TTGACCTGAT AGATGGAGAA
TATGGACTAC CAGGTAACCG CCATATCACC GCTACATATG AGCTTGTTGA AGAATCTGCA
CAGTCAACGG TAAATCTTAC GTATGCCTGT ACTACTGATG CTGCAACATT CGTAAACATG
ACCAACCACG TGTACTTTAA CCTCAACGGT CACGATTCCG GAGATGTCTG CGGTCACCAA
CTCACTATCC AGGCAGAATC ATACCTTCCT CTACGAGAAG ATTCAGTTTC TGCAGGAATC
GTAAACTCTG TTGCAGGAAC TCCTTTTGAT TTCCGTACAC CTAAGGCCAT TGGAAAAGAT
CTTGGTGTTG AAAACGAGCA GCTTAAAATT GCTCACGGCT ATGATCATTG CTTTGTAATT
AACAATTACA AAAATGGTCA GCTTCGCCCC GCTCTTCTCG CCACTTCAGA AGGCGGTCGA
TCTCTTGAAA TTCAAATCAC CGCTCCCGGT GCTCATCTAT ATACTGGCAA CTGGCTTGAT
GAGGCACGCG CAAAAGACGG TGCTATCTAC AAACCTCAAG CTGGCTTTGC ATTTGAAAGC
GAATTTTATC CAGACTGTGC TCACCATGCA GAGTGGCCTC AGCCTATTTG CACACCTGAG
CATCCTTACA ACTCACAAAT TGTTTATCGA TTCTTTTAA
 
Protein sequence
MIDVRPFDSF VNSPAALTYT IRGAHSAIKL SNFGATILDI SVPDVYGTQA DVVLGYGLFD 
LYLDNPACFG ASIGPSANRA DKAEIPLNGV VYHLPKNNGP NNQNNLHTDL VDGIHKRIWQ
AEIDESHNTV TFSIDLIDGE YGLPGNRHIT ATYELVEESA QSTVNLTYAC TTDAATFVNM
TNHVYFNLNG HDSGDVCGHQ LTIQAESYLP LREDSVSAGI VNSVAGTPFD FRTPKAIGKD
LGVENEQLKI AHGYDHCFVI NNYKNGQLRP ALLATSEGGR SLEIQITAPG AHLYTGNWLD
EARAKDGAIY KPQAGFAFES EFYPDCAHHA EWPQPICTPE HPYNSQIVYR FF
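
As a consistency check, the first and last rows of the gene sequence can be translated and compared with the two ends of the protein sequence. The following is a minimal sketch, not the method used to produce this record. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation (the two differ in permitted start codons). The helper name `translate` is hypothetical, and only two rows of the sequence are copied in.

```python
from itertools import product

# Standard codon table built in NCBI "TCAG" order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence, copied from this record.
# Each full row holds 60 bases, so every row starts in frame.
first_row = "ATGATTGATG TTAGGCCTTT TGATAGTTTT GTTAACTCTC CCGCTGCTCT TACTTATACT"
last_row = "CATCCTTACA ACTCACAAAT TGTTTATCGA TTCTTTTAA"

print(translate(first_row))  # MIDVRPFDSFVNSPAALTYT
print(translate(last_row))   # HPYNSQIVYRFF*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 12 residues followed by the stop codon.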