Gene Apar_0346 details

Gene Information

Locus tag: Apar_0346
Symbol:
ID: 8413195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 395517
End bp: 396572
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 51%
IMG OID: 645021914
Product: UDP-glucose 4-epimerase
Protein accession: YP_003179368
Protein GI: 257784151
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID: [TIGR01179] UDP-glucose-4-epimerase
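
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record itself does not state: that the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function and variable names are illustrative only.

```python
# Values copied from the Gene Information fields; names are illustrative.
START_BP = 395517
END_BP = 396572


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
```

With these assumptions the computed values agree with the listed Gene Length (1056 bp) and Protein Length (351 aa).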


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATA CGTCTACCTC ATCAAATGGC TGCGTACTTG TTACTGGTGG TGCGGGCTTT 
ATTGGTAGCC ATACCGTTGT TGAACTGCTC AACTCCGGCT ACGAGGCTGT AATTGTTGAT
GACCTCTCCA ATGCCTCGGC AAAGGTTATT GATCGCATCA AAACCATTGT GGGAGAAGAG
AACGCCAAGC GTCTTTCGTT CTATAAGGCA GACGTCAATA ATGCAGACGC ACTGAACCGT
ATCTTTGACG AGCATCCTAT TAATTTTGTT ATTCATTTTG CAGGTTTCAA AGCTGTTGGC
GAGTCTGTTA CCAAGCCTAT TGAGTACTAC ACAAACAACC TGGGTAACAC CCTCACCCTT
CTTGATGTCA TGAGGAACCA TGGCTGCAAG TCAATTATTT TCTCGAGCTC TGCAACTGTC
TACGGAGACC CAGATAGTCT TCCTCTGACC GAGCGAAGTC CCAAGAAGAA CGCAACCAAC
CCTTACGGCT GGACCAAGTG GATGATTGAG CAAATTCTCA CCGACGTTCA CACTGCAGAT
CCAGAGTGGA ACGTTGTTCT GCTCAGATAC TTCAATCCTA TTGGCGCTCA CCAATCTGGC
CTCATCGGAG AAGATCCTGC AGGTATTCCA AATAACCTAG TCCCTTACGT GGCTCAGGTT
GCCGTGGGCA AGCGCGAGGC AGTTCACGTC TTTGGCGACG ACTACAACAC TCCAGATGGC
ACGGGCGTTC GCGACTACAT CCACGTCTGT GACCTTGGCT CCGGCCACGT TGCTGCACTT
AAGTGGATGG CTGGCCGTAC CGGCGTTGAG GTCTTCAACC TGGGTACCGG CACTGGTAGC
TCAGTACTTG ACGTTATCAA GGCATTCTCA AAGGCTTGCG GCAAAGAGAT TCCTTATGTC
ATTGAGCCTC GTCGTGCAGG TGACGTTGCA ACCAACTACG CCGACTGCCA GAAGGCTGCT
AAGGTACTTG GCTGGAAGGC TCAGTACAAT CTTGCCGACA TGTGCCGCGA CTCCTGGAAG
TGGCAGTCAA TGAACCCAGA CGGCTATCGC GGCTAG
 
Protein sequence
MSDTSTSSNG CVLVTGGAGF IGSHTVVELL NSGYEAVIVD DLSNASAKVI DRIKTIVGEE 
NAKRLSFYKA DVNNADALNR IFDEHPINFV IHFAGFKAVG ESVTKPIEYY TNNLGNTLTL
LDVMRNHGCK SIIFSSSATV YGDPDSLPLT ERSPKKNATN PYGWTKWMIE QILTDVHTAD
PEWNVVLLRY FNPIGAHQSG LIGEDPAGIP NNLVPYVAQV AVGKREAVHV FGDDYNTPDG
TGVRDYIHVC DLGSGHVAAL KWMAGRTGVE VFNLGTGTGS SVLDVIKAFS KACGKEIPYV
IEPRRAGDVA TNYADCQKAA KVLGWKAQYN LADMCRDSWK WQSMNPDGYR G
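
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The following is a minimal sketch of that step, together with a GC-fraction helper. It assumes GC content means the share of G and C among all bases, which the record does not define, and the helper names are hypothetical. Only the first line of the gene sequence is embedded as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases (assumed definition of GC content)."""
    seq = join_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = "ATGTCAGATA CGTCTACCTC ATCAAATGGC TGCGTACTTG TTACTGGTGG TGCGGGCTTT"
first_60 = join_blocks(first_line)
```

Applying `gc_fraction` to the full gene sequence, pasted in as one multi-line string, gives a value that can be compared with the GC content field listed under Gene Information.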