Gene Emin_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1040 
Symbol 
ID6262903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1133330 
End bp1134256 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content42% 
IMG OID642611520 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001875930 
Protein GI187251448 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.56373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACA GAAAATATAC ATGGGCTATA ACGGGCGGAG CGGGGTTTAT AGGCTCGCAC 
ACAGTACGTG AACTTTTAAA AAACGGCCAA AATGTTATTG TCATAGATAA TACCAAACAC
ATAGGCAAAA CCCCTTTAGC GCCCTTTGCC GACCGGGTTA CCTTTTTAAA CTTTGACGTA
AGAAATTTTG AAAATATCCT TAACGCTTTA AAAAATGTTG ATTATGTTAT CCATTTAGCG
GCCTTGGTGT CGGTAGCGGA ATCAATGCAC AACCCTCAGT TATCGCTTGA AATAAATATA
CACGGCACAG CCAATGTTTT GGAAGCCGCC AGACTAAACA AAGTTAAACG TTTTATTTTC
GCGTCATCCA GCGCGGTATA CGGCAATAAC CCGGACGCGC CTTACCAGGA AACAGCCCAA
ACAAACATTC AATCCCCATA TGCTTTAGGC AAACTGGCGG GGGACGAGCT TTGCCAAATG
TACACTGATT TATACGGGCT TGAAACTGTT ATATTAAGAT ACTTTAACGT CTTTGGCCCC
GGGCAGGACG CCGACTCACC TTATTCGGCC GTTATAGCTA AATTTATAGC TTTAGCTAAA
GAAAATAAGT CTTATAATAT CCAGTGGGAC GGCACCCAAA CACGTGATTT TATTTATGTG
TCGGACGTGG CCAACGCCAA CCTGCTTGCC GCCGCTAAAG CTAAACCCGG CGAAATTTAC
AATGTAGCCA GCGGACAAAC AACCACTTTA CTAAAACTTA CCGAAATGAT TGACGCCGTC
AGCGGCGTTA AAAATAAAAA AGAATTCTCC CCCAAAAGAG AAGGCGACGT AAAACATTCC
GCAGCGGTTA TTTCTAAAAT AGAAAAACTT GGTTTTAAGA CTACGATATC TTTGCAAGAA
GGCCTTAAAC TTATGTGGAA TAAATAA
 
Protein sequence
MFDRKYTWAI TGGAGFIGSH TVRELLKNGQ NVIVIDNTKH IGKTPLAPFA DRVTFLNFDV 
RNFENILNAL KNVDYVIHLA ALVSVAESMH NPQLSLEINI HGTANVLEAA RLNKVKRFIF
ASSSAVYGNN PDAPYQETAQ TNIQSPYALG KLAGDELCQM YTDLYGLETV ILRYFNVFGP
GQDADSPYSA VIAKFIALAK ENKSYNIQWD GTQTRDFIYV SDVANANLLA AAKAKPGEIY
NVASGQTTTL LKLTEMIDAV SGVKNKKEFS PKREGDVKHS AAVISKIEKL GFKTTISLQE
GLKLMWNK