Gene Emin_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0447 
Symbol 
ID6262589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp480390 
End bp481340 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content41% 
IMG OID642610917 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001875341 
Protein GI187250859 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00302715 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000000109049 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGTTTG CCATAACGGG CGGAGCCGGT TTTATAGGCG GCGCGCTTAC AAAAAAATTA 
AATTCTATGG GCCACAGCGT TCGTATATTA ACAAGGGGCT CGGGCCGTAA ATCAGCTGAT
CCGCAAGTAG AATACATTAC CGCGAAGTAC ACGGATGTTG ATTCTTTAGC TAATGCGTTG
GAAGGATGTG ACGGCGTTTT TCATTTAGCC GCGGCGATAT TTGCTTTTAA TTATAAAGAA
TTTGAAGCAG CTAATGTCCT TACCACCCGT AATTTAGTTG ACGCCGCGGC TAAAACAAAC
AGCGTAAAAT ATTTTACCTA TATGTCAAGC CAGGCGGCGG GAGGATACAG CGCTGATTTG
GAACATATAA GAACCGAAGA CGATAAACCT AAACCCGCTT CAGATTACGG ACGTACAAAA
TTAGGGGGGG AAAACGCCGT TGAGTCCCTT CCCGCGCGTA TAAAAAAAAT AATATTTCGC
CCGCCAATAG TCTATGGTAA AAATGATTCA GGCGTAAGCA AAATAGCCGA TTGGGTAAAA
ATGGGCATAA TGGTTAACAC CTCTAAGGGG GACGCGTATT TTAACTTTAT TCATGTGGAC
GATTTGGTTA ATGCAATAGT TAAACCTATT GAAGACGAAT CTTTATTCGG CGGCATTTAC
TATGTATGCG AAAATAAACC TTATAATTGG AAATTTTTTA TATATTCAAT GGCGGACGCA
ATGAAAGTCA AACGCCCTTT TATGTTTACG GCGCCATTAT TTGTTTTACA CATTGTGGCG
TTTTTATATG AAATTATAGC CAAGCTTTTT AATATAGCCC CTGCTTTAAA TTACGATAAA
GTAAAGGAGG CCTCTATAAA AGGGCATTGG GTAAGCAGCA GTAAAAAATG GATTGACCGC
ACAGGCCAGC AGTTTACCTC TTTAGAGGAC GGACTTAGAA AAAGTTTTTA G
 
Protein sequence
MKFAITGGAG FIGGALTKKL NSMGHSVRIL TRGSGRKSAD PQVEYITAKY TDVDSLANAL 
EGCDGVFHLA AAIFAFNYKE FEAANVLTTR NLVDAAAKTN SVKYFTYMSS QAAGGYSADL
EHIRTEDDKP KPASDYGRTK LGGENAVESL PARIKKIIFR PPIVYGKNDS GVSKIADWVK
MGIMVNTSKG DAYFNFIHVD DLVNAIVKPI EDESLFGGIY YVCENKPYNW KFFIYSMADA
MKVKRPFMFT APLFVLHIVA FLYEIIAKLF NIAPALNYDK VKEASIKGHW VSSSKKWIDR
TGQQFTSLED GLRKSF