Gene Emin_0199 details


Gene Information

Locus tag: Emin_0199
Symbol:
ID: 6264042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 210427
End bp: 211551
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 42%
IMG OID: 642610663
Product: oxidoreductase domain-containing protein
Protein accession: YP_001875100
Protein GI: 187250618
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
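
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 210427
end_bp = 211551
gene_length_bp = 1125
protein_length_aa = 374


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # compare with Gene Length
print(expected_protein_length(gene_length_bp))  # compare with Protein Length
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.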


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAA TAAAAGTTGG TTTATTAGGC TGTGGTTTTA TGGGGCGCGC GCATATGGCG 
GGCTACCAAA CAATACCCCT GTATTACAAT AATGATTTTA AAATAAAGTT TGCGGGTGTT
TGCAACAGAA CGCTTGAAAA GGCCGAGTAT TTTAAAGAGG CCTTTGGCTT TGAATACGCG
ACTTCAAACC CGGAAGATAT TTTAAACGAC CCTTCTATAG ACGTTATTGA TATTTGCACG
CCAAACGCCA ATCATAAAAA TGAAATTTTA AAAGCATTAG ACAATAAAAA ACATATTTAT
TGCGAAAAGC CCGTTGTAGT GGGGGAAGAG GAAATTAAGG CCGTTCTTTC CCATCCTAAT
TTAGATAAAG TAACTACGCA GGTTGTTTTT AATAACCGTT TTTACCCCGC GGCTATACGC
GCCAAACAAT TAATTGGGGA AGGTCGTTTG GGCAAAATAT TTTCTTTTAG AGGGGTTATG
CTTCATAACA GTTCGGTTGA CGTTTTAAAG CCTGTATCCT GGAGGCAGAC GGTTGAGGGC
AACGGCGGAG TGGTTTATGA TTTAAGCGCG CATTTAAGCG ACATGGTTTA CAATCTTTTG
GGCGAATTTG AAAGCGTTTA CACTAAAAAC CAAATAGCCC ATCCCGTACG CAAGGATAAG
GACGGCAAAG ACGTTAAAAT CGAAATTGAG GACGCCACCT ACTCTTTAGT AAAACTTAAA
AACGGTGCTA TGGGCACAAT GGAAACAACA AAAATAGCCA CGGGTAAAAA CGGCAATTTT
AAGTTAGAAA TACACGGGGA AAAAGGCGCT TTGGCGCTTG ACCTTATTGA TCCTAACTGG
CTTTATTTTT ACGATAATAC GGCTCCTGCC TCCCCAAACG GCGGAATGAA GGGGTTTACA
AAAATAGAAA CCATGCAGCG CTATGGGGCG GATTGTGTTT TTCCTCCCGG CAACCACACT
TTGGGCTTTT TGCGCGCGCA TGTTGACTGT TTACATAACT TTTTAACCTG CGTAAGCAAA
GGCGCCGCGG CAAATCCTTC AATTAAAGAC GGGCTGTATG TTCAAAGCGT GCTTGAGGCC
ATGCATAAAT CAGCCAAAAC AAGAACAGAG GCTTTTGTTA GTTAA
 
Protein sequence
MKEIKVGLLG CGFMGRAHMA GYQTIPLYYN NDFKIKFAGV CNRTLEKAEY FKEAFGFEYA 
TSNPEDILND PSIDVIDICT PNANHKNEIL KALDNKKHIY CEKPVVVGEE EIKAVLSHPN
LDKVTTQVVF NNRFYPAAIR AKQLIGEGRL GKIFSFRGVM LHNSSVDVLK PVSWRQTVEG
NGGVVYDLSA HLSDMVYNLL GEFESVYTKN QIAHPVRKDK DGKDVKIEIE DATYSLVKLK
NGAMGTMETT KIATGKNGNF KLEIHGEKGA LALDLIDPNW LYFYDNTAPA SPNGGMKGFT
KIETMQRYGA DCVFPPGNHT LGFLRAHVDC LHNFLTCVSK GAAANPSIKD GLYVQSVLEA
MHKSAKTRTE AFVS
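
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below illustrates this on the first 60 and last 45 bases of the gene sequence. It assumes the codon assignments of table 11 match the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names `CODON_TABLE` and `translate` are illustrative only.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_60 = "ATGAAAGAAA TAAAAGTTGG TTTATTAGGC TGTGGTTTTA TGGGGCGCGC GCATATGGCG"
last_45 = "ATGCATAAAT CAGCCAAAAC AAGAACAGAG GCTTTTGTTA GTTAA"

print(translate(first_60))  # MKEIKVGLLGCGFMGRAHMA
print(translate(last_45))   # MHKSAKTRTEAFVS
```

The two printed fragments correspond to the first 20 and last 14 residues of the protein sequence listed above.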