Gene Emin_0810 details


Gene Information

Locus tag: Emin_0810
Symbol:
ID: 6262504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 887055
End bp: 888305
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 45%
IMG OID: 642611288
Product: major facilitator transporter
Protein accession: YP_001875702
Protein GI: 187251220
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
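
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 887055, 888305

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1251
print(protein_length(length_bp))  # 416
```

Both results agree with the Gene Length (1251 bp) and Protein Length (416 aa) listed in the record.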


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.000000673475
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATGA CGCGTGAAAT TTTTATAATC ACACTCGCTA ACGCGCTTAT ATTGGCAGGC 
TATGGAATGA GCGTGCCTTT TTTTGCGATA TATCTTAACG TAGAGCGCGC TTTGCCCGCA
AGTATGGTAG GCGCGGTTAT AGCTATTTCC ACTATAGGCA GGGCTTTCGC GAGCGGCATA
AGCGGGGAAC TTTCGGACAT TTTCGGCAGA AAGAAATTAA TGAACGCCTC GCTCGTATTA
GGGGCGGTTT TTTTAGCTTT AATGGGCCTT GGAATGATTT TGGACGCGCA TTATATATGG
ATTATAATCT TTCATATGAT AGCAAGCTTT TTCGGCGCGT TTTTCCGTCC TTCGTCAAAC
GCCTGGGTGG CGGACAATGT CGCCCCGAAA AACAGGCTTG AAGCTTTCGG ATATATAAGA
ATAGGCCTTA ATTTGGGCTG GGCGGCAGGC CCCGCTTTGG GCGGTTTGAT AGCAAACTAC
TCTTTCGGCG CGGCTTTTAT TATCACCGCG GTATCTTATA TCGGCACGGC AGCTATGCTT
CAAGCAAAAA TTAAAGAAAC TTTTGTAAGA AGCAAAGAAC ATAAATCTAA ATTTACCGAT
ATGATTCTGG AACTTAAAAA CCAAAGGCTG GCTAAACTTT GCGCCCTTAT TTTTTTAATT
TCGGTAGTGG CGGCGCAGTT GGTAACGGGT CTTTCTCTGC ACGGGGTAAA ATATATAGGC
CTTACGCAGG CCCAAATAGG TTTATTGTTT ACGATAGACG GCTCCATAGT AGTCTTGCTG
CAATACCACG CAAGTAAAAT TATGACTGAT ATGCGCATAA CAACAGCTTT AATAATAGGC
TGTATAATAT ACGGAAGCGG ATACATTTTA GTAGGCGCGG CACACGGTTT TACCCTTGCC
GCAATAGGCA TAGCTCTTGT AGCCACGGGC GAAATGGCAA TATCACCCGG CAGCAATACA
TTAATTTCCA ACATCGCGCC CGAAAAAGCA AGAGGAAGAT ATTTGGGTAT GCAGGAAGTT
TCGCGCCATA TAGGCACGGC GGCCGGCATG TTCGGCGCGG GCGTTATGAT TGAGTATTTA
AGCCCGATAT CACAAATTAT TCCCTGGCTT TTAATAGGAT TTATTTCATT TACGGTAGCG
TTTTTGTTTT ATAAAATTAA ACCTATGTTT ACTGATGAGG AGGACGGCAT AAACCAAACG
CCTGTTCCTC CGCCGCCCGC GCTTGAGGAA ACGCAGATAA CGGGCAATTA A
 
Protein sequence
MKMTREIFII TLANALILAG YGMSVPFFAI YLNVERALPA SMVGAVIAIS TIGRAFASGI 
SGELSDIFGR KKLMNASLVL GAVFLALMGL GMILDAHYIW IIIFHMIASF FGAFFRPSSN
AWVADNVAPK NRLEAFGYIR IGLNLGWAAG PALGGLIANY SFGAAFIITA VSYIGTAAML
QAKIKETFVR SKEHKSKFTD MILELKNQRL AKLCALIFLI SVVAAQLVTG LSLHGVKYIG
LTQAQIGLLF TIDGSIVVLL QYHASKIMTD MRITTALIIG CIIYGSGYIL VGAAHGFTLA
AIGIALVATG EMAISPGSNT LISNIAPEKA RGRYLGMQEV SRHIGTAAGM FGAGVMIEYL
SPISQIIPWL LIGFISFTVA FLFYKIKPMF TDEEDGINQT PVPPPPALEE TQITGN
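
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
# Standard codon assignments, indexed with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGAAAATGA CGCGTGAAAT TTTTATAATC ACACTCGCTA ACGCGCTTAT ATTGGCAGGC"
last_line = "CCTGTTCCTC CGCCGCCCGC GCTTGAGGAA ACGCAGATAA CGGGCAATTA A"

print(translate(first_line))  # MKMTREIFIITLANALILAG
print(translate(last_line))   # PVPPPPALEETQITGN
```

The two outputs match the start and the end of the protein sequence in the record. Because every preceding display line holds 60 bases, the last line starts in frame.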