Gene Emin_0368 details

Gene Information

Locus tag: Emin_0368
Symbol:
ID: 6262638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 393298
End bp: 394644
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 43%
IMG OID: 642610834
Product: sugar transporter
Protein accession: YP_001875262
Protein GI: 187250780
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00879] MFS transporter, sugar porter (SP) family
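
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the Start/End coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon (the gene sequence listed on this page ends in TAA). The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 393298
end_bp = 394644

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)       # 1347, matching "Gene Length"
print(gene_length_bp % 3)   # 0, the length is a whole number of codons
print(protein_length_aa)    # 448, matching "Protein Length"
```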


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.968703
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00000305958
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAATAGAA AAGTATTAAA GGTATCTCTT ATAGCCGCTT TGGGCGGGCT TTTGTTCGGG 
TTTGACACAG CGGTTATTTC CGGCACTACT GAAGCGTTGA CAAAAGTTTT TTCTTTAACG
CCTTCATCTT TAGGTTTTAC TGTGGCTATC GCGCTTATAG GCACCATATT AGGCGCGGTT
TTTGTGGGTT ATCCCGCAAA CTCTTACGGC AGGAAAAACA CTCTTAAAAT GATCGCGCTG
TTATATTTCT TTTCTTCTTT AGGCACGGCC ATGGCGTGGA ATTGGGGCGT TTTTTTAACG
TTTAGATTTT TAGGCGGCAT AGCTGTAGGT GCTTCAAGCG TTGTGGCTCC TATGTATATT
GCGGAAATAG TGCCCGCTTC GTTTAGGGGC CGTATGGTCG CTTTAGCCCA GTTTAACGTG
GTGTTTGGTA TTTTACTTGC GTTTTTCTCT AACCTTATAA TAAGTAATGT AATTACTGGC
CCCGCGCAAT GGCGCTGTAT GTTAGGCATA TTGGCGGTTC CTTCTATTAT TTTCTTTGGT
CTTTTGTATT TGATTCCGTT CAGCCCCCGC TGGCTTGCTT CCAAAGGCAG GGTTGAGGAA
GCAGGTTCTA TTATAAAATA TTTAGCCGGA CCAGAAGACA ATGCCGAAAA GGCTTTAAAA
GAAATTGTGG ATTCCATTAA ATCAGACAGT CAAAGCAAAA ACGAAAAATT GTTTTCCACA
AAATACACAA AAGTTATTTT ACTTGCTGTT GCCATAGCGG CGTTTAACCA GCTTTCAGGC
ATTAACGCCG TGCTTTATTA CGCGCCGTAT ATTTTTAAAA TGGCCGGCGC GGGCACTAAC
GCGGCCTTAA TTCAGTCTGT TGTTGTGGGC TTTACAAACC TTATATTTAC AATGGCGGCA
TTACTTGTTA TTGATAAACT TGGCCGCAGA AAGCTTATGT TAACAGGTTC TTTGGGGTAT
ATAGTAAGTT TAGGTGCGCT TACGGTTATT TTCGCGGCGC AAGGCTCAGT CTTTTCTCCT
TTGGGAGGGG CTTTGGTTTT GGCAAGTTTG GTTGTTTTTA TAGCTTCACA CGCTTTCGGG
CAGGGCGCTG TAATTTGGGT GTTTATAAGC GAAATTTTCC CCACTAAGGT ACGCGCGCAG
GGTTCTGCTT TGGGCAGCTT TACGCACTGG ATTATGGCTG CCGTAATAAG CTGGACTTTC
CCGATATTTG CCAATATATC CGGCGCTGTT ATATTCGGCG TCTATACTTT CTTTATGGTG
TTGCAGCTGC TTTGGGTAAT TTTCATTATG CCCGAAACAA AAGGTATACC TCTTGAGAAA
ATGACAAAAG AGCTTGGCAT AGAATAA
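
GC content is the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as plain text with the 10-base spacing used above. `gc_percent` is a hypothetical helper name, and how this page rounds its reported GC figure is not stated, so the full-gene result may differ slightly from the listed value.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first two 10-base groups of the gene sequence only.
first_groups = "GTGAATAGAA AAGTATTAAA"
print(gc_percent(first_groups))
```

To apply it to the whole gene, pass the full sequence block as a single string; the spaces and line breaks are stripped before counting.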
 
Protein sequence
MNRKVLKVSL IAALGGLLFG FDTAVISGTT EALTKVFSLT PSSLGFTVAI ALIGTILGAV 
FVGYPANSYG RKNTLKMIAL LYFFSSLGTA MAWNWGVFLT FRFLGGIAVG ASSVVAPMYI
AEIVPASFRG RMVALAQFNV VFGILLAFFS NLIISNVITG PAQWRCMLGI LAVPSIIFFG
LLYLIPFSPR WLASKGRVEE AGSIIKYLAG PEDNAEKALK EIVDSIKSDS QSKNEKLFST
KYTKVILLAV AIAAFNQLSG INAVLYYAPY IFKMAGAGTN AALIQSVVVG FTNLIFTMAA
LLVIDKLGRR KLMLTGSLGY IVSLGALTVI FAAQGSVFSP LGGALVLASL VVFIASHAFG
QGAVIWVFIS EIFPTKVRAQ GSALGSFTHW IMAAVISWTF PIFANISGAV IFGVYTFFMV
LQLLWVIFIM PETKGIPLEK MTKELGIE
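
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below shows one way to reproduce that translation without external libraries. It assumes the input is a complete coding sequence in frame, and it treats an alternative start codon such as GTG as methionine when it appears in the first position. `translate_cds` and the constant names are hypothetical, not part of the record.

```python
from itertools import product

# Table 11 amino acids in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 accepts as translation initiators.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate_cds("GTGAATAGAA AAGTATTAAA GGTATCTCTT"))  # MNRKVLKVSL
```

The printed peptide matches the first ten residues of the protein sequence, including the leading M that comes from the GTG start codon.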