Gene Emin_0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0994 
Symbol 
ID6262754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1081353 
End bp1082576 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content37% 
IMG OID642611474 
Productmajor facilitator transporter 
Protein accessionYP_001875884 
Protein GI187251402 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.404838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0000000117748 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAAATAA AACAATATTA TAATAATATT TTAGGTGAAA ATCTGGGTAG TTTTTTAAAG 
GCCAATATAA TGGCGTTTAT AGGCCTTAAT ATAGGTATTA TCGGCGTTAA CTGGTTTATT
ATAAACGTTA CGGGGCAAAA CAGGATTTTA GGTGTTTATG GGGCGGTTTC ATTAATAGCG
TCTTTTTGCG CGTTGCTGTT TTTCGGCTCT CTGGCTGATA AATATAATAA GATAAAAATA
CTTAAGTTTT GTCTGCTTAT AGAAGCATTT ATTTTTATTG CTGCGGCAGG GCTTAATTAT
TTGAACTTTC CGGTTATTTT TCTTATTTAC GGTTTGGCTG TGTTAAGCAT GCCTGTTATG
ATGCTGTATG CGGCGGTTTC CCGCGCCGCT TTGGCGCAAG TTGCTCCCGC GCAAAAACTT
ATTAAAGGCA ATTCTGTTTT TGAAATAGCA ATACAATGCG GTGCAGTTTT AGCGGCGCTG
GCTACCGGAT TTATATACCA CGGTTTCGGG TTTAACGTTC TTATGCTTAC CGCCTCTTTT
ACTTTATTGC TTTCTTACAT TATGTTAGAT GAAGATTTGG CCGGAACGGA TTTAAATTCC
AAATCTCACG CGGGGAAAAC CTATTTTGAA AATTTAAAAG AAGGTTTGCG GTATTTTAGA
GAAAATAAAG TTTTGCTGCT GTTTGGTTTA ATAGTGTTTT TCCCAGGTAT AGTTATCGCT
GCGTCTAATA CTGTTATTCC CGGTTATGTT GAGCAGTTTT TAAAGCAGGA TTCAAGAGTA
TACGGAGCGG GGGAAATGTT TTTTGCTTCC GGCGCGCTGC TTTCGGGTTT TCTTACAGCG
TGGGTTTCGT CATTTATAAA AAAAGAACTC CTGCAGTTTG TTTTATTTGT TTTATCTGCC
GCGGTGCTTT TTAGTTTCTC ATTAAACAGA TTTGTGGCGG GTTTTTATAT CGCGATATTT
CTAAGCGGTT TGTTTATAGC TTCTTTAAGA ATTATTTTAA ACGCTAAATT TATGGAGCTT
ACCGGCAAAG AATTTCTTGG GCGCACCATT GTGTTTTTAA CGGCAATTAC AACCGTTTTT
CAGGCCGCGT TGGTTTATTT TATAGGTTAT TATATGGACG TGTTTAAAGT TACCGACGGA
TATCTGATTT TAACAATAGT GATTTTGGCG GGATTTGCGG GGGTTTATAT TTTAAAACCG
GAACAAAAAA AAAGAGAGCC TTAA
 
Protein sequence
MKIKQYYNNI LGENLGSFLK ANIMAFIGLN IGIIGVNWFI INVTGQNRIL GVYGAVSLIA 
SFCALLFFGS LADKYNKIKI LKFCLLIEAF IFIAAAGLNY LNFPVIFLIY GLAVLSMPVM
MLYAAVSRAA LAQVAPAQKL IKGNSVFEIA IQCGAVLAAL ATGFIYHGFG FNVLMLTASF
TLLLSYIMLD EDLAGTDLNS KSHAGKTYFE NLKEGLRYFR ENKVLLLFGL IVFFPGIVIA
ASNTVIPGYV EQFLKQDSRV YGAGEMFFAS GALLSGFLTA WVSSFIKKEL LQFVLFVLSA
AVLFSFSLNR FVAGFYIAIF LSGLFIASLR IILNAKFMEL TGKEFLGRTI VFLTAITTVF
QAALVYFIGY YMDVFKVTDG YLILTIVILA GFAGVYILKP EQKKREP