Gene Emin_0921 details

Gene Information

Locus tag: Emin_0921
Symbol:
ID: 6262623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1020784
End bp: 1022439
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 43%
IMG OID: 642611400
Product: PTS system, glucose-like IIB subunit
Protein accession: YP_001875811
Protein GI: 187251329
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
        [COG1264] Phosphotransferase system IIB components
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
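
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1020784
end_bp = 1022439
gene_length_bp = 1656
protein_length_aa = 551


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```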


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000171728
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 9.61967e-19
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTCTGAAA ACGGTATTAT AAGCAAAGTT TTTACTGTAT TGCAAAAAGT AGGCCGTTCG 
TTTTTCCTGC CCATATCAAT TCTTCCCGTA GCCGGGCTTT TGCTGGGCAT CGGGGCGTCA
TTTTCCAATA TGGCAACGGT TGCGGATTAC GGCCTTGAAA GTATAATGGG CGCGGGTACG
TTTCTCAACT ATACTTTTAT AATTATGAGC GCGGTGGGCG GCGCAGTTTT CGCTAACCTG
CCTTTAATTT TCGCCATGTC GGTAGCTTTG GGCATGGCCA ATGAAGAAAA AGCGGTCGCC
ACATTATCGG CGGCCATTTC TTTTATTATC ATGCACGTTA CAATAAGCAA AATGCTTTTG
TTTACAGGCT ATATTCTGCC CGACGGCGCC CTGGCCGAAA AAGTTGTGGC GGGAACCATA
GGCACGGTAC TTGGCATACA ATCGCTTGAA ATGGGCGTGT TCGGCGGTAT CGTGGTAGGC
TTGGGCGTGG CCGCTTTACA TAATAGATTT TACAAGATAG AGTTGCCCGT GTTTTTATCT
TTTTTCGGCG GTATAAGATT TGTGCCTATT ATATGCACAT TTGTGTTTCT TTTGGTCGGC
GCAGGTTTCT TTTTTGTATG GCCGCCTATA CAAAAGTTAA TTTTGGCAAG CGGGCAGCTG
GTTATTAAAT CGGGCTATTT TGGCTCGTTT ATTTACGGTT TTATGGAACG CGCTTTAATA
CCCTTCGGCC TGCACCACGT TTTTTATATG CCTTTCTGGC AGACGGGACT TGGCGGAGCG
CAGTTAATAG ACGGCGTTAT GGTATATGGC GCGCAGAATA TCTTCTTTGC GGAGCTGGCT
TCCCCCAACA CACAGCACTT TACAATTGAA TCGGCCAGGT TCTTAACCGG CAAATACCCA
TTTATGATAG CGGGTCTTCC CGGTGCGGCG CTTGCCATGT ACCACACAGC TAAAACCCAT
AAAAAGAAAC TTGTGGGCGG GCTTTTGTTC TCAGCCGCTT TAACTTCTTT TTTAACAGGT
ATTACCGAAC CAATTGAGTT TACATTCCTC TTTGTTGCGC CGGTTGTATT TATTATACAC
TGCGGCTTTG CCGGCATAGC GTTTGTTCTT ACTCATTTAT TACAAATAGC TGTCGGAACC
ACGTTTTCCT GCGGATTTAT AGACCTTACC CTTTACGGTA TTTTGCAAGG ACACGCGAAA
ACAAACTGGA TGTGGCTTAT ACCCATATTT ATAGTTTATT TTATAGGTTA TTATTTCTTT
TTCAGGTTTG TTATAACAAA ATGGAATCTT ATGACCCCCG GCAGAGAACC TGACGAACAA
GACACAAAAC TTTACACAAA AGCAGATTAC CAGGCCAAAC AGCAAGACGG TAAAAGTGAA
ACAACCCCGT CGGCAGCACT GCCCGCTTCT AAAGACGAGC AGCTTGAAAC CATATTACAA
GGTTTGGGCG GTAAGGATAA TATTGAAAAT CTTGACTCTT GCGCCACAAG ATTAAGACTT
AATGTTAAAG ACCCCTCTTT AGTTAATAAA GATTTATTAA AAAAAGGCGG AGCTTTGGGC
GTGCTTTTAA AAGGCAACGG ATTACAGGTA GTATTCGGGC CTAAAGTAAG TTCAATCAAA
CCTAAGCTTG AAGAATATAT AAATAAAATG AGATAG
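
The gene sequence is printed in blocks of 10 bases, 60 per row, so the spaces and line breaks have to be stripped before the sequence can be measured. The sketch below is one way to do that and to compute a GC percentage; the helper names are hypothetical, and only the first row of the sequence is used as sample input to keep the example short. Pasting the full block instead would give the whole-gene figures.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used here as sample input only.
first_row = "ATGTCTGAAA ACGGTATTAT AAGCAAAGTT TTTACTGTAT TGCAAAAAGT AGGCCGTTCG"
seq = clean_sequence(first_row)
print(len(seq), gc_percent(seq))
```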
 
Protein sequence
MSENGIISKV FTVLQKVGRS FFLPISILPV AGLLLGIGAS FSNMATVADY GLESIMGAGT 
FLNYTFIIMS AVGGAVFANL PLIFAMSVAL GMANEEKAVA TLSAAISFII MHVTISKMLL
FTGYILPDGA LAEKVVAGTI GTVLGIQSLE MGVFGGIVVG LGVAALHNRF YKIELPVFLS
FFGGIRFVPI ICTFVFLLVG AGFFFVWPPI QKLILASGQL VIKSGYFGSF IYGFMERALI
PFGLHHVFYM PFWQTGLGGA QLIDGVMVYG AQNIFFAELA SPNTQHFTIE SARFLTGKYP
FMIAGLPGAA LAMYHTAKTH KKKLVGGLLF SAALTSFLTG ITEPIEFTFL FVAPVVFIIH
CGFAGIAFVL THLLQIAVGT TFSCGFIDLT LYGILQGHAK TNWMWLIPIF IVYFIGYYFF
FRFVITKWNL MTPGREPDEQ DTKLYTKADY QAKQQDGKSE TTPSAALPAS KDEQLETILQ
GLGGKDNIEN LDSCATRLRL NVKDPSLVNK DLLKKGGALG VLLKGNGLQV VFGPKVSSIK
PKLEEYINKM R
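
The protein sequence can be compared with a translation of the gene sequence. The sketch below assumes the forward strand and reading frame 1, and uses the standard codon assignments, which translation table 11 shares for internal codons (the tables differ in which codons may act as starts). The function and variable names are hypothetical. Only the first 60 bases are translated here, and the result matches the first 20 residues listed above.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, with the block spacing removed.
dna_start = "".join(
    "ATGTCTGAAA ACGGTATTAT AAGCAAAGTT TTTACTGTAT TGCAAAAAGT AGGCCGTTCG".split()
)
protein_start = translate(dna_start)
assert protein_start == "MSENGIISKVFTVLQKVGRS"
```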