Gene Emin_0920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0920 
Symbol 
ID6262622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1019312 
End bp1020622 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content41% 
IMG OID642611399 
Productamino acid/peptide transporter 
Protein accessionYP_001875810 
Protein GI187251328 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000308736 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value5.10372e-19 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACCCAGG AAACAGCAGT TAAACAAAAA CAGCCTTCGG GCCTTTATTT ACTTTTCGCC 
ACGGAGATGT GGGAAAGGTT CAGCTACTAC TCTTTAAGAG GTTTGTTCGT TTTATACTTA
ACAAAAGCTT TAGCTTTTGA CGTGCCTAGA GCCACAAGTC TTTACGGCAC GTTTACCAGT
CTTATATATC TTTCGCCCCT TTTGGGCGGT TATATGGCCG ACAGGTGGCT CGGTAAAAGA
AGTTCTATTA TAATAGGCGG CATTTTAATA GCCGCGGGCC AGTTTGTTAT GGGTACGGGC
GGCATAGGCG CCGTTTACGT TGCTATGGGC CTTATTATTT TGGGTAATGG TTTTTTTAAA
CCCAATATTT CAAGCATTTT AGGCGAAATA TATGAAAAAA ATGACGTAAG ACGAGACGGC
GGTTTTACCA TTTTCTACAT GGGTATTAAC CTTGGTTCTT TTTTAGCTAA CTTAATAGCC
GGCACAATAG GCGAAAAAGT GGGCTGGGTA TACGGCTTTT GGACGGCGGG TTTCGGCATG
ATTTTAGGCC TTATCATTTT TATCTGGGGT AAAGATAAAT TCTTACAAGG CAAAGGCCAC
GCCCCCAAAC ATTACGCAAA AATAGAAAAA GAAGCCGGTA AAGAAGAAGA AAAGAAACCT
CTTACCAAAC AGGAAATACA AAGAATAGCT GTTATTTTCA TTATGGCGTT TTTCTCAATA
TTTTTCTTTG TTTTGTTTGA GCAAAAAGGC GCGGCGCTTA ACCTTTTGGC TGAACACAGC
GTTAACAGAA CTATATTCGG ATGGACAATG CCTACAACCT GGTTCCAGTC TTTTAACCCG
TTATTTATTA TTTTATTTGC CCCGGTATTT TCAAAAATGT GGATAGGCCT TTCAACAAAA
GGTAAAGAGC CTTCTGTAAC GGGCAAATTT TCAATAGCGT TTTGGCTTAT AGCCATAGGT
TACGCCGTAT TGTTAATGGC CGCCATGAGG CTTGGCCCCG GCATGAAAAT GGGTATGATG
TGGCTTGTTG CGGCTTACTT CTTTTTTACC ATGGGCGAAC TTTGTTTATC CCCTGTAGGT
CTTTCGCTGG TTACCAAACT CTCGCCTCCT AAGTTTGTGT CTATAATGAT GGGCATTTGG
TTCTTAGCCA ACTCGGCGGC TAATAAAATA GCCGGCTTCT ATTCCGGTTT TATAGCTTCA
TGGCCTTTAG ATAAGTTTTT CACATGGCTC ATGATTATTC CTATAATAGC ATCGGTAATC
TTGTTGCTGC TGTCTAAAAA AATCAACGCC TGGATGCACG GCGTAAAATA G
 
Protein sequence
MTQETAVKQK QPSGLYLLFA TEMWERFSYY SLRGLFVLYL TKALAFDVPR ATSLYGTFTS 
LIYLSPLLGG YMADRWLGKR SSIIIGGILI AAGQFVMGTG GIGAVYVAMG LIILGNGFFK
PNISSILGEI YEKNDVRRDG GFTIFYMGIN LGSFLANLIA GTIGEKVGWV YGFWTAGFGM
ILGLIIFIWG KDKFLQGKGH APKHYAKIEK EAGKEEEKKP LTKQEIQRIA VIFIMAFFSI
FFFVLFEQKG AALNLLAEHS VNRTIFGWTM PTTWFQSFNP LFIILFAPVF SKMWIGLSTK
GKEPSVTGKF SIAFWLIAIG YAVLLMAAMR LGPGMKMGMM WLVAAYFFFT MGELCLSPVG
LSLVTKLSPP KFVSIMMGIW FLANSAANKI AGFYSGFIAS WPLDKFFTWL MIIPIIASVI
LLLLSKKINA WMHGVK