Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0920 |
Symbol | |
ID | 6262622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 1019312 |
End bp | 1020622 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642611399 |
Product | amino acid/peptide transporter |
Protein accession | YP_001875810 |
Protein GI | 187251328 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000308736 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 5.10372e-19 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGACCCAGG AAACAGCAGT TAAACAAAAA CAGCCTTCGG GCCTTTATTT ACTTTTCGCC ACGGAGATGT GGGAAAGGTT CAGCTACTAC TCTTTAAGAG GTTTGTTCGT TTTATACTTA ACAAAAGCTT TAGCTTTTGA CGTGCCTAGA GCCACAAGTC TTTACGGCAC GTTTACCAGT CTTATATATC TTTCGCCCCT TTTGGGCGGT TATATGGCCG ACAGGTGGCT CGGTAAAAGA AGTTCTATTA TAATAGGCGG CATTTTAATA GCCGCGGGCC AGTTTGTTAT GGGTACGGGC GGCATAGGCG CCGTTTACGT TGCTATGGGC CTTATTATTT TGGGTAATGG TTTTTTTAAA CCCAATATTT CAAGCATTTT AGGCGAAATA TATGAAAAAA ATGACGTAAG ACGAGACGGC GGTTTTACCA TTTTCTACAT GGGTATTAAC CTTGGTTCTT TTTTAGCTAA CTTAATAGCC GGCACAATAG GCGAAAAAGT GGGCTGGGTA TACGGCTTTT GGACGGCGGG TTTCGGCATG ATTTTAGGCC TTATCATTTT TATCTGGGGT AAAGATAAAT TCTTACAAGG CAAAGGCCAC GCCCCCAAAC ATTACGCAAA AATAGAAAAA GAAGCCGGTA AAGAAGAAGA AAAGAAACCT CTTACCAAAC AGGAAATACA AAGAATAGCT GTTATTTTCA TTATGGCGTT TTTCTCAATA TTTTTCTTTG TTTTGTTTGA GCAAAAAGGC GCGGCGCTTA ACCTTTTGGC TGAACACAGC GTTAACAGAA CTATATTCGG ATGGACAATG CCTACAACCT GGTTCCAGTC TTTTAACCCG TTATTTATTA TTTTATTTGC CCCGGTATTT TCAAAAATGT GGATAGGCCT TTCAACAAAA GGTAAAGAGC CTTCTGTAAC GGGCAAATTT TCAATAGCGT TTTGGCTTAT AGCCATAGGT TACGCCGTAT TGTTAATGGC CGCCATGAGG CTTGGCCCCG GCATGAAAAT GGGTATGATG TGGCTTGTTG CGGCTTACTT CTTTTTTACC ATGGGCGAAC TTTGTTTATC CCCTGTAGGT CTTTCGCTGG TTACCAAACT CTCGCCTCCT AAGTTTGTGT CTATAATGAT GGGCATTTGG TTCTTAGCCA ACTCGGCGGC TAATAAAATA GCCGGCTTCT ATTCCGGTTT TATAGCTTCA TGGCCTTTAG ATAAGTTTTT CACATGGCTC ATGATTATTC CTATAATAGC ATCGGTAATC TTGTTGCTGC TGTCTAAAAA AATCAACGCC TGGATGCACG GCGTAAAATA G
|
Protein sequence | MTQETAVKQK QPSGLYLLFA TEMWERFSYY SLRGLFVLYL TKALAFDVPR ATSLYGTFTS LIYLSPLLGG YMADRWLGKR SSIIIGGILI AAGQFVMGTG GIGAVYVAMG LIILGNGFFK PNISSILGEI YEKNDVRRDG GFTIFYMGIN LGSFLANLIA GTIGEKVGWV YGFWTAGFGM ILGLIIFIWG KDKFLQGKGH APKHYAKIEK EAGKEEEKKP LTKQEIQRIA VIFIMAFFSI FFFVLFEQKG AALNLLAEHS VNRTIFGWTM PTTWFQSFNP LFIILFAPVF SKMWIGLSTK GKEPSVTGKF SIAFWLIAIG YAVLLMAAMR LGPGMKMGMM WLVAAYFFFT MGELCLSPVG LSLVTKLSPP KFVSIMMGIW FLANSAANKI AGFYSGFIAS WPLDKFFTWL MIIPIIASVI LLLLSKKINA WMHGVK
|
| |