Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0133 |
Symbol | |
ID | 6263089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 141009 |
End bp | 142739 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642610596 |
Product | extracellular solute-binding protein |
Protein accession | YP_001875036 |
Protein GI | 187250554 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAA TTTTCATTAT TGCGCTTGCG CTTTTTATAG CCGCCTGCGG CGGGAAAAAA GAGAGCGACA AGCAAACTCT TATTTTTGCC CATAAAGGGG AAATGCAGTC TTTAGACCCA ATTTATTCTT ATGACGGTGT TACACAAGGG CTTATTTTAA ATATCTATGA CACGGTAATA AAATTTAAAG GAAGCTCAAT ATCAAAATTT GAACCTCTTA TATCAACGCA GGTCCCTTCA ATTGAAAACG GCCTTATTTC TAAAGACGGA CTGACATACA CTTTTCCCAT AAGAAAAAAT GTGAAGTTCC ATAACGGTGA AATCTTAACA CCCGAAGACG TAAAATATTC AATACTCCGT TTTATTTTAT CTGACCGCGC GGGCGGGCCT TCCAATTTAT TGCTTGAACC TATTTTGGGC GTAAACTCAA CACGCGACGG AAGCAAATTT ACAATAACAA ATAAAGATAT AGAAGAGGCC GTAAAAATAG AAGGGGATAA TGTTGTAATT AAATTAAAAA GACCGTTCGC TCCTTTTTTA TCAATTATGG CACGATGGTC ATACATAATG AATAAAAAAT GGTGCGCCGA AAACGGTGAG TGGGACGGAC GCCTTGAAAC CTGGCAAAAA TTTAATAACC GCGAGCGCGA CGACTCCTAT CTGTTTAACC ACATGAACGG CACCGGGCCT TTTAAATTAA ACCGCTGGGA TATCACGGGA AAAAGACTTT CACTCTTAAG TAATGAAAAT TACTTTTTGG GCGCCCCTAA AATAAAGAAT ATTTTACTTA TGACGGTAGA TGAACCTTCC ACCATGCGCC TTATGCTTGA AAGCGGCGAT GTTGACGTGG CGGAAATTTC TCAAAAGTTT GATAAGCAGT ATGACGGACA TGAAGGCATT ATCCTGGCGG ATAATTTGCC AAGATTAAGG ACTGACCCGG CCATATTTTT TACTTATGAA ATTAACACAA CCGCCAACCC AGATGTGGGC AGCAGTAAAC TTGACGGAAA AGGAATACCG CATGATTTTT TTACTGATAA AGATTTGCGA AAAGCCTTTG CCCACGCTTT TGACTACCAA GCGTTTTTAA CGCAAACCAT GCAAAATAAA GGCACGCTTG CCAACGGGCC TGTACCGCCG GGATTAATAG GTTATGACAA AAACGCGCCT CATTATAATT TTGATTTGGA AAAATCAAAG GAATACTTTA AAAAAGCCTG GGGCGGCAAA GTTTGGGAAA ACGGATTTAA GTTTACAATA ACTTATAATA CCAGCGGCGA AATGAGGCAG ATAGCCTGTG AAATTTTAAA AAGAAATATT GAGTCTTTAA ACCCCAAATT TAAAATTGAA CTGCGCGGCG TGCCGTGGGC TTCTTTTTTA GAAAAAACCG ATAAACGCCA AATGCCCATG TGGTCGCGCG GCTGGATTGC CGATTACGCG GACCCGCACA ACTTTGTGTT CCCCTTTTTA CACAGTCGGG GCCGCTATGC TTTAAGCCAG GGTTTTAAAA ACCCTAAACT TGACGCTCTT ATTGAACAAG CGGTAAACAG CGTAAACGTT TCCGAGAGGG AAAAGCTTTA CTCTCAAATA CAAAAAATTG CTTATGAGGA AGCGCCCCAA ATTTACACCG TGCACCCGAC AGCTTTATGG GCTTTTAGGA AAAATGTGAA AGGATTTTAC GATAACCCGG TTTTTATGGG TATTTACTTT TATCCTTTAT ATAAAGAATA A
|
Protein sequence | MRKIFIIALA LFIAACGGKK ESDKQTLIFA HKGEMQSLDP IYSYDGVTQG LILNIYDTVI KFKGSSISKF EPLISTQVPS IENGLISKDG LTYTFPIRKN VKFHNGEILT PEDVKYSILR FILSDRAGGP SNLLLEPILG VNSTRDGSKF TITNKDIEEA VKIEGDNVVI KLKRPFAPFL SIMARWSYIM NKKWCAENGE WDGRLETWQK FNNRERDDSY LFNHMNGTGP FKLNRWDITG KRLSLLSNEN YFLGAPKIKN ILLMTVDEPS TMRLMLESGD VDVAEISQKF DKQYDGHEGI ILADNLPRLR TDPAIFFTYE INTTANPDVG SSKLDGKGIP HDFFTDKDLR KAFAHAFDYQ AFLTQTMQNK GTLANGPVPP GLIGYDKNAP HYNFDLEKSK EYFKKAWGGK VWENGFKFTI TYNTSGEMRQ IACEILKRNI ESLNPKFKIE LRGVPWASFL EKTDKRQMPM WSRGWIADYA DPHNFVFPFL HSRGRYALSQ GFKNPKLDAL IEQAVNSVNV SEREKLYSQI QKIAYEEAPQ IYTVHPTALW AFRKNVKGFY DNPVFMGIYF YPLYKE
|
| |