Gene Emin_0133 details

Gene Information

Locus tag: Emin_0133
Symbol:
ID: 6263089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 141009
End bp: 142739
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 40%
IMG OID: 642610596
Product: extracellular solute-binding protein
Protein accession: YP_001875036
Protein GI: 187250554
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
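
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions not stated in the record: that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 141009
end_bp = 142739

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; assume the last codon is a stop
# codon that does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1731
print(protein_length_aa)  # 576
```

Under those assumptions the computed values match the listed Gene Length (1731 bp) and Protein Length (576 aa).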


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAA TTTTCATTAT TGCGCTTGCG CTTTTTATAG CCGCCTGCGG CGGGAAAAAA 
GAGAGCGACA AGCAAACTCT TATTTTTGCC CATAAAGGGG AAATGCAGTC TTTAGACCCA
ATTTATTCTT ATGACGGTGT TACACAAGGG CTTATTTTAA ATATCTATGA CACGGTAATA
AAATTTAAAG GAAGCTCAAT ATCAAAATTT GAACCTCTTA TATCAACGCA GGTCCCTTCA
ATTGAAAACG GCCTTATTTC TAAAGACGGA CTGACATACA CTTTTCCCAT AAGAAAAAAT
GTGAAGTTCC ATAACGGTGA AATCTTAACA CCCGAAGACG TAAAATATTC AATACTCCGT
TTTATTTTAT CTGACCGCGC GGGCGGGCCT TCCAATTTAT TGCTTGAACC TATTTTGGGC
GTAAACTCAA CACGCGACGG AAGCAAATTT ACAATAACAA ATAAAGATAT AGAAGAGGCC
GTAAAAATAG AAGGGGATAA TGTTGTAATT AAATTAAAAA GACCGTTCGC TCCTTTTTTA
TCAATTATGG CACGATGGTC ATACATAATG AATAAAAAAT GGTGCGCCGA AAACGGTGAG
TGGGACGGAC GCCTTGAAAC CTGGCAAAAA TTTAATAACC GCGAGCGCGA CGACTCCTAT
CTGTTTAACC ACATGAACGG CACCGGGCCT TTTAAATTAA ACCGCTGGGA TATCACGGGA
AAAAGACTTT CACTCTTAAG TAATGAAAAT TACTTTTTGG GCGCCCCTAA AATAAAGAAT
ATTTTACTTA TGACGGTAGA TGAACCTTCC ACCATGCGCC TTATGCTTGA AAGCGGCGAT
GTTGACGTGG CGGAAATTTC TCAAAAGTTT GATAAGCAGT ATGACGGACA TGAAGGCATT
ATCCTGGCGG ATAATTTGCC AAGATTAAGG ACTGACCCGG CCATATTTTT TACTTATGAA
ATTAACACAA CCGCCAACCC AGATGTGGGC AGCAGTAAAC TTGACGGAAA AGGAATACCG
CATGATTTTT TTACTGATAA AGATTTGCGA AAAGCCTTTG CCCACGCTTT TGACTACCAA
GCGTTTTTAA CGCAAACCAT GCAAAATAAA GGCACGCTTG CCAACGGGCC TGTACCGCCG
GGATTAATAG GTTATGACAA AAACGCGCCT CATTATAATT TTGATTTGGA AAAATCAAAG
GAATACTTTA AAAAAGCCTG GGGCGGCAAA GTTTGGGAAA ACGGATTTAA GTTTACAATA
ACTTATAATA CCAGCGGCGA AATGAGGCAG ATAGCCTGTG AAATTTTAAA AAGAAATATT
GAGTCTTTAA ACCCCAAATT TAAAATTGAA CTGCGCGGCG TGCCGTGGGC TTCTTTTTTA
GAAAAAACCG ATAAACGCCA AATGCCCATG TGGTCGCGCG GCTGGATTGC CGATTACGCG
GACCCGCACA ACTTTGTGTT CCCCTTTTTA CACAGTCGGG GCCGCTATGC TTTAAGCCAG
GGTTTTAAAA ACCCTAAACT TGACGCTCTT ATTGAACAAG CGGTAAACAG CGTAAACGTT
TCCGAGAGGG AAAAGCTTTA CTCTCAAATA CAAAAAATTG CTTATGAGGA AGCGCCCCAA
ATTTACACCG TGCACCCGAC AGCTTTATGG GCTTTTAGGA AAAATGTGAA AGGATTTTAC
GATAACCCGG TTTTTATGGG TATTTACTTT TATCCTTTAT ATAAAGAATA A
 
Protein sequence
MRKIFIIALA LFIAACGGKK ESDKQTLIFA HKGEMQSLDP IYSYDGVTQG LILNIYDTVI 
KFKGSSISKF EPLISTQVPS IENGLISKDG LTYTFPIRKN VKFHNGEILT PEDVKYSILR
FILSDRAGGP SNLLLEPILG VNSTRDGSKF TITNKDIEEA VKIEGDNVVI KLKRPFAPFL
SIMARWSYIM NKKWCAENGE WDGRLETWQK FNNRERDDSY LFNHMNGTGP FKLNRWDITG
KRLSLLSNEN YFLGAPKIKN ILLMTVDEPS TMRLMLESGD VDVAEISQKF DKQYDGHEGI
ILADNLPRLR TDPAIFFTYE INTTANPDVG SSKLDGKGIP HDFFTDKDLR KAFAHAFDYQ
AFLTQTMQNK GTLANGPVPP GLIGYDKNAP HYNFDLEKSK EYFKKAWGGK VWENGFKFTI
TYNTSGEMRQ IACEILKRNI ESLNPKFKIE LRGVPWASFL EKTDKRQMPM WSRGWIADYA
DPHNFVFPFL HSRGRYALSQ GFKNPKLDAL IEQAVNSVNV SEREKLYSQI QKIAYEEAPQ
IYTVHPTALW AFRKNVKGFY DNPVFMGIYF YPLYKE
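
As a rough illustration of how the gene and protein sequences above relate, the following sketch translates short fragments of the gene sequence and computes a GC fraction. It is a minimal example, not the database's own procedure. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as starts). Only the first 30 bases and the last line of the gene sequence are used, to keep the example short.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the gene sequence above.
first_block = "ATGAGAAAAA TTTTCATTAT TGCGCTTGCG"
last_line = "GATAACCCGG TTTTTATGGG TATTTACTTT TATCCTTTAT ATAAAGAATA A"

print(translate(first_block))  # MRKIFIIALA
print(translate(last_line))    # DNPVFMGIYFYPLYKE
print(gc_fraction(first_block))
```

The two translated fragments agree with the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value that the record rounds to a whole percentage.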