Gene ECD_02108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02108 
SymbolyejB 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2169028 
End bp2170122 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content51% 
IMG OID 
Productpredicted oligopeptide transporter subunit 
Protein accessionACT43931 
Protein GI253978261 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.123294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGCTT ACCTGATTCG CCGTCTGTTG CTGGTGATCC CAACATTATG GGCGATTATC 
ACCATCAACT TTTTCATCGT GCAAATTGCG CCTGGCGGTC CGGTCGATCA GGCCATCGCC
GCCATTGAGT TTGGTAATGC CGGAGTATTA CCCGGCGCAG GCGGTGAAGG TGTTCGTGCC
AGCCATGCGC AAACGGGTGT CGGCAATATC AGCGACAGTA ATTACCGTGG TGGACGCGGA
TTAGATCCAG AAGTGATCGC TGAGATCACT CATCGCTACG GTTTCGATAA GCCGATCCAC
GAACGTTACT TCAAAATGCT CTGGGACTAC ATCCGCTTTG ATTTTGGTGA TAGCCTGTTT
CGCAGCGCCT CGGTGCTGAC GCTGATTAAA GACAGTCTGC CGGTTTCCAT CACCCTCGGA
TTGTGGAGCA CGCTGATTAT CTATCTGGTG TCGATTCCGT TAGGCATTCG CAAAGCTGTT
TATAATGGGA GCCGCTTTGA CGTCTGGAGT AGCGCATTTA TCATCATCGG CTACGCCATT
CCGGCCTTTT TGTTTGCCAT CCTGCTGATT GTCTTCTTCG CGGGCGGCAG CTATTTCGAC
CTGTTCCCTC TACGCGGCCT GGTTTCCGCT AACTTTGATT CGCTGCCGTG GTATCAGAAA
ATCACCGATT ATCTGTGGCA TATCACGCTG CCGGTGCTGG CGACAGTGAT TGGTGGCTTT
GCGGCGCTGA CCATGCTGAC AAAAAACTCA TTCCTTGATG AAGTGCGTAA GCAATACGTG
GTGACCGCCC GAGCGAAAGG GGTAAGTGAA AAAAATATTC TCTGGAAACA TGTGTTCCGC
AACGCCATGC TGCTGGTGAT TGCCGGTTTT CCGGCGACGT TTATCAGCAT GTTTTTTACC
GGCTCGCTGC TGATTGAGGT GATGTTTTCA CTCAATGGTC TTGGCTTACT GGGCTACGAA
GCGACCGTCT CGCGCGATTA TCCTGTAATG TTTGGTACCT TGTATATTTT CACCCTGATT
GGCCTGCTGC TGAATATTGT CAGTGATATC AGCTATACGC TGGTTGATCC GCGTATAGAT
TTTGAGGGAC GTTAA
 
Protein sequence
MGAYLIRRLL LVIPTLWAII TINFFIVQIA PGGPVDQAIA AIEFGNAGVL PGAGGEGVRA 
SHAQTGVGNI SDSNYRGGRG LDPEVIAEIT HRYGFDKPIH ERYFKMLWDY IRFDFGDSLF
RSASVLTLIK DSLPVSITLG LWSTLIIYLV SIPLGIRKAV YNGSRFDVWS SAFIIIGYAI
PAFLFAILLI VFFAGGSYFD LFPLRGLVSA NFDSLPWYQK ITDYLWHITL PVLATVIGGF
AALTMLTKNS FLDEVRKQYV VTARAKGVSE KNILWKHVFR NAMLLVIAGF PATFISMFFT
GSLLIEVMFS LNGLGLLGYE ATVSRDYPVM FGTLYIFTLI GLLLNIVSDI SYTLVDPRID
FEGR