Gene EcSMS35_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1895 
SymboloppC 
ID6143595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1915737 
End bp1916642 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content53% 
IMG OID641616771 
Productoligopeptide ABC transporter, permease protein OppC 
Protein accessionYP_001743949 
Protein GI170683469 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.213821 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.110744 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAGTA AGAAAAACAG CGAGACGCTG GAAAATTTCA GTGAAAAGCT GGAGGTCGAA 
GGGCGCAGCT TGTGGCAGGA CGCACGTCGA CGTTTTATGC ATAACCGTGC GGCGGTTGCC
AGTCTGATAG TGCTGGTGCT GATCGCGTTA TTTGTCATCC TGGCACCGAT GCTTTCGCAG
TTTGCCTATG ACGATACTGA CTGGGCGATG ATGTCCAGCG CCCCGGATAT GGAGTCCGGT
CACTACTTTG GTACTGACTC ATCTGGTCGC GACCTGCTTG TGCGCGTTGC GATTGGCGGG
CGTATCTCAC TCATGGTCGG TGTTGCTGCG GCACTGGTGG CAGTGGTCGT GGGGACACTT
TATGGTTCGC TTTCCGGTTA TCTGGGCGGT AAAGTGGATT CGGTAATGAT GCGTCTGCTG
GAAATCCTCA ACTCCTTCCC ATTCATGTTC TTCGTCATTT TGCTGGTGAC CTTTTTCGGT
CAAAACATCC TGCTGATTTT CGTGGCGATT GGCATGGTTT CCTGGCTGGA TATGGCTCGT
ATTGTGCGTG GGCAAACCCT GAGTCTGAAG CGCAAAGAGT TTATTGAGGC GGCACAAGTT
GGCGGTGTAT CGACGCCGGG CATTGTTATT CGCCACATTG TGCCGAACGT ACTCGGTGTG
GTGGTGGTCT ACGCATCGCT ATTGGTGCCC AGCATGATCC TCTTTGAATC TTTCCTTAGC
TTCCTGGGGT TGGGGACGCA AGAGCCGTTA AGCAGCTGGG GCGCATTGCT GAGTGATGGC
GCGAACTCGA TGGAAGTCTC TCCATGGTTA CTGTTGTTCC CAGCGGGATT CCTCGTGGTG
ACGCTATTTT GTTTCAACTT TATCGGCGAT GGCTTGCGTG ATGCCCTCGA CCCGAAAGAT
CGTTAA
 
Protein sequence
MLSKKNSETL ENFSEKLEVE GRSLWQDARR RFMHNRAAVA SLIVLVLIAL FVILAPMLSQ 
FAYDDTDWAM MSSAPDMESG HYFGTDSSGR DLLVRVAIGG RISLMVGVAA ALVAVVVGTL
YGSLSGYLGG KVDSVMMRLL EILNSFPFMF FVILLVTFFG QNILLIFVAI GMVSWLDMAR
IVRGQTLSLK RKEFIEAAQV GGVSTPGIVI RHIVPNVLGV VVVYASLLVP SMILFESFLS
FLGLGTQEPL SSWGALLSDG ANSMEVSPWL LLFPAGFLVV TLFCFNFIGD GLRDALDPKD
R