Gene Information |
Locus tag | EcSMS35_1895 |
Symbol | oppC |
ID | 6143595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1915737 |
End bp | 1916642 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616771 |
Product | oligopeptide ABC transporter, permease protein OppC |
Protein accession | YP_001743949 |
Protein GI | 170683469 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.213821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.110744 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTAAGTA AGAAAAACAG CGAGACGCTG GAAAATTTCA GTGAAAAGCT GGAGGTCGAA GGGCGCAGCT TGTGGCAGGA CGCACGTCGA CGTTTTATGC ATAACCGTGC GGCGGTTGCC AGTCTGATAG TGCTGGTGCT GATCGCGTTA TTTGTCATCC TGGCACCGAT GCTTTCGCAG TTTGCCTATG ACGATACTGA CTGGGCGATG ATGTCCAGCG CCCCGGATAT GGAGTCCGGT CACTACTTTG GTACTGACTC ATCTGGTCGC GACCTGCTTG TGCGCGTTGC GATTGGCGGG CGTATCTCAC TCATGGTCGG TGTTGCTGCG GCACTGGTGG CAGTGGTCGT GGGGACACTT TATGGTTCGC TTTCCGGTTA TCTGGGCGGT AAAGTGGATT CGGTAATGAT GCGTCTGCTG GAAATCCTCA ACTCCTTCCC ATTCATGTTC TTCGTCATTT TGCTGGTGAC CTTTTTCGGT CAAAACATCC TGCTGATTTT CGTGGCGATT GGCATGGTTT CCTGGCTGGA TATGGCTCGT ATTGTGCGTG GGCAAACCCT GAGTCTGAAG CGCAAAGAGT TTATTGAGGC GGCACAAGTT GGCGGTGTAT CGACGCCGGG CATTGTTATT CGCCACATTG TGCCGAACGT ACTCGGTGTG GTGGTGGTCT ACGCATCGCT ATTGGTGCCC AGCATGATCC TCTTTGAATC TTTCCTTAGC TTCCTGGGGT TGGGGACGCA AGAGCCGTTA AGCAGCTGGG GCGCATTGCT GAGTGATGGC GCGAACTCGA TGGAAGTCTC TCCATGGTTA CTGTTGTTCC CAGCGGGATT CCTCGTGGTG ACGCTATTTT GTTTCAACTT TATCGGCGAT GGCTTGCGTG ATGCCCTCGA CCCGAAAGAT CGTTAA
Protein sequence | MLSKKNSETL ENFSEKLEVE GRSLWQDARR RFMHNRAAVA SLIVLVLIAL FVILAPMLSQ FAYDDTDWAM MSSAPDMESG HYFGTDSSGR DLLVRVAIGG RISLMVGVAA ALVAVVVGTL YGSLSGYLGG KVDSVMMRLL EILNSFPFMF FVILLVTFFG QNILLIFVAI GMVSWLDMAR IVRGQTLSLK RKEFIEAAQV GGVSTPGIVI RHIVPNVLGV VVVYASLLVP SMILFESFLS FLGLGTQEPL SSWGALLSDG ANSMEVSPWL LLFPAGFLVV TLFCFNFIGD GLRDALDPKD R
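The length fields in this record can be cross-checked against the coordinates with a few lines of Python. This is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Join the space-separated 10-letter blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(blocked)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values taken from the record above.
print(gene_length(1915737, 1916642))   # 906
print(protein_length(906))             # 301
```

Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the reported GC content. The strand field does not affect these length checks, and the gene sequence shown already begins with the ATG start codon.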