Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4779 |
Symbol | |
ID | 5587867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4772569 |
End bp | 4773534 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640928389 |
Product | hypothetical protein |
Protein accession | YP_001465717 |
Protein GI | 157157606 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00613349 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAGCG GCGTGCTGTA CGCCCTGTTA GCAGGGTTGA TGTGGGGGCT TATTTTTGTC GGGCCGTTGA TCGTACCGGA ATACCCGGCG ATGTTGCAGT CGATGGGGCG TTATCTGGCG TTAGGGTTAA TTGCGCTGCC CATTGCCTGG CTGGGACGCG TGCGTCTGCG TCAGTTGGCG CGTCGGGACT GGCTTACCGC CTTGATGCTC ACTATGATGG GCAACCTCAT TTATTACTTC TGCCTTGCCA GTGCCATTCA ACGTACTGGC GCGCCTGTTT CCACGATGAT TATCGGCACC CTGCCGGTGG TCATTCCTGT CTTTGCCAAT CTGCTTTATA GCCAGCGCGA CGGCAAACTC GCGTGGGGAA AACTCGCCCC GGCACTGATT TGTATTGGCA TCGGCCTGGC GAGTGTGAAT ATTGCTGAGT TAAACCACGG ACTCCCCGAT TTTGACTGGG CACGTTATAC CTCTGGCATC GTGCTAGCGT TAGTTTCCGT GGTCTGCTGG GCATGGTATG CCCTGCGCAA CGCCCGCTGG CTGCGGGAAA ATCCCGACAA ACATCCGATG ATGTGGGCGA CGGCGCAGGC GCTGGTCACG CTGCCGGTTT CTCTCATCGG CTATCTCGTC GCCTGTTACT GGCTGAATAT ACAAACGCCG GACTTCTCCT TACCTTTTGG CCCCCGTCCG CTGGTGTTTA TTAGTCTGAT GGTTGCGATA GCCGTGCTTT GCTCATGGGT TGGCGCACTC TGCTGGAACG TCGCCAGCCA GCGATTACCG ACAGTGATTC TCGGGCCGCT GATTGTTTTC GAAACGCTGG CAGGTTTGCT GTACACCTTT TTACTCCGCC AGCAAATGCC GCCGCTAATG ACGCTGAGCG GTATCGCGCT GTTAGTGATT GGCGTGGTCA TTGCGGTCAG AGCAAAACCA GAAAAACCTT TAACTGAATC TGTCTCAGAA AGTTGA
|
Protein sequence | MISGVLYALL AGLMWGLIFV GPLIVPEYPA MLQSMGRYLA LGLIALPIAW LGRVRLRQLA RRDWLTALML TMMGNLIYYF CLASAIQRTG APVSTMIIGT LPVVIPVFAN LLYSQRDGKL AWGKLAPALI CIGIGLASVN IAELNHGLPD FDWARYTSGI VLALVSVVCW AWYALRNARW LRENPDKHPM MWATAQALVT LPVSLIGYLV ACYWLNIQTP DFSLPFGPRP LVFISLMVAI AVLCSWVGAL CWNVASQRLP TVILGPLIVF ETLAGLLYTF LLRQQMPPLM TLSGIALLVI GVVIAVRAKP EKPLTESVSE S
|
| |