Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3796 |
Symbol | |
ID | 6067652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4155662 |
End bp | 4156627 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603209 |
Product | hypothetical protein |
Protein accession | YP_001726728 |
Protein GI | 170021774 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0293258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAGCG GCGTGCTGTA CGCCCTGTTA GCAGGGTTGA TGTGGGGGCT TATTTTTGTC GGGCCGTTGA TCGTGCCGGA ATACCCGGCG ATGTTGCAGT CGATGGGGCG TTATCTGGCG TTAGGGTTAA TTGCGCTACC CATTGCCTGG CTGGGACGCG TGCGTCTGCG TCAGTTGGCG CGTCGCGACT GGCTTACCGC CTTGATGCTC ACTATGATGG GCAACCTCAT CTATTACTTC TGCCTTGCCA GTGCCATTCA ACGTACTGGC GCGCCTGTTT CTACGATGAT TATTGGCACC CTGCCGGTGG TCATTCCTGT CTTTGCCAAT CTGCTTTATA GCCAGCGCGA CGGCAAACTC GCGTGGGGAA AACTCGCCCC GGCACTGATT TGTATTGGCA TCGGCCTGGC GTGTGTGAAT ATTGCTGAGT TAAACCACGG ACTCCCCGAT TTTGACTGGG CACGTTATAC CTCTGGCATC GTGCTAGCGT TAGTTTCCGT GGTCTGCTGG GCATGGTATG CCCTGCGTAA CGCCCGCTGG CTGCGGGAAA ATCCCGACAA ACATCCGATG ATGTGGGCGA CGGCGCAGGC GCTGGTCACG CTGCCGGTTT CTCTCATCGG CTATCTCGTC GCCTGTTACT GGCTGAATAT ACAAACGCCG GACTTCTCCT TACCTTTTGG CCCCCGTCCG CTGGTGTTTA TTAGTCTGAT GGTTGCGATA GCCGTGCTTT GCTCATGGGT TGGCGCACTC TGCTGGAACG TCGCCAGCCA GCGATTACCG ACAGTGATTC TCGGGCCGCT GATCGTTTTC GAAACACTGG CAGGTTTGCT GTACACCTTT TTGATACGCC AGCAAATGCC GCCGCTGATG ACGCTGAGCG GTATCGCGCT GTTAGTGGTT GGCGTGGTCA TTGCAGTCAG AGCAAAACCG GAAAAGCCTT TAACTGAATC TGTCTCAGAA AGTTGA
|
Protein sequence | MISGVLYALL AGLMWGLIFV GPLIVPEYPA MLQSMGRYLA LGLIALPIAW LGRVRLRQLA RRDWLTALML TMMGNLIYYF CLASAIQRTG APVSTMIIGT LPVVIPVFAN LLYSQRDGKL AWGKLAPALI CIGIGLACVN IAELNHGLPD FDWARYTSGI VLALVSVVCW AWYALRNARW LRENPDKHPM MWATAQALVT LPVSLIGYLV ACYWLNIQTP DFSLPFGPRP LVFISLMVAI AVLCSWVGAL CWNVASQRLP TVILGPLIVF ETLAGLLYTF LIRQQMPPLM TLSGIALLVV GVVIAVRAKP EKPLTESVSE S
|
| |