Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4160 |
Symbol | |
ID | 6142956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4260534 |
End bp | 4261919 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618983 |
Product | putative transport protein YifK |
Protein accession | YP_001746115 |
Protein GI | 170680511 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000124284 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGATA ACAAACCAGA GCTACAGCGT GGGCTGGAGG CTCGACATAT CGAACTCATC GCCCTGGGGG GCACCATTGG CGTCGGCCTG TTTATGGGGG CCGCCAGTAC CCTGAAATGG GCCGGGCCAT CCGTATTGTT GGCCTATATC ATCGCCGGGC TATTCGTCTT TTTCATCATG CGTTCAATGG GCGAAATGTT GTTCCTCGAA CCTGTTACCG GTTCGTTCGC CGTTTATGCG CATCGTTATA TGAGCCCGTT TTTTGGCTAT CTCACCGCCT GGTCTTACTG GTTTATGTGG ATGGCGGTGG GGATCTCAGA AATCACCGCC ATTGGCGTTT ATGTCCAGTT CTGGTTCCCG GAGATGGCGC AATGGATACC GGCATTGATC GCGGTGGCGC TGGTAGCATT GGCGAATCTG GCGGCGGTGC GGTTGTACGG CGAAATCGAG TTCTGGTTTG CGATGATCAA AGTCACCACG ATTATCGTGA TGATTGTCAT TGGCCTGGGC GTGATTTTCT TTGGCTTTGG CAATGGCGGG CAGTCGATTG GTTTTAGCAA TCTCACAGAG CATGGCGGTT TCTTTGCGGG GGGCTGGAAA GGGTTCCTGA CCGCGCTGTG TATTGTGGTG GCGTCCTACC AGGGCGTGGA GCTGATTGGC ATTACTGCCG GTGAAGCGAA GAATCCGCAG GTGACGCTGC GCAGTGCCGT AGGCAAGGTG CTGTGGCGAA TCCTGATTTT CTACGTAGGC GCGATTTTCG TTATCGTCAC CATCTTCCCG TGGAATGAAA TAGGCAGCAA CGGCAGCCCG TTCGTACTGA CTTTCGCCAA AATCGGCATT ACCGCAGCGG CGGGTATTAT CAACTTTGTG GTGCTGACGG CTGCGCTCTC TGGCTGTAAC AGCGGCATGT ACAGTTGCGG ACGTATGCTC TACGCACTGG CGAAAAACCG TCAGTTACCG GCGGCAATGG CGAAAGTTTC CCGTCACGGC GTACCGGTTG CGGGTGTGGC AGTATCTATT GCTATCCTGT TAATTGGCTC ATGCCTGAAC TACATCATTC CCAATCCGCA GCGTGTGTTT GTCTACGTCT ACAGTGCCAG CGTGCTTCCG GGGATGGTGC CATGGTTTGT GATATTGATA AGCCAGCTGC GTTTTCGTCG TGCGCATAAA GCGGCGATTG CCAGCCATCC GTTCCGCTCA ATCCTGTTCC CGTGGGCCAA CTACGTAACA ATGGCATTCC TGATTTGCGT TTTGATCGGC ATGTACTTTA ATGAAGATAC GCGTATGTCG CTGTTTGTCG GCATCATCTT TATGCTGGCG GTGACGGCGA TTTATAAAAT TTTTGGGCTT AATCGCCACG GTAAAGCGCA TAAACTGGAG GAATAA
|
Protein sequence | MADNKPELQR GLEARHIELI ALGGTIGVGL FMGAASTLKW AGPSVLLAYI IAGLFVFFIM RSMGEMLFLE PVTGSFAVYA HRYMSPFFGY LTAWSYWFMW MAVGISEITA IGVYVQFWFP EMAQWIPALI AVALVALANL AAVRLYGEIE FWFAMIKVTT IIVMIVIGLG VIFFGFGNGG QSIGFSNLTE HGGFFAGGWK GFLTALCIVV ASYQGVELIG ITAGEAKNPQ VTLRSAVGKV LWRILIFYVG AIFVIVTIFP WNEIGSNGSP FVLTFAKIGI TAAAGIINFV VLTAALSGCN SGMYSCGRML YALAKNRQLP AAMAKVSRHG VPVAGVAVSI AILLIGSCLN YIIPNPQRVF VYVYSASVLP GMVPWFVILI SQLRFRRAHK AAIASHPFRS ILFPWANYVT MAFLICVLIG MYFNEDTRMS LFVGIIFMLA VTAIYKIFGL NRHGKAHKLE E
|
| |