Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_4207 |
Symbol | |
ID | 6067731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4647712 |
End bp | 4649097 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641603639 |
Product | putative transport protein YifK |
Protein accession | YP_001727131 |
Protein GI | 170022177 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000449535 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGATA ACAAACCAGA GCTACAGCGT GGGCTGGAAG CTCGACATAT CGAACTCATC GCCCTGGGGG GCACCATTGG CGTCGGCCTG TTTATGGGGG CCGCCAGTAC CCTGAAATGG GCCGGGCCAT CCGTATTGTT GGCCTATATC ATCGCCGGGC TGTTCGTCTT TTTCATCATG CGTTCAATGG GCGAAATGTT GTTCCTCGAA CCGGTTACCG GTTCGTTCGC CGTTTATGCG CATCGTTATA TGAGCCCGTT TTTTGGTTAT CTCACCGCCT GGTCTTACTG GTTTATGTGG ATGGCGGTGG GGATCTCAGA AATCACCGCC ATTGGCGTTT ATGTCCAGTT CTGGTTCCCG GAGATGGCGC AGTGGATACC CGCATTGATC GCAGTGGCGC TGGTGGCGTT GGCGAATCTG GCGGCGGTGC GGTTGTACGG CGAAATCGAG TTCTGGTTCG CGATGATCAA AGTCACCACG ATTATCGTGA TGATTGTCAT TGGCCTGGGC GTGATTTTCT TTGGCTTTGG CAATGGCGGG CAGTCGATTG GTTTTAGCAA TCTCACAGAG CATGGCGGTT TCTTTGCGGG TGGCTGGAAA GGGTTCCTGA CCGCGCTGTG TATTGTGGTG GCGTCCTACC AGGGCGTGGA GCTGATTGGC ATTACTGCCG GTGAAGCGAA GAATCCGCAG GTGACACTGC GCAGTGCCGT AGGCAAAGTG CTGTGGCGGA TCCTGATTTT CTACGTAGGC GCGATTTTCG TTATCGTCAC CATCTTCCCG TGGAATGAAA TTGGCAGCAA CGGCAGCCCG TTCGTACTGA CTTTTGCCAA AATCGGTATT ACCGCAGCGG CGGGCATTAT CAACTTTGTG GTGCTGACGG CTGCGCTCTC TGGCTGTAAC AGCGGCATGT ACAGTTGCGG ACGTATGCTC TACGCACTGG CGAAAAACCG TCAGTTACCG GCGGCGATGG CGAAAGTTTC CCGTCACGGC GTACCAGTTG CGGGTGTGGC AGTATCTATT GCTATTCTGC TAATTGGCTC ATGCCTGAAC TACATCATTC CCAATCCGCA GCGTGTGTTT GTCTACGTCT ACAGTGCCAG CGTGCTTCCG GGGATGGTGC CATGGTTTGT GATATTGATA AGCCAGCTGC GTTTTCGGCG TGCACATAAA GCGGCGATTG CCAGCCATCC GTTCCGCTCA ATCCTGTTCC CGTGGGCCAA CTACGTAACA ATGGCATTCC TGATTTGCGT TTTGATCGGC ATGTACTTTA ATGAAGATAC GCGTATGTCG CTGTTTGTTG GCATCATCTT TATGCTGGCG GTGACGGCGA TTTATAAAGT TTTTGGCCTT AATCGCCACG GGAAAGCGCA TAAACTGGAG GAATAA
|
Protein sequence | MADNKPELQR GLEARHIELI ALGGTIGVGL FMGAASTLKW AGPSVLLAYI IAGLFVFFIM RSMGEMLFLE PVTGSFAVYA HRYMSPFFGY LTAWSYWFMW MAVGISEITA IGVYVQFWFP EMAQWIPALI AVALVALANL AAVRLYGEIE FWFAMIKVTT IIVMIVIGLG VIFFGFGNGG QSIGFSNLTE HGGFFAGGWK GFLTALCIVV ASYQGVELIG ITAGEAKNPQ VTLRSAVGKV LWRILIFYVG AIFVIVTIFP WNEIGSNGSP FVLTFAKIGI TAAAGIINFV VLTAALSGCN SGMYSCGRML YALAKNRQLP AAMAKVSRHG VPVAGVAVSI AILLIGSCLN YIIPNPQRVF VYVYSASVLP GMVPWFVILI SQLRFRRAHK AAIASHPFRS ILFPWANYVT MAFLICVLIG MYFNEDTRMS LFVGIIFMLA VTAIYKVFGL NRHGKAHKLE E
|
| |