Gene EcolC_4207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4207 
Symbol 
ID6067731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4647712 
End bp4649097 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content53% 
IMG OID641603639 
Productputative transport protein YifK 
Protein accessionYP_001727131 
Protein GI170022177 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000449535 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATA ACAAACCAGA GCTACAGCGT GGGCTGGAAG CTCGACATAT CGAACTCATC 
GCCCTGGGGG GCACCATTGG CGTCGGCCTG TTTATGGGGG CCGCCAGTAC CCTGAAATGG
GCCGGGCCAT CCGTATTGTT GGCCTATATC ATCGCCGGGC TGTTCGTCTT TTTCATCATG
CGTTCAATGG GCGAAATGTT GTTCCTCGAA CCGGTTACCG GTTCGTTCGC CGTTTATGCG
CATCGTTATA TGAGCCCGTT TTTTGGTTAT CTCACCGCCT GGTCTTACTG GTTTATGTGG
ATGGCGGTGG GGATCTCAGA AATCACCGCC ATTGGCGTTT ATGTCCAGTT CTGGTTCCCG
GAGATGGCGC AGTGGATACC CGCATTGATC GCAGTGGCGC TGGTGGCGTT GGCGAATCTG
GCGGCGGTGC GGTTGTACGG CGAAATCGAG TTCTGGTTCG CGATGATCAA AGTCACCACG
ATTATCGTGA TGATTGTCAT TGGCCTGGGC GTGATTTTCT TTGGCTTTGG CAATGGCGGG
CAGTCGATTG GTTTTAGCAA TCTCACAGAG CATGGCGGTT TCTTTGCGGG TGGCTGGAAA
GGGTTCCTGA CCGCGCTGTG TATTGTGGTG GCGTCCTACC AGGGCGTGGA GCTGATTGGC
ATTACTGCCG GTGAAGCGAA GAATCCGCAG GTGACACTGC GCAGTGCCGT AGGCAAAGTG
CTGTGGCGGA TCCTGATTTT CTACGTAGGC GCGATTTTCG TTATCGTCAC CATCTTCCCG
TGGAATGAAA TTGGCAGCAA CGGCAGCCCG TTCGTACTGA CTTTTGCCAA AATCGGTATT
ACCGCAGCGG CGGGCATTAT CAACTTTGTG GTGCTGACGG CTGCGCTCTC TGGCTGTAAC
AGCGGCATGT ACAGTTGCGG ACGTATGCTC TACGCACTGG CGAAAAACCG TCAGTTACCG
GCGGCGATGG CGAAAGTTTC CCGTCACGGC GTACCAGTTG CGGGTGTGGC AGTATCTATT
GCTATTCTGC TAATTGGCTC ATGCCTGAAC TACATCATTC CCAATCCGCA GCGTGTGTTT
GTCTACGTCT ACAGTGCCAG CGTGCTTCCG GGGATGGTGC CATGGTTTGT GATATTGATA
AGCCAGCTGC GTTTTCGGCG TGCACATAAA GCGGCGATTG CCAGCCATCC GTTCCGCTCA
ATCCTGTTCC CGTGGGCCAA CTACGTAACA ATGGCATTCC TGATTTGCGT TTTGATCGGC
ATGTACTTTA ATGAAGATAC GCGTATGTCG CTGTTTGTTG GCATCATCTT TATGCTGGCG
GTGACGGCGA TTTATAAAGT TTTTGGCCTT AATCGCCACG GGAAAGCGCA TAAACTGGAG
GAATAA
 
Protein sequence
MADNKPELQR GLEARHIELI ALGGTIGVGL FMGAASTLKW AGPSVLLAYI IAGLFVFFIM 
RSMGEMLFLE PVTGSFAVYA HRYMSPFFGY LTAWSYWFMW MAVGISEITA IGVYVQFWFP
EMAQWIPALI AVALVALANL AAVRLYGEIE FWFAMIKVTT IIVMIVIGLG VIFFGFGNGG
QSIGFSNLTE HGGFFAGGWK GFLTALCIVV ASYQGVELIG ITAGEAKNPQ VTLRSAVGKV
LWRILIFYVG AIFVIVTIFP WNEIGSNGSP FVLTFAKIGI TAAAGIINFV VLTAALSGCN
SGMYSCGRML YALAKNRQLP AAMAKVSRHG VPVAGVAVSI AILLIGSCLN YIIPNPQRVF
VYVYSASVLP GMVPWFVILI SQLRFRRAHK AAIASHPFRS ILFPWANYVT MAFLICVLIG
MYFNEDTRMS LFVGIIFMLA VTAIYKVFGL NRHGKAHKLE E