Gene EcSMS35_4160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4160 
Symbol 
ID6142956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4260534 
End bp4261919 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content53% 
IMG OID641618983 
Productputative transport protein YifK 
Protein accessionYP_001746115 
Protein GI170680511 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000124284 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATA ACAAACCAGA GCTACAGCGT GGGCTGGAGG CTCGACATAT CGAACTCATC 
GCCCTGGGGG GCACCATTGG CGTCGGCCTG TTTATGGGGG CCGCCAGTAC CCTGAAATGG
GCCGGGCCAT CCGTATTGTT GGCCTATATC ATCGCCGGGC TATTCGTCTT TTTCATCATG
CGTTCAATGG GCGAAATGTT GTTCCTCGAA CCTGTTACCG GTTCGTTCGC CGTTTATGCG
CATCGTTATA TGAGCCCGTT TTTTGGCTAT CTCACCGCCT GGTCTTACTG GTTTATGTGG
ATGGCGGTGG GGATCTCAGA AATCACCGCC ATTGGCGTTT ATGTCCAGTT CTGGTTCCCG
GAGATGGCGC AATGGATACC GGCATTGATC GCGGTGGCGC TGGTAGCATT GGCGAATCTG
GCGGCGGTGC GGTTGTACGG CGAAATCGAG TTCTGGTTTG CGATGATCAA AGTCACCACG
ATTATCGTGA TGATTGTCAT TGGCCTGGGC GTGATTTTCT TTGGCTTTGG CAATGGCGGG
CAGTCGATTG GTTTTAGCAA TCTCACAGAG CATGGCGGTT TCTTTGCGGG GGGCTGGAAA
GGGTTCCTGA CCGCGCTGTG TATTGTGGTG GCGTCCTACC AGGGCGTGGA GCTGATTGGC
ATTACTGCCG GTGAAGCGAA GAATCCGCAG GTGACGCTGC GCAGTGCCGT AGGCAAGGTG
CTGTGGCGAA TCCTGATTTT CTACGTAGGC GCGATTTTCG TTATCGTCAC CATCTTCCCG
TGGAATGAAA TAGGCAGCAA CGGCAGCCCG TTCGTACTGA CTTTCGCCAA AATCGGCATT
ACCGCAGCGG CGGGTATTAT CAACTTTGTG GTGCTGACGG CTGCGCTCTC TGGCTGTAAC
AGCGGCATGT ACAGTTGCGG ACGTATGCTC TACGCACTGG CGAAAAACCG TCAGTTACCG
GCGGCAATGG CGAAAGTTTC CCGTCACGGC GTACCGGTTG CGGGTGTGGC AGTATCTATT
GCTATCCTGT TAATTGGCTC ATGCCTGAAC TACATCATTC CCAATCCGCA GCGTGTGTTT
GTCTACGTCT ACAGTGCCAG CGTGCTTCCG GGGATGGTGC CATGGTTTGT GATATTGATA
AGCCAGCTGC GTTTTCGTCG TGCGCATAAA GCGGCGATTG CCAGCCATCC GTTCCGCTCA
ATCCTGTTCC CGTGGGCCAA CTACGTAACA ATGGCATTCC TGATTTGCGT TTTGATCGGC
ATGTACTTTA ATGAAGATAC GCGTATGTCG CTGTTTGTCG GCATCATCTT TATGCTGGCG
GTGACGGCGA TTTATAAAAT TTTTGGGCTT AATCGCCACG GTAAAGCGCA TAAACTGGAG
GAATAA
 
Protein sequence
MADNKPELQR GLEARHIELI ALGGTIGVGL FMGAASTLKW AGPSVLLAYI IAGLFVFFIM 
RSMGEMLFLE PVTGSFAVYA HRYMSPFFGY LTAWSYWFMW MAVGISEITA IGVYVQFWFP
EMAQWIPALI AVALVALANL AAVRLYGEIE FWFAMIKVTT IIVMIVIGLG VIFFGFGNGG
QSIGFSNLTE HGGFFAGGWK GFLTALCIVV ASYQGVELIG ITAGEAKNPQ VTLRSAVGKV
LWRILIFYVG AIFVIVTIFP WNEIGSNGSP FVLTFAKIGI TAAAGIINFV VLTAALSGCN
SGMYSCGRML YALAKNRQLP AAMAKVSRHG VPVAGVAVSI AILLIGSCLN YIIPNPQRVF
VYVYSASVLP GMVPWFVILI SQLRFRRAHK AAIASHPFRS ILFPWANYVT MAFLICVLIG
MYFNEDTRMS LFVGIIFMLA VTAIYKIFGL NRHGKAHKLE E