Gene EcHS_A4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4014 
Symbol 
ID5591833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4009300 
End bp4010685 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content53% 
IMG OID640923118 
Productputative transport protein YifK 
Protein accessionYP_001460589 
Protein GI157163271 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.1864e-16 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATA ACAAACCAGA GCTACAGCGT GGGCTGGAAG CTCGACATAT CGAACTCATC 
GCCCTGGGGG GCACCATTGG CGTCGGCCTG TTTATGGGGG CCGCCAGTAC CCTGAAATGG
GCCGGGCCAT CCGTATTGTT GGCCTATATC ATCGCCGGGC TGTTCGTCTT TTTCATCATG
CGTTCAATGG GCGAAATGTT GTTCCTCGAA CCGGTTACCG GTTCGTTCGC CGTTTATGCG
CATCGTTATA TGAGCCCGTT TTTTGGTTAT CTCACCGCCT GGTCTTACTG GTTTATGTGG
ATGGCGGTGG GGATCTCAGA AATCACCGCC ATTGGCGTTT ATGTCCAGTT CTGGTTCCCG
GAGATGGCGC AGTGGATACC CGCATTGATC GCAGTGGCGC TGGTGGCGTT GGCGAATCTG
GCGGCGGTGC GGTTGTACGG CGAAATCGAG TTCTGGTTCG CGATGATCAA AGTCACCACG
ATTATCGTGA TGATTGTCAT TGGCCTGGGC GTGATTTTCT TTGGCTTTGG CAATGGCGGG
CAGTCGATTG GTTTTAGCAA TCTCACAGAG CATGGCGGTT TCTTTGCGGG TGGCTGGAAA
GGGTTCCTGA CCGCGCTGTG TATTGTGGTG GCGTCCTACC AGGGCGTGGA GCTGATTGGC
ATTACTGCCG GTGAAGCGAA GAATCCGCAG GTGACACTGC GCAGTGCCGT AGGCAAAGTG
CTGTGGCGGA TCCTGATTTT CTACGTAGGC GCGATTTTCG TTATCGTCAC CATCTTCCCG
TGGAATGAAA TTGGCAGCAA CGGCAGCCCG TTCGTACTGA CTTTTGCCAA AATCGGTATT
ACCGCAGCGG CGGGCATTAT CAACTTTGTG GTGCTGACGG CTGCGCTCTC TGGCTGTAAC
AGCGGCATGT ACAGTTGCGG ACGTATGCTC TACGCACTGG CGAAAAACCG TCAGTTACCG
GCGGCGATGG CGAAAGTTTC CCGTCACGGC GTACCAGTTG CGGGTGTGGC AGTATCTATT
GCTATTCTGC TAATTGGCTC ATGCCTGAAC TACATCATTC CCAATCCGCA GCGTGTGTTT
GTCTACGTCT ACAGTGCCAG CGTGCTTCCG GGGATGGTGC CATGGTTTGT GATATTGATA
AGCCAGCTGC GTTTTCGGCG TGCACATAAA GCGGCGATTG CCAGCCATCC GTTCCGCTCA
ATCCTGTTCC CGTGGGCCAA CTACGTAACA ATGGCATTCC TGATTTGCGT TTTGATCGGC
ATGTACTTTA ATGAAGATAC GCGTATGTCG CTGTTTGTTG GCATCATCTT TATGCTGGCG
GTGACGGCGA TTTATAAAGT TTTTGGCCTT AATCGCCACG GGAAAGCGCA TAAACTGGAG
GAATAA
 
Protein sequence
MADNKPELQR GLEARHIELI ALGGTIGVGL FMGAASTLKW AGPSVLLAYI IAGLFVFFIM 
RSMGEMLFLE PVTGSFAVYA HRYMSPFFGY LTAWSYWFMW MAVGISEITA IGVYVQFWFP
EMAQWIPALI AVALVALANL AAVRLYGEIE FWFAMIKVTT IIVMIVIGLG VIFFGFGNGG
QSIGFSNLTE HGGFFAGGWK GFLTALCIVV ASYQGVELIG ITAGEAKNPQ VTLRSAVGKV
LWRILIFYVG AIFVIVTIFP WNEIGSNGSP FVLTFAKIGI TAAAGIINFV VLTAALSGCN
SGMYSCGRML YALAKNRQLP AAMAKVSRHG VPVAGVAVSI AILLIGSCLN YIIPNPQRVF
VYVYSASVLP GMVPWFVILI SQLRFRRAHK AAIASHPFRS ILFPWANYVT MAFLICVLIG
MYFNEDTRMS LFVGIIFMLA VTAIYKVFGL NRHGKAHKLE E