Gene EcE24377A_4501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4501 
Symbol 
ID5587717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4494068 
End bp4495351 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content51% 
IMG OID640928115 
Productphthalate permease family protein 
Protein accessionYP_001465459 
Protein GI157155164 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.131797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTTG ATGTGTCAAC CAGTGTTGCG GGTAATAAAC CGCAACGGAT TCGTCGTATT 
CAGACAGTGA CCCTGGTTTT ATTATTTATG GCGGGGATCG TTAATTTTCT CGACCGTTCG
TCATTGAGCG TGGCAGGTGA AGCGATTCGT GGCGAGTTAG GATTATCGGC GACAGAGTTT
GGTGTTTTGC TTTCTGCATT TTCTCTGTCT TATGGTTTTT CACAACTACC TTCCGGTATT
TTGCTAGACC GCTTTGGCCC ACGAATTGTT TTAGGCGCAG GCTTAATATT CTGGTCATTA
ATGCAGGCAT TAACAGGAAT GGTTAATTCT TTTAGCCACT TTATTTTAAT GCGTATCGGT
CTGGGTATTG GCGAAGCGCC ATTTATGCCT GCGGGAGTAA AGTCGATCAC CGACTGGTAT
GCACAAAAAG AACGCGGCAC GGCGCTGGGG ATTTTTAACT CATCTACCGT TATCGGTCAG
GCTATCGCGC CTCCTGCTCT TGTATTGATG CAGCTGGCAT GGGGCTGGCG GACGATGTTC
GTTATCATCG GCGTGGCAGG GATTCTGGTT GGGATCTGTT GGTACGCGTG GTATCGCAAC
CGGGCGCAGT TTGTCCTGAC TGACGAAGAA CGGACGTATC TCTCCGCGCC GGTTAAACCG
CGTCCACAGC TGCAATTTAG CGAGTGGCTG GCGCTGTTTA AGCATCGGAC AACCTGGGGG
ATGATTTTAG GTTTCTCTGG CGTCAATTAT ACCGGTTGGC TCTACATCGC GTGGTTACCC
GGTTATTTGC AAGCCGAGCA AGGTTTCAGT CTGGCGAAAA CCGGTTGGGT GGCGGCGATT
CCTTTCCTCG CGGCGGCAGT CGGGATGTGG GTTAACGGTA TTGTTGTCGA TCGACTGGCG
AAAAAAGGCT ACGACCTGGC GAAGACGCGT AAAACGGCTA TTGTCTGCGG TTTGATGATG
TCGGCATTAG GCACGTTGCT GGTCGTGCAA TCTTCCTCGC CAGCCCAGGC GGTAGCGTTT
ATCTCAATGG CGCTATTCTG CGTGCATTTC GCTGGAACGT CTGCATGGGG GCTGGTGCAG
GTGATGGTGT CAGAAACAAA AGTGGCTTCC ATCGCCGGGA TTCAAAACTT TGGCAGTTTT
GTTTTTGCTT CCTTTGCTCC GATCGTAACC GGTTGGGTAG TGGATACCAC ACACTCGTTT
AATCTGGCGT TGGTTATCGC GGCCTGTGTG ACGTTCACCG GGGCGCTGTG TTACTTCTTT
ATTGTCAAAG ATCGCATTGA GTAA
 
Protein sequence
MDVDVSTSVA GNKPQRIRRI QTVTLVLLFM AGIVNFLDRS SLSVAGEAIR GELGLSATEF 
GVLLSAFSLS YGFSQLPSGI LLDRFGPRIV LGAGLIFWSL MQALTGMVNS FSHFILMRIG
LGIGEAPFMP AGVKSITDWY AQKERGTALG IFNSSTVIGQ AIAPPALVLM QLAWGWRTMF
VIIGVAGILV GICWYAWYRN RAQFVLTDEE RTYLSAPVKP RPQLQFSEWL ALFKHRTTWG
MILGFSGVNY TGWLYIAWLP GYLQAEQGFS LAKTGWVAAI PFLAAAVGMW VNGIVVDRLA
KKGYDLAKTR KTAIVCGLMM SALGTLLVVQ SSSPAQAVAF ISMALFCVHF AGTSAWGLVQ
VMVSETKVAS IAGIQNFGSF VFASFAPIVT GWVVDTTHSF NLALVIAACV TFTGALCYFF
IVKDRIE