Gene ECD_00307 details


Gene Information

Locus tag: ECD_00307
Symbol: mhpT
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 344794
End bp: 346005
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: predicted 3-hydroxyphenylpropionic transporter
Protein accession: ACT42206
Protein GI: 253976536
COG category:
COG ID:
TIGRFAM ID:
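
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(344794, 346005)
print(length, protein_length_aa(length))  # 1212 403
```

Both values agree with the Gene Length and Protein Length fields of the record.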


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGACTC GTACCCCTTC ATCATCTTCA TCCCGCCTGA TGCTGACCAT CGGGCTTTGT 
TTTTTGGTCG CTCTGATGGA AGGGCTGGAT CTTCAGGCGG CTGGCATTGC GGCGGGTGGC
ATCGCCCAGG CTTTCGCACT CGATAAAATG CAAATGGGCT GGATATTTAG CGCCGGAATA
CTCGGTTTGC TACCCGGCGC GTTGGTTGGC GGAATGCTGG CGGACCGTTA TGGTCGTAAG
CGTATTTTGA TTGGCTCAGT TGCGCTGTTT GGTTTGTTCT CACTGGCAAC GGCGATTGCC
TGGGATTTCC CCTCACTGGT CTTTGCGCGG CTGATGACCG GTGTCGGGCT GGGGGCGGCG
TTGCCGAATC TTATCGCCCT GACGTCTGAA GCCGCGGGTC CACGTTTTCG TGGGACGGCA
GTGAGCCTGA TGTATTGCGG TGTTCCCATT GGCGCGGCGC TGGCGGCGAC ACTGGGTTTC
GCGGGGGCAA ACTTAGCATG GCAAACGGTG TTTTGGGTAG GTGGTGTGGT GCCGTTGATT
CTGGTGCCGC TATTAATGCG CTGGCTGCCG GAGTCGGCGG TTTTCGCTGG CGAAAAACAG
TCTGCGCCAC CACTGCGTGC CTTATTTGCG CCAGAAACGG CAACCGCGAC GCTGCTGCTG
TGGTTGTGTT ATTTCTTCAC TCTGCTGGTG GTCTACATGT TGATCAACTG GCTACCGCTA
CTTTTGGTGG AGCAAGGATT CCAGCCATCG CAGGCGGCAG GGGTGATGTT TGCCCTGCAA
ATGGGGGCGG CAAGCGGGAC GTTAATGTTG GGCGCATTGA TGGATAAGCT GCGTCCAGTA
ACCATGTCGC TACTGATTTA TAGCGGCATG TTAGCTTCGC TGCTGGCGCT TGGAACGGTG
TCGTCATTTA ACGGTATGTT GCTGGCGGGA TTTGTCGCGG GGTTGTTTGC GACAGGTGGG
CAAAGCGTTT TGTATGCCCT GGCACCGTTG TTTTACAGTT CGCAGATCCG CGCAACAGGT
GTGGGAACAG CCGTGGCGGT AGGGCGTCTG GGGGCTATGA GCGGTCCGTT ACTGGCCGGG
AAAATGCTGG CATTAGGCAC TGGCACGGTC GGCGTAATGG CCGCTTCTGC ACCGGGTATT
CTTGTTGCTG GGTTGGCGGT GTTTATTTTG ATGAGCCGGA GATCACGAAT ACAGCCGTGC
GCCGATGCCT GA
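
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the blocks might be cleaned up, how a GC fraction could be computed, and how basic CDS features (length divisible by 3, a start codon, a terminal stop codon) could be checked. The start codons accepted here (ATG, GTG, TTG) are an assumption and cover only the common bacterial ones, not the full set allowed by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are accepted.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence, upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 with a start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


# Demonstration on the final line of the record only, not the whole gene.
tail = clean_sequence("GCCGATGCCT GA")
print(tail, f"{gc_fraction(tail):.2f}")  # GCCGATGCCTGA 0.67
```

Passing the full 1212 bp sequence through `clean_sequence` and `gc_fraction` is one way to compare against the GC content field reported in the record.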
 
Protein sequence
MSTRTPSSSS SRLMLTIGLC FLVALMEGLD LQAAGIAAGG IAQAFALDKM QMGWIFSAGI 
LGLLPGALVG GMLADRYGRK RILIGSVALF GLFSLATAIA WDFPSLVFAR LMTGVGLGAA
LPNLIALTSE AAGPRFRGTA VSLMYCGVPI GAALAATLGF AGANLAWQTV FWVGGVVPLI
LVPLLMRWLP ESAVFAGEKQ SAPPLRALFA PETATATLLL WLCYFFTLLV VYMLINWLPL
LLVEQGFQPS QAAGVMFALQ MGAASGTLML GALMDKLRPV TMSLLIYSGM LASLLALGTV
SSFNGMLLAG FVAGLFATGG QSVLYALAPL FYSSQIRATG VGTAVAVGRL GAMSGPLLAG
KMLALGTGTV GVMAASAPGI LVAGLAVFIL MSRRSRIQPC ADA
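
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. This is a sketch under stated assumptions, not the method used to produce the record: it builds the amino acid assignments shared by the standard code and translation table 11 from the conventional TCAG ordering, ignores alternative start codons, and uses hypothetical names throughout.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks and the final line of the gene sequence in this record.
print(translate("ATGTCGACTC GTACCCCTTC ATCATCTTCA"))  # MSTRTPSSSS
print(translate("GCCGATGCCT GA"))  # ADA*
```

The two outputs match the first block and the last three residues of the protein sequence above; the trailing `*` corresponds to the TGA stop codon, which is not part of the 403 aa protein.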