Gene ECD_01391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01391 
SymbolydcO 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1458968 
End bp1460143 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content53% 
IMG OID 
Productpredicted benzoate transporter 
Protein accessionACT43274 
Protein GI253977604 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCTGT TTTCTATTCC TCCACCCACG CTACTGGCGG GGTTTCTGGC GGTATTAATT 
GGCTACGCCA GTTCAGCGGC AATAATCTGG CAAGCAGCGA TTGTCGCCGG AGCCACCACT
GCACAAATCT CTGGCTGGAT GACGGCGCTG GGGCTGGCAA TGGGCGTCAG TACGCTGACT
CTGACATTAT GGTATCGCGT ACCTGTTCTC ACCGCATGGT CAACGCCTGG CGCGGCTTTG
TTGGTCACCG GATTGCAGGG ACTAACACTT AACGAAGCCA TCGGCGTTTT TATTGTCACC
AACGCGCTAA TAGTCCTCTG CGGCATAACG GGACTCTTTG CTCGTCTGAT GCGCATTATT
CCGCACTCGC TTGCGGCGGC AATGCTTGCC GGGATTTTAT TACGCTTTGG TTTACAGGCG
TTTGCCAGTC TGGACGGTCA ATTTACGTTG TGTGGAAGTA TGTTGCTGGT ATGGCTGGCA
ACCAAGGCCG TTGCGCCGCG CTATGCGGTA ATTGCCGCGA TGATTATTGG GATCGTGATC
GTCATCGCGC AAGGTGACGT TGTCACAACT GATGTTGTCT TTAAACCCGT TCTCCCCACT
TATATTACCC CTGATTTTTC GTTTGCTCAC AGCCTGAGCG TTGCACTCCC CCTTTTTCTG
GTGACGATGG CATCGCAAAA CGCACCGGGT ATCGCAGCAA TGAAAGCAGC TGGATATTCG
GCTCCTGTTT CGCCATTAAT TGTATTTACT GGATTGCTGG CACTGGTTTT TTCCCCTTTC
GGCGTTTATT CCGTCGGTAT TGCGGCAATC ACCGCGGCTA TTTGCCAAAG CCCGGAAGCG
CATCCGGATA AAGATCAACG TTGGCTGGCC GCTGCCGTTG CAGGCATTTT CTATTTGCTC
GCAGGTCTGT TTGGTAGTGC CATTACCGGG ATGATGGCTG CCCTGCCCGT AAGTTGGATC
CAGATGCTGG CAGGTCTGGC GCTGTTAAGT ACCATCGGCG GCAGTTTGTA TCAGGCGCTG
CATAATGAGC GTGAGCGAGA CGCGGCGGTG GTGGCATTTC TGGTAACGGC AAGTGGATTG
ACGCTGGTCG GGATTGGTTC TGCGTTTTGG GGATTAATTG CCGGAGGCGT TTGTTACGTG
GTGTTGAATT TAATCGCTGA CAGAAACCGA TATTGA
 
Protein sequence
MRLFSIPPPT LLAGFLAVLI GYASSAAIIW QAAIVAGATT AQISGWMTAL GLAMGVSTLT 
LTLWYRVPVL TAWSTPGAAL LVTGLQGLTL NEAIGVFIVT NALIVLCGIT GLFARLMRII
PHSLAAAMLA GILLRFGLQA FASLDGQFTL CGSMLLVWLA TKAVAPRYAV IAAMIIGIVI
VIAQGDVVTT DVVFKPVLPT YITPDFSFAH SLSVALPLFL VTMASQNAPG IAAMKAAGYS
APVSPLIVFT GLLALVFSPF GVYSVGIAAI TAAICQSPEA HPDKDQRWLA AAVAGIFYLL
AGLFGSAITG MMAALPVSWI QMLAGLALLS TIGGSLYQAL HNERERDAAV VAFLVTASGL
TLVGIGSAFW GLIAGGVCYV VLNLIADRNR Y