Gene ECD_03542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03542 
SymbolyicK 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3725320 
End bp3726504 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content46% 
IMG OID 
Productpredicted sugar efflux system 
Protein accessionACT45338 
Protein GI253979668 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAA CGGCTACCAC TCCATCAAAA ATACTTGATC TCACTGCCGC GGCATTTTTA 
CTTGTCGCCT TTCTGACGGG TATTGCGGGC GCTCTTCAGA CTCCTACCCT AAGTATATTC
CTCGCAGATG AACTGAAAGC CCGTCCTATA ATGGTAGGTT TTTTCTTCAC CGGTAGCGCT
ATTATGGGAA TTCTGGTCAG TCAATTTCTG GCAAGGCACT CCGATAAACA AGGCGACCGT
AAATTACTGA TTCTGCTATG TTGCTTATTT GGAGTGCTGG CCTGCACGCT TTTTGCGTGG
AATCGCAACT ACTTCATTCT CCTCTCAACG GGCGTACTTC TGAGTAGTTT TGCTTCCACC
GCAAACCCGC AAATGTTCGC CCTCGCCCGT GAACACGCCG ACAGAACAGG CCGTGAGACG
GTCATGTTCA GTACATTTTT ACGTGCTCAG ATCTCGCTTG CCTGGGTTAT CGGGCCACCG
CTCGCTTATG AACTGGCAAT GGGATTTAGT TTTAAAGTGA TGTATCTCAC CGCTGCCATC
GCATTTGTTG TTTGCGGACT GATAGTCTGG TTGTTTTTGC CATCAATACA AAGAAATATT
CCTGTCGTTA CCCAACCCGT AGAAATTTTA CCCTCCACCC ACAGGAAGCG GGATACGCGG
CTACTTTTTG TGGTCTGTTC AATGATGTGG GCGGCGAATA ATCTCTACAT GATAAATATG
CCGCTATTTA TTATTGATGA ACTGCATCTA ACCGATAAAC TGGCTGGGGA AATGATTGGT
ATCGCTGCCG GTCTGGAAAT TCCGATGATG TTAATCGCAG GCTATTACAT GAAACGTATT
GGCAAGCGAC TATTAATGCT CATTGCTATC GTGAGTGGAA TGTGTTTTTA CGCCAGCGTA
CTCATGGCGA CGACTCCGGC GGTTGAGCTG GAATTGCAAA TTCTTAATGC CATCTTCCTT
GGTATTCTCT GTGGTATCGG CATGCTTTAT TTTCAGGACT TGATGCCTGA AAAAATAGGC
TCTGCGACAA CGTTATATGC AAATACTTCA CGCGTCGGCT GGATTATCGC CGGCTCTGTT
GACGGAATTA TGGTTGAAAT CTGGAGCTAC CATGCGTTGT TCTGGCTGGC GATAGGGATG
TTGGGTATTG CGATGATTTG CCTGCTGTTT ATTAAAGATA TTTAG
 
Protein sequence
MQKTATTPSK ILDLTAAAFL LVAFLTGIAG ALQTPTLSIF LADELKARPI MVGFFFTGSA 
IMGILVSQFL ARHSDKQGDR KLLILLCCLF GVLACTLFAW NRNYFILLST GVLLSSFAST
ANPQMFALAR EHADRTGRET VMFSTFLRAQ ISLAWVIGPP LAYELAMGFS FKVMYLTAAI
AFVVCGLIVW LFLPSIQRNI PVVTQPVEIL PSTHRKRDTR LLFVVCSMMW AANNLYMINM
PLFIIDELHL TDKLAGEMIG IAAGLEIPMM LIAGYYMKRI GKRLLMLIAI VSGMCFYASV
LMATTPAVEL ELQILNAIFL GILCGIGMLY FQDLMPEKIG SATTLYANTS RVGWIIAGSV
DGIMVEIWSY HALFWLAIGM LGIAMICLLF IKDI