Gene ECD_00787 details

Gene Information

Locus tag: ECD_00787
Symbol: ybiT
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 839955
End bp: 841547
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: fused predicted transporter subunits of ABC superfamily: ATP-binding components
Protein accession: ACT42688
Protein GI: 253977018
COG category:
COG ID:
TIGRFAM ID:
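
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but this reading matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(839955, 841547)
print(length_bp)                  # 1593
print(protein_length(length_bp))  # 530
```

Both results agree with the Gene Length and Protein Length fields of this record.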


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTAGTTT CCAGTAACGT CACCATGCAG TTCGGCAGTA AGCCGTTGTT TGAAAACATT 
TCCGTCAAAT TTGGCGGCGG CAACCGTTAC GGCCTGATTG GCGCGAACGG TAGTGGTAAA
TCCACCTTTA TGAAGATCCT CGGCGGCGAC TTAGAGCCGA CTTTGGGTAA CGTTTCCCTC
GATCCCAACG AGCGCATTGG TAAGCTGCGT CAGGATCAGT TTGCCTTTGA AGAGTTCACT
GTGCTGGATA CGGTGATCAT GGGGCATAAA GAGTTGTGGG AAGTGAAGCA GGAGCGTGAC
CGCATCTATG CTTTGCCGGA AATGAGTGAG GAAGACGGCT ATAAAGTGGC CGATCTGGAA
GTTAAATACG GCGAAATGGA CGGTTACTCT GCGGAAGCTC GCGCCGGTGA ATTGTTGCTT
GGCGTGGGAA TTCCAGTGGA ACAGCACTAC GGCCCGATGA GTGAAGTTGC TCCTGGCTGG
AAGCTGCGTG TGCTTCTGGC GCAGGCGCTG TTTGCTGATC CGGATATTCT CCTGCTCGAC
GAACCGACCA ATAACCTCGA CATCGACACC ATTCGCTGGC TGGAACAGGT GCTGAACGAG
CGTGACAGCA CCATGATCAT CATCTCGCAC GACCGTCACT TCCTTAACAT GGTTTGTACC
CACATGGCGG ATCTGGATTA CGGCGAGCTG CGCGTTTATC CGGGTAACTA CGATGAGTAC
ATGACGGCGG CGACCCAGGC GCGTGAACGT CTGCTGGCCG ATAACGCCAA GAAGAAAGCG
CAGATTGCTG AGTTGCAATC TTTCGTTAGC CGCTTTAGCG CCAACGCCTC GAAATCTCGC
CAGGCAACTT CGCGCGCGCG CCAGATTGAT AAAATCAAAC TGGAAGAGGT GAAAGCCTCC
AGCCGTCAGA ACCCGTTCAT CCGTTTTGAA CAGGATAAGA AACTGTTCCG TAACGCGCTG
GAAGTGGAAG GTCTGACCAA AGGGTTTGAT AACGGTCCAC TGTTTAAAAA TCTCAACCTG
CTGCTGGAGG TCGGTGAAAA ACTGGCGGTA CTGGGTACCA ACGGCGTCGG TAAATCAACG
CTGCTGAAAA CGCTGGTGGG CGATCTGCAA CCGGACAGCG GCACCGTAAA ATGGTCTGAA
AACGCGCGCA TTGGTTACTA TGCTCAGGAC CACGAATATG AGTTTGAAAA TGATCTGACC
GTGTTCGAAT GGATGAGCCA GTGGAAGCAG GAAGGCGATG ACGAGCAGGC GGTACGCAGC
ATTCTCGGTC GTTTGCTGTT CAGCCAGGAC GACATCAAAA AGCCAGCTAA AGTGCTTTCC
GGTGGGGAAA AAGGGCGGAT GCTGTTTGGT AAGTTAATGA TGCAGAAGCC GAACATTCTG
ATCATGGACG AACCGACCAA CCACCTGGAT ATGGAATCCA TTGAGTCGCT GAACATGGCA
CTGGAACTGT ATCAGGGCAC GCTGATCTTT GTTTCACACG ACCGTGAGTT CGTAAGCTCC
CTGGCGACCC GCATTCTGGA AATCACCCCG GAACGCGTGA TCGACTTTAG CGGTAATTAC
GAAGATTACC TGCGTAGTAA AGGGATCGAG TAA
 
Protein sequence
MLVSSNVTMQ FGSKPLFENI SVKFGGGNRY GLIGANGSGK STFMKILGGD LEPTLGNVSL 
DPNERIGKLR QDQFAFEEFT VLDTVIMGHK ELWEVKQERD RIYALPEMSE EDGYKVADLE
VKYGEMDGYS AEARAGELLL GVGIPVEQHY GPMSEVAPGW KLRVLLAQAL FADPDILLLD
EPTNNLDIDT IRWLEQVLNE RDSTMIIISH DRHFLNMVCT HMADLDYGEL RVYPGNYDEY
MTAATQARER LLADNAKKKA QIAELQSFVS RFSANASKSR QATSRARQID KIKLEEVKAS
SRQNPFIRFE QDKKLFRNAL EVEGLTKGFD NGPLFKNLNL LLEVGEKLAV LGTNGVGKST
LLKTLVGDLQ PDSGTVKWSE NARIGYYAQD HEYEFENDLT VFEWMSQWKQ EGDDEQAVRS
ILGRLLFSQD DIKKPAKVLS GGEKGRMLFG KLMMQKPNIL IMDEPTNHLD MESIESLNMA
LELYQGTLIF VSHDREFVSS LATRILEITP ERVIDFSGNY EDYLRSKGIE
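
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG can serve as an initiator codon, and an initiator codon is read as methionine regardless of its usual meaning (valine for GTG). The following is a minimal Python sketch of how the block-formatted gene sequence could be translated to reproduce the protein sequence. It is an illustration under stated assumptions, not the tool that generated this record: the helper names are hypothetical, and only ATG, GTG and TTG are treated as initiator codons, although table 11 permits a few more.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a simplified subset of the table 11 initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator codon as Met and dropping a final stop."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is decoded as methionine
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence from this record.
print(translate_cds("GTGTTAGTTT CCAGTAACGT CACCATGCAG"))  # MLVSSNVTMQ
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the complete gene sequence to `translate_cds` is the natural next step for comparing against the full 530-residue protein.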