Gene EcolC_3782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3782 
Symbol 
ID6067580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4137907 
End bp4139409 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content55% 
IMG OID641603195 
ProductABC transporter related 
Protein accessionYP_001726714 
Protein GI170021760 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTA ACCAACACCA GGAGATCCTC CGCACCGAAG GATTAAGTAA ATTTTTCCCC 
GGCGTCAAAG CGTTAGACAA CGTTGATTTC AGCCTGCGCC GCGGCGAAAT CATGGCGCTG
CTCGGTGAAA ACGGGGCAGG AAAATCAACG CTAATCAAAG CATTAACTGG TGTATACCAC
GCCGATCGCG GCACCATCTG GCTGGAAGGC CAGGCTATCT CACCGAAAAA TACCGCCCAT
GCACAACAAC TCGGTATCGG CACCGTCTAT CAGGAAGTAA ACCTGCTACC CAATATGTCG
GTCGCTGATA ATCTATTTAT AGGCCGCGAA CCCAAACGCT TCGGCCTTCT ACGCCGCAAA
GAGATGGAAA AGCGCGCCAC CGAACTGATG GCATCTTACG GTTTCTCCCT CGACGTGCGC
GAACCGCTCA ACCGCTTTTC AGTCGCGATG CAGCAAATCG TCGCTATTTG CCGGGCTATC
GATCTCTCTG CCAAAGTGCT GATCCTCGAT GAACCCACCG CCAGTCTCGA CACCCAGGAA
GTGGAGTTAC TGTTTGACCT GATGCGTCAG TTGCGCGATC GCGGCGTCAG CCTGATTTTT
GTCACTCACT TTCTCGATCA GGTCTATCAG GTCAGCGATC GGATCACCGT CTTACGCAAC
GGCAGTTTCG TAGGCTGTCG GGAAACGTGC GAGCTACCGC AGATCGAACT GGTAAAAATG
ATGCTGGGGC GCGAGCTGGA TACCCACGCG CTACAGCGTG CCGGGCGAAC ATTGTTGAGC
GACAAACCCG TTGCCGCGTT CAAAAATTAC GGCAAAAAAG GAACGATCGC ACCGTTTGAT
CTCGAAGTAC GCCCCGGCGA GATCGTCGGT CTGGCTGGAT TGCTGGGATC AGGACGTACC
GAAACCGCCG AAGTGATCTT CGGTATCAAA CCTGCTGACA GCGGCACGGC GTTGATCAAA
GGCAAACCGC AAAACCTGCG ATCGCCACAT CAGGCTTCGG TACTGGGCAT TGGCTTCTGC
CCGGAAGACA GGAAAACCGA TGGCATCATC GCTGCCGCCT CGGTGCGGGA AAATATCATC
CTCGCTCTCC AGGCCCAGCG CGGCTGGCTA CGTCCCATTT CCCGCAAAGA ACAGCAAGAG
ATTGCCGAAC GCTTTATCCG CCAGCTTGGC ATTCGCACAC CTTCAACTGA ACAACCGATT
GAATTTCTCT CTGGCGGCAA TCAGCAAAAA GTGTTGCTTT CACGTTGGCT ACTGACCCGA
CCGCAATTTC TGATCCTTGA TGAGCCAACT CGCGGCATTG ATGTTGGTGC CCACGCCGAG
ATCATCCGCC TGATTGAAAC GCTATGCGCC GATGGTCTGG CGCTGCTGGT GATCTCCTCC
GAACTGGAAG AACTGGTGGG CTATGCCGAC CGGGTGATCA TCATGCGCGA TCGCAAACAG
GTGGCGGAGA TCCCGCTGGC AGAGCTTTCC GTTCCGGCGA TCATGAACGC CATTGCGGCG
TAA
 
Protein sequence
MTTNQHQEIL RTEGLSKFFP GVKALDNVDF SLRRGEIMAL LGENGAGKST LIKALTGVYH 
ADRGTIWLEG QAISPKNTAH AQQLGIGTVY QEVNLLPNMS VADNLFIGRE PKRFGLLRRK
EMEKRATELM ASYGFSLDVR EPLNRFSVAM QQIVAICRAI DLSAKVLILD EPTASLDTQE
VELLFDLMRQ LRDRGVSLIF VTHFLDQVYQ VSDRITVLRN GSFVGCRETC ELPQIELVKM
MLGRELDTHA LQRAGRTLLS DKPVAAFKNY GKKGTIAPFD LEVRPGEIVG LAGLLGSGRT
ETAEVIFGIK PADSGTALIK GKPQNLRSPH QASVLGIGFC PEDRKTDGII AAASVRENII
LALQAQRGWL RPISRKEQQE IAERFIRQLG IRTPSTEQPI EFLSGGNQQK VLLSRWLLTR
PQFLILDEPT RGIDVGAHAE IIRLIETLCA DGLALLVISS ELEELVGYAD RVIIMRDRKQ
VAEIPLAELS VPAIMNAIAA