Gene EcHS_A4482 details


Gene Information

Locus tag: EcHS_A4482
Symbol:
ID: 5593900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4486253
End bp: 4487755
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 56%
IMG OID: 640923580
Product: putative sugar ABC transporter, ATP-binding protein
Protein accession: YP_001461021
Protein GI: 157163703
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
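
The length fields above follow from the coordinates. The sketch below is a minimal consistency check, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4486253, 4487755)
print(length, protein_length_aa(length))  # 1503 500
```

Both results agree with the Gene Length and Protein Length values in the record.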


Plasmid Coverage information

Num covering plasmid clones: 56
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACCG ACCAACACCA GGAGATCCTC CGCACCGAAG GATTAAGTAA ATTTTTCCCC 
GGCGTCAAAG CGTTAGACAA CGTTGATTTC AGCCTGCGCC GTGGCGAAAT CATGGCGCTG
CTTGGTGAAA ACGGGGCGGG AAAATCAACG CTAATCAAAG CATTAACTGG TGTATACCAC
GCCGATCGCG GCACCATCTG GCTGGAAGGC CAGGCTATCT CACCGAAAAA TACCGCCCAT
GCACAACAAC TCGGTATCGG CACCGTCTAT CAGGAAGTCA ACCTGCTACC CAATATGTCG
GTTGCTGATA ATCTATTTAT AGGCCGCGAA CCCAAACGTT TCGGCCTTCT ACGCCGTAAA
GAGATGGAAA AGCGCGCCAC CGAACTGATG GCATCTTACG GTTTCTCCCT CGACGTGCGC
GAACCGCTTA ACCGTTTTTC GGTGGCGATG CAGCAAATCG TCGCCATTTG CCGGGCTATC
GATCTCTCCG CCAAAGTGCT GATCCTCGAT GAACCCACCG CCAGCCTCGA CACCCAGGAA
GTGGAGCTAT TATTTGGCCT GATGCGTCAG TTGCGCGATC GCGGCGTCAG CCTGATCTTT
GTCACTCACT TTCTCGATCA GGTCTATCAG GTCAGCGATC GGATCACCGT CTTACGCAAC
GGCAGTTTCG TAGGCTGTCG GGAAACGCGC GAGCTACCGC AGATCGAACT GGTAAAAATG
ATGCTGGGGC GCGAGCTGGA TACCCACGCG CTACAGCGTG CCGGGCGAAC ATTGTTGAGC
GACAAACCCG TTGCCGCGTT CAAAAATTAC GGCAAAAAAG GAACGATAGC ACCGTTTGAT
CTCGAAGTGC GCCCCGGCGA GATCGTCGGT CTGGCTGGCT TGCTGGGATC CGGACGTACC
GAAACCGCCG AAGTGATCTT CGGTATCAAA CCTGCTGACA GCGGCACGGC GTTGATCAAA
GGCAAACCGC AAACCCTGCG ATCGCCACAT CAGGCTTCAG TACTGGGCAT CGGATTCTGC
CCGGAAGACA GGAAAACCGA TGGCATCATC GCCGCCGCCT CGGTGCGGGA AAATATCATC
CTCGCTCTCC AGGCCCAGCG CGGCTGGCTA CGACCCATTT CCCGCAAAGA GCAGCAAGAG
ATTGCCGAAC GCTTTATCCG CCAGCTTGGC ATTCGCACAC CTTCAACTGA ACAACCGATT
GAATTTCTCT CTGGCGGCAA TCAGCAAAAA GTGTTGCTTT CACGTTGGCT ACTGACCCGA
CCGCAATTTC TGATCCTCGA TGAGCCAACC CGCGGCATTG ATGTTGGTGC CCACGCCGAG
ATCATCCGCC TGATTGAAAC GCTGTGTGCT GACGGTCTGG CGCTGTTGGT GATCTCCTCC
GAACTGGAAG AGCTGGTGGG CTATGCCGAT CGGGTGATTA TCATGCGCGA TCGCAAACAG
GTGGCGGAGA TCCCGCTGGC AGAGCTTTCC GTTCCGGCGA TCATGAACGC CATTGCGGCG
TAA
 
Protein sequence
MTTDQHQEIL RTEGLSKFFP GVKALDNVDF SLRRGEIMAL LGENGAGKST LIKALTGVYH 
ADRGTIWLEG QAISPKNTAH AQQLGIGTVY QEVNLLPNMS VADNLFIGRE PKRFGLLRRK
EMEKRATELM ASYGFSLDVR EPLNRFSVAM QQIVAICRAI DLSAKVLILD EPTASLDTQE
VELLFGLMRQ LRDRGVSLIF VTHFLDQVYQ VSDRITVLRN GSFVGCRETR ELPQIELVKM
MLGRELDTHA LQRAGRTLLS DKPVAAFKNY GKKGTIAPFD LEVRPGEIVG LAGLLGSGRT
ETAEVIFGIK PADSGTALIK GKPQTLRSPH QASVLGIGFC PEDRKTDGII AAASVRENII
LALQAQRGWL RPISRKEQQE IAERFIRQLG IRTPSTEQPI EFLSGGNQQK VLLSRWLLTR
PQFLILDEPT RGIDVGAHAE IIRLIETLCA DGLALLVISS ELEELVGYAD RVIIMRDRKQ
VAEIPLAELS VPAIMNAIAA
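
The gene and protein sequences can be cross-checked against each other. The sketch below is a minimal example, not the database's own pipeline: it strips the spaces that group residues in blocks of ten, computes GC content, and translates the DNA. It assumes the amino-acid assignments of translation table 11, which match the standard code; table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG. The helper names `clean`, `gc_percent`, and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G). These amino-acid assignments are
# shared by translation tables 1 and 11; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGACGACCG ACCAACACCA GGAGATCCTC CGCACCGAAG GATTAAGTAA ATTTTTCCCC"
print(translate(first_row))  # MTTDQHQEILRTEGLSKFFP
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.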