Gene EcSMS35_4941 details


Gene Information

Locus tag: EcSMS35_4941
Symbol:
ID: 6143281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5055742
End bp: 5057409
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 55%
IMG OID: 641619744
Product: putative ABC transporter ATP-binding protein
Protein accession: YP_001746848
Protein GI: 170683563
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
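
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5055742
END_BP = 5057409


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1668 555
```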


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.963786
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TTGTTCCGCC GAAACGTCAT 
ATTTTGAAAA ACATCTCTCT GAGTTTCTTC CCTGGGGCAA AAATTGGTGT CCTGGGTCTG
AACGGCGCGG GTAAGTCTAC CCTGCTGCGC ATTATGGCGG GCATTGATAA AGATATCGAA
GGTGAAGCGC GTCCGCAGCC AGACATCAAG ATTGGTTACC TGCCGCAGGA ACCTCAGCTG
AACCCGGAAC ACACCGTGCG TGAGTCCATT GAAGAAGCGG TTTCTGAAGT GGTTAACGCT
CTGAAACGTC TGGATGAAGT GTATGCGCTG TACGCCGATC CGGATGCCGA TTTTGACAAG
CTGGCCGCTG AACAAGGCCG TCTGGAAGAG ATCATTCAGG CTCATGACGG TCATAACCTG
AACGTTCAGC TGGAGCGTGC GGCTGATGCG CTGCGTCTGC CGGACTGGGA CGCGAAAATC
GCTAACCTTT CCGGTGGTGA GCGTCGTCGC GTGGCGTTGT GCCGCCTGCT GTTGGAAAAA
CCAGACATGC TGCTGCTCGA CGAACCGACC AACCACCTGG ATGCCGAATC CGTGGCCTGG
CTGGAACGCT TCCTGCACGA CTTCGAGGGT ACCGTGGTGG CGATTACCCA CGACCGTTAC
TTCCTCGATA ACGTTGCAGG CTGGATCCTC GAACTTGACC GCGGTGAAGG TATTCCGTGG
GAAGGCAACT ACTCCTCCTG GCTGGAGCAG AAAGATCAGC GCCTGGCGCA GGAAGCTTCA
CAAGAAGCGG CGCGTCGTAA GTCGATTGAG AAAGAGCTGG AGTGGGTACG TCAGGGAACT
AAAGGCCGCC AGTCGAAAGG TAAAGCCCGT CTGGCGCGCT TTGAAGAGCT GAACAGCACC
GAATATCAGA AACGTAACGA AACCAACGAA CTGTTTATTC CACCTGGACC ACGTCTGGGC
GACAAAGTGC TGGAAGTCAG CAATCTGCGT AAATCCTATG GCGATCGTCT GCTGATTGAC
TCCCTGAGTT TCTCGATCCC GAAAGGGGCG ATCGTCGGGA TCATCGGTCC GAACGGTGCG
GGTAAATCGA CCCTGTTCCG TATGATCTCC GGTCAGGAAC AGCCGGACAG CGGCACCATC
ACTTTGGGTG AAACGGTGAA ACTGGCATCG GTTGATCAGT TCCGTGACTC AATGGATAAC
AGCAAAACCG TTTGGGAAGA AGTTTCCGGC GGGCTGGATA TCATGAAGAT CGGCAACACC
GAGATGCCAA GCCGCGCCTA CGTTGGCCGC TTTAACTTTA AAGGGGTTGA TCAGGGTAAA
CGCGTTGGCG AACTTTCCGG TGGTGAGCGT GGTCGTCTGC ATCTGGCGAA GCTGCTGCAG
GTTGGCGGCA ACATGCTGCT GCTCGACGAA CCGACCAACG ACCTGGATAT CGAAACCCTG
CGCGCGCTGG AAAACGCCCT GCTGGAGTTC CCGGGCTGCG CGATGGTTAT CTCGCACGAC
CGTTGGTTCC TCGACCGTAT CGCCACACAC ATTCTGGATT ACCAGGATGA AGGTAAAGTT
GAGTTCTTTG AAGGTAACTT TACCGAGTAC GAAGAGTACA AGAAACGCAC GCTGGGCGCA
GACGCGCTGG AGCCGAAGCG TATCAAGTAC AAGCGTATTG CGAAGTAA
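
The GC content reported for this gene can be recomputed from the sequence as listed. The sketch below strips the spaces and line breaks that group the bases into blocks of ten, then counts G and C. The helper names are hypothetical, and the example input is only the first line of the gene sequence, so its value will differ from the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste all lines to check the full gene.
first_line = "GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TTGTTCCGCC GAAACGTCAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```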
 
Protein sequence
MAQFVYTMHR VGKVVPPKRH ILKNISLSFF PGAKIGVLGL NGAGKSTLLR IMAGIDKDIE 
GEARPQPDIK IGYLPQEPQL NPEHTVRESI EEAVSEVVNA LKRLDEVYAL YADPDADFDK
LAAEQGRLEE IIQAHDGHNL NVQLERAADA LRLPDWDAKI ANLSGGERRR VALCRLLLEK
PDMLLLDEPT NHLDAESVAW LERFLHDFEG TVVAITHDRY FLDNVAGWIL ELDRGEGIPW
EGNYSSWLEQ KDQRLAQEAS QEAARRKSIE KELEWVRQGT KGRQSKGKAR LARFEELNST
EYQKRNETNE LFIPPGPRLG DKVLEVSNLR KSYGDRLLID SLSFSIPKGA IVGIIGPNGA
GKSTLFRMIS GQEQPDSGTI TLGETVKLAS VDQFRDSMDN SKTVWEEVSG GLDIMKIGNT
EMPSRAYVGR FNFKGVDQGK RVGELSGGER GRLHLAKLLQ VGGNMLLLDE PTNDLDIETL
RALENALLEF PGCAMVISHD RWFLDRIATH ILDYQDEGKV EFFEGNFTEY EEYKKRTLGA
DALEPKRIKY KRIAK
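
The protein sequence begins with M although the gene sequence begins with GTG, which ordinarily encodes valine. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the permitted initiation codons, and an initiation codon is translated as methionine. A minimal sketch of that rule follows. The codon table is built by hand in the conventional TCAG order, the start-codon set is taken from the NCBI definition of table 11, and `translate_cds` is an illustrative helper, not a library function.

```python
from itertools import product

# Codon table in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons allowed by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TTGTTCCGCC GAAACGTCAT"
print(translate_cds(first_line))  # prints: MAQFVYTMHRVGKVVPPKRH
```

The output matches the first twenty residues of the protein sequence listed above.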