Gene EcSMS35_3634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3634 
Symbol 
ID6143313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3693419 
End bp3695332 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content56% 
IMG OID641618461 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_001745601 
Protein GI170679823 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0000338837 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTGTTT TCTCCTCGTT ACAAATTCGT CGCGGCGTGC GCGTCCTGCT GGATAATGCC 
ACCGCCACCA TCAACCCCGG GCAGAAAGTC GGCCTGGTGG GTAAAAACGG CTGTGGTAAA
TCTACCCTGC TGGCATTGCT GAAAAATGAA ATCAGCGCCG ACGGCGGCAG CTACACCTTT
CCGGGAAGCT GGCAACTGGC GTGGGTGAAT CAGGAAACAC CAGCGTTACC GCAAGCGGCG
CTGGAATATG TCATTGACGG CGACCGTGAA TATCGTCAAC TGGAAGCGCA GCTATACGAC
GCCAACGAAC GTAACGACGG TCACGCCATT GCGACCATTC ATGGCAAGCT GGATGCTATC
GACGCATGGA GTATTCGCTC CCGCGCCGCC AGCCTGCTGC ACGGCCTCGG TTTCAGCAAC
GAACAACTGG AGCGTCCGGT AAGTGATTTC TCCGGTGGCT GGCGTATGCG CCTTAACCTT
GCTCAGGCGC TGATTTGCCG TTCAGACTTG CTGCTGCTCG ACGAACCGAC TAACCACCTC
GATCTCGATG CCGTTATCTG GCTGGAAAAA TGGCTGAAGA GCTATCAGGG CACGCTGATC
CTGATCTCTC ACGACCGCGA CTTCCTCGAT CCGATCGTCG ATAAAATCAT TCATATCGAA
CAACAAAGCA TGTTCGAGTA CACCGGTAAC TACAGCTCAT TTGAAGTGCA GCGCGCCACC
CGTCTGGCGC AGCAACAGGC GATGTATGAA AGCCAGCAGG AACGCGTGGC GCATCTGCAA
AGTTATATCG ACCGTTTCCG TGCCAAAGCC ACCAAAGCGA AGCAGGCCCA GAGCCGCATT
AAGATGCTCG AGCGTATGGA GCTGATTGCC CCCGCACACG TCGACAACCC GTTCCGCTTT
AGCTTCCGCG CGCCGGAAAG CCTGCCAAAT CCGTTACTGA AGATGGAAAA AGTCAGTGCG
GGCTATGGTG ATCGCATTAT TCTCGACTCG ATTAAACTGA ATCTGGTCCC CGGCTCGCGC
ATTGGTCTGT TAGGCCGCAA CGGCGCGGGT AAATCGACAT TAATCAAACT GTTAGCCGGC
GAACTTGCGC CAGTCAGCGG TGAAATTGGT CTGGCAAAAG GGATCAAGCT CGGTTACTTC
GCCCAGCATC AACTGGAATA CCTGCGCGCC GACGAATCGC CTATTCAACA TCTGGCACGT
TTAGCGCCGC AGGAGCTGGA GCAAAAACTG CGTGACTACC TCGGCGGCTT TGGTTTCCAG
GGCGATAAAG TAACCGAAGA AACACGCCGC TTCTCCGGTG GGGAAAAAGC CCGCCTGGTG
CTGGCATTAA TTGTCTGGCA GCGTCCGAAT CTGCTGCTGC TCGACGAACC GACCAACCAC
CTTGACCTCG ACATGCGTCA GGCACTCACC GAAGCATTAA TCGAGTTCGA AGGTGCGCTG
GTTGTCGTCT CGCACGACCG TCATTTGCTG CGTTCCACCA CTGACGATCT CTACCTGGTT
CACGATCGTA AAGTCGAACC GTTCGACGGC GATCTGGAAG ATTATCAACA GTGGTTGAGC
GACGTACAAA AGCAGGAAAA CCAGACCGAC GAAGCGCCAA AAGAGAACGC GAACAGCGCC
CAGGCACGTA AAGATCAGAA GCGTCGGGAA GCGGAGCTGC GTGCGCAAAC CCAGCCACTG
CGTAAAGAGA TTGCCCGTCT GGAAAAAGAG ATGGAGAAGC TGAACGCGCA ACTGGCGCAG
GCGGAAGAGA AACTCGGCGA CAGCGAACTG TATGATCAGA GCCGTAAAGC GGAGTTGACC
GCCTGCCTGC AACAGCAAGC CAGCGCCAAA TCCGGCCTGG AAGAGTGCGA AATGGCGTGG
CTGGAAGCCC AGGAGCAGCT TGAGCAGATG CTGCTGGAAG GCCAAAGCAA CTGA
 
Protein sequence
MIVFSSLQIR RGVRVLLDNA TATINPGQKV GLVGKNGCGK STLLALLKNE ISADGGSYTF 
PGSWQLAWVN QETPALPQAA LEYVIDGDRE YRQLEAQLYD ANERNDGHAI ATIHGKLDAI
DAWSIRSRAA SLLHGLGFSN EQLERPVSDF SGGWRMRLNL AQALICRSDL LLLDEPTNHL
DLDAVIWLEK WLKSYQGTLI LISHDRDFLD PIVDKIIHIE QQSMFEYTGN YSSFEVQRAT
RLAQQQAMYE SQQERVAHLQ SYIDRFRAKA TKAKQAQSRI KMLERMELIA PAHVDNPFRF
SFRAPESLPN PLLKMEKVSA GYGDRIILDS IKLNLVPGSR IGLLGRNGAG KSTLIKLLAG
ELAPVSGEIG LAKGIKLGYF AQHQLEYLRA DESPIQHLAR LAPQELEQKL RDYLGGFGFQ
GDKVTEETRR FSGGEKARLV LALIVWQRPN LLLLDEPTNH LDLDMRQALT EALIEFEGAL
VVVSHDRHLL RSTTDDLYLV HDRKVEPFDG DLEDYQQWLS DVQKQENQTD EAPKENANSA
QARKDQKRRE AELRAQTQPL RKEIARLEKE MEKLNAQLAQ AEEKLGDSEL YDQSRKAELT
ACLQQQASAK SGLEECEMAW LEAQEQLEQM LLEGQSN