Gene Information |
Locus tag | EcSMS35_4941 |
Symbol | |
ID | 6143281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 5055742 |
End bp | 5057409 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619744 |
Product | putative ABC transporter ATP-binding protein |
Protein accession | YP_001746848 |
Protein GI | 170683563 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
| |
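The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5055742
END_BP = 5057409
GENE_LENGTH_BP = 1668
PROTEIN_LENGTH_AA = 555


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1668
print(expected_protein_length(GENE_LENGTH_BP))  # 555
```

Both results agree with the Gene Length and Protein Length fields of the record.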
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.963786 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TTGTTCCGCC GAAACGTCAT ATTTTGAAAA ACATCTCTCT GAGTTTCTTC CCTGGGGCAA AAATTGGTGT CCTGGGTCTG AACGGCGCGG GTAAGTCTAC CCTGCTGCGC ATTATGGCGG GCATTGATAA AGATATCGAA GGTGAAGCGC GTCCGCAGCC AGACATCAAG ATTGGTTACC TGCCGCAGGA ACCTCAGCTG AACCCGGAAC ACACCGTGCG TGAGTCCATT GAAGAAGCGG TTTCTGAAGT GGTTAACGCT CTGAAACGTC TGGATGAAGT GTATGCGCTG TACGCCGATC CGGATGCCGA TTTTGACAAG CTGGCCGCTG AACAAGGCCG TCTGGAAGAG ATCATTCAGG CTCATGACGG TCATAACCTG AACGTTCAGC TGGAGCGTGC GGCTGATGCG CTGCGTCTGC CGGACTGGGA CGCGAAAATC GCTAACCTTT CCGGTGGTGA GCGTCGTCGC GTGGCGTTGT GCCGCCTGCT GTTGGAAAAA CCAGACATGC TGCTGCTCGA CGAACCGACC AACCACCTGG ATGCCGAATC CGTGGCCTGG CTGGAACGCT TCCTGCACGA CTTCGAGGGT ACCGTGGTGG CGATTACCCA CGACCGTTAC TTCCTCGATA ACGTTGCAGG CTGGATCCTC GAACTTGACC GCGGTGAAGG TATTCCGTGG GAAGGCAACT ACTCCTCCTG GCTGGAGCAG AAAGATCAGC GCCTGGCGCA GGAAGCTTCA CAAGAAGCGG CGCGTCGTAA GTCGATTGAG AAAGAGCTGG AGTGGGTACG TCAGGGAACT AAAGGCCGCC AGTCGAAAGG TAAAGCCCGT CTGGCGCGCT TTGAAGAGCT GAACAGCACC GAATATCAGA AACGTAACGA AACCAACGAA CTGTTTATTC CACCTGGACC ACGTCTGGGC GACAAAGTGC TGGAAGTCAG CAATCTGCGT AAATCCTATG GCGATCGTCT GCTGATTGAC TCCCTGAGTT TCTCGATCCC GAAAGGGGCG ATCGTCGGGA TCATCGGTCC GAACGGTGCG GGTAAATCGA CCCTGTTCCG TATGATCTCC GGTCAGGAAC AGCCGGACAG CGGCACCATC ACTTTGGGTG AAACGGTGAA ACTGGCATCG GTTGATCAGT TCCGTGACTC AATGGATAAC AGCAAAACCG TTTGGGAAGA AGTTTCCGGC GGGCTGGATA TCATGAAGAT CGGCAACACC GAGATGCCAA GCCGCGCCTA CGTTGGCCGC TTTAACTTTA AAGGGGTTGA TCAGGGTAAA CGCGTTGGCG AACTTTCCGG TGGTGAGCGT GGTCGTCTGC ATCTGGCGAA GCTGCTGCAG GTTGGCGGCA ACATGCTGCT GCTCGACGAA CCGACCAACG ACCTGGATAT CGAAACCCTG CGCGCGCTGG AAAACGCCCT GCTGGAGTTC CCGGGCTGCG CGATGGTTAT CTCGCACGAC CGTTGGTTCC TCGACCGTAT CGCCACACAC ATTCTGGATT ACCAGGATGA AGGTAAAGTT GAGTTCTTTG AAGGTAACTT TACCGAGTAC GAAGAGTACA AGAAACGCAC GCTGGGCGCA GACGCGCTGG AGCCGAAGCG TATCAAGTAC AAGCGTATTG CGAAGTAA
|
Protein sequence | MAQFVYTMHR VGKVVPPKRH ILKNISLSFF PGAKIGVLGL NGAGKSTLLR IMAGIDKDIE GEARPQPDIK IGYLPQEPQL NPEHTVRESI EEAVSEVVNA LKRLDEVYAL YADPDADFDK LAAEQGRLEE IIQAHDGHNL NVQLERAADA LRLPDWDAKI ANLSGGERRR VALCRLLLEK PDMLLLDEPT NHLDAESVAW LERFLHDFEG TVVAITHDRY FLDNVAGWIL ELDRGEGIPW EGNYSSWLEQ KDQRLAQEAS QEAARRKSIE KELEWVRQGT KGRQSKGKAR LARFEELNST EYQKRNETNE LFIPPGPRLG DKVLEVSNLR KSYGDRLLID SLSFSIPKGA IVGIIGPNGA GKSTLFRMIS GQEQPDSGTI TLGETVKLAS VDQFRDSMDN SKTVWEEVSG GLDIMKIGNT EMPSRAYVGR FNFKGVDQGK RVGELSGGER GRLHLAKLLQ VGGNMLLLDE PTNDLDIETL RALENALLEF PGCAMVISHD RWFLDRIATH ILDYQDEGKV EFFEGNFTEY EEYKKRTLGA DALEPKRIKY KRIAK
|
| |
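The gene and protein sequences above can be checked against each other, and the GC content recomputed, with a short script. This is a sketch under stated assumptions: the codon assignments are those of the standard genetic code, which translation table 11 shares, and only ATG, GTG and TTG are treated as initiator codons (a simplification; table 11 permits a few more). The helper names are hypothetical. The example translates only the first 30 and last 18 bases shown in the Gene sequence field; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplifying assumption: only these initiators are read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, dropping a trailing stop codon."""
    dna = clean(dna)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons[0] in START_CODONS:
        protein[0] = "M"  # an alternative start codon still encodes Met
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


head = "GTGGCTCAAT TCGTTTATAC CATGCATCGT"  # first 30 bases of the gene
tail = "AAGCGTATTG CGAAGTAA"               # last 18 bases of the gene
print(translate_cds(head))  # MAQFVYTMHR
print(translate_cds(tail))  # KRIAK
```

The two outputs match the first and last residues of the Protein sequence field, including the GTG start codon being read as methionine. Calling `gc_percent` on the full gene sequence gives a value to compare with the GC content field.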