Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2326 |
Symbol | |
ID | 6146038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2358022 |
End bp | 2359836 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617200 |
Product | ABC transporter, periplasmic solute-binding protein |
Protein accession | YP_001744373 |
Protein GI | 170681154 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00440714 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTGTGC GCATACTGCT GCTGTTTATC GCTCTGTTCA CCTTTGGTGC GCAGGCGCAG GCTATCAAGG AAAGCTATGC CTTTGCCGTA CTGGGCGAAC CCCGGTACGC ATTTAATTTC AACCATTTTG ATTATGTGAA CCCCGCCGCG CCAAAAGGTG GGCAAATAAC GTTGTCCGCC CTCGGCACCT TCGATAATTT CAACCGCTAT GCACTACGCG GCAACCCGGG CGCACGCACC GAGCAGCTGT ACGACACGCT ATTTACGACT TCCGATGACG AACCAGGCAG TTATTACCCG CTGATTGCTG AAAGCGCACG CTATGCTGAC GATTATTCCT GGGTGGAGGT CGCTATTAAT CCGCGCGCCC GTTTTCATGA TGGTTCGCCC ATTACTGCCC GCGATGTAGA GTTTACTTTT CAAAAATTTA TGACCGAAGG CGTGCCGCAA TTTCGTCTGG TCTACAAAGG CACCACCGTC AAAGCCATTG CGCCGTTAAC CGTGCGCATT GAGTTAGCTA AACCCGGCAA AGAAGATATG CTGAGTCTGT TTTCGCTGCC GGTATTTCCA GAAAAGTACT GGAAAGATCA CAAACTTAGC GACCCGCTCG CCACGCCTCC GCTTGCCAGT GGTCCGTACC GCATTACGTC CTGGAAAATG GGGCAAAATA TTGTCTATTC CCGCGTAAAA GATTACTGGG CAGCAAACTT ACCGGTAAAC CGTGGACGCT GGAATTTCGA CACCATTCGC TACGATTATT ACCTCGATGA TAATGTCGCC TTTGAAGCGT TTAAAGCAGG TGCCTTTGAT TTGCGTATGG AAAACGACGC CAAAAACTGG GCCACGCGCT ATACCGGTAA AAATTTCGAT AAAAAATACA TCATCAAAGA TGAGCAAAAG AACGAATCAG CCCAGGATAC GCGCTGGCTG GCGTTTAATA TCCAACGTCC GGTATTCAGC GATCGCCGGG TGCGGGAAGC GATCACTCTC GCCTTTGACT TTGAATGGAT GAACAAAGCG TTGTTTTACA ATGCCTGGAG TCGCACGAAC AGTTATTTTC AGAATACCGA ATACGCGGCC AGAAATTACC CCGACGCCGC GGAGCTGGTG CTTCTGGCAC CAATGAAAAA AGATCTACCG CCAGAAGTCT TCACGCAAAT CTACCAGCCG CCGGTATCCA AAGGCGATGG CTACGATCGT GACAACCTGT TAAAAGCCGA CAAACTTCTT AACGAAGCAG GCTGGGTGCT GAAGGGTCAG CAACGCGTTA ATGCCACAAC GGGTCAGCCA CTCAGCTTTG AATTATTGCT TCCCTCAAGC AGTAATAGTC AGTGGGTATT GCCGTTCCAG CACAGCCTGC AACGTCTGGG TATCAACATG GATATTCGCA AGGTGGATAA CTCTCAAATC ACCAACCGCA TGCGCAGTCG CGACTATGAC ATGATGCCGC GCGTATGGCG GGCGATGCCG TGGCCCAGTT CCGATTTACA GATTTCCTGG TCATCGGAAT ATATCAATTC CACTTATAAT GCCCCCGGCG TGCAAAGCCC GGTTATCGAC TCGCTGATCA ACCAAATTAT TGCCGCGCAG GGAAATAAAG AAAAATTACT GCCGTTGGGG CGAGCACTGG ATCGCGTATT AACGTGGAAT TATTACATGC TGCCAATGTG GTATATGGCG GAAGACCGTC TCGCCTGGTG GGATAAATTC TCCCACCCCG CTGTACGCCC TGTTTACAGC CTGGGTATCG ATACCTGGTG GTATGACGTT AACAAAGCGA CGAAACTGCC GTCAGCCAGA CAACAGGGAG AGTAG
|
Protein sequence | MIVRILLLFI ALFTFGAQAQ AIKESYAFAV LGEPRYAFNF NHFDYVNPAA PKGGQITLSA LGTFDNFNRY ALRGNPGART EQLYDTLFTT SDDEPGSYYP LIAESARYAD DYSWVEVAIN PRARFHDGSP ITARDVEFTF QKFMTEGVPQ FRLVYKGTTV KAIAPLTVRI ELAKPGKEDM LSLFSLPVFP EKYWKDHKLS DPLATPPLAS GPYRITSWKM GQNIVYSRVK DYWAANLPVN RGRWNFDTIR YDYYLDDNVA FEAFKAGAFD LRMENDAKNW ATRYTGKNFD KKYIIKDEQK NESAQDTRWL AFNIQRPVFS DRRVREAITL AFDFEWMNKA LFYNAWSRTN SYFQNTEYAA RNYPDAAELV LLAPMKKDLP PEVFTQIYQP PVSKGDGYDR DNLLKADKLL NEAGWVLKGQ QRVNATTGQP LSFELLLPSS SNSQWVLPFQ HSLQRLGINM DIRKVDNSQI TNRMRSRDYD MMPRVWRAMP WPSSDLQISW SSEYINSTYN APGVQSPVID SLINQIIAAQ GNKEKLLPLG RALDRVLTWN YYMLPMWYMA EDRLAWWDKF SHPAVRPVYS LGIDTWWYDV NKATKLPSAR QQGE
|
| |