Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3563 |
Symbol | |
ID | 6145678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3646435 |
End bp | 3647460 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641618391 |
Product | amino acid ABC transporter, periplasmic amino acid-binding protein |
Protein accession | YP_001745538 |
Protein GI | 170682914 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA TGATGATAGC CACACTGGCT GCCGCCAGCG TGCTGCTTGC CGTTGCAAAT CAGGCGCATG CTGGCGCGAC GCTTGATGCC GTACAGAAAA AAGGTTTTGT GCAATGCGGG ATCAGTGATG GATTACCTGG GTTCTCTTAT GCCGATGCTG ACGGTAAATT TTCAGGTATT GATGTTGATA TTTGTCGTGG TGTTGCCGCT GCTGTATTTG GTGACGACAC GAAAGTGAAA TATACCCCGC TCACTGCAAA AGAACGCTTC ACCGCTTTAC AGTCAGGGGA GGTGGATTTG CTCTCCCGTA ATACGACCTG GACTTCATCT CGCGATGCCG GGATGGGAAT GGCATTTACC GGCGTCACTT ATTACGACGG CATTGGCTTC CTGACTCACG ATAAAGCGGG ACTAAAAAGC GCGAAAGAAC TGGATGGCGC TACCGTCTGT ATTCAGGCGG GTACTGATAC CGAACTCAAC GTTGCCGATT ACTTTAAGGC AAACAATATG AAGTACACAC CAGTGACTTT CGATCGCTCT GACGAATCAG CGAAGGCACT GGAATCTGGT CGCTGCGATA CGCTGGCCTC GGATCAATCA CAACTGTATG CCCTGCGCAT CAAATTAAGC AACCCGGCTG AATGGATTGT CTTACCGGAA GTTATCTCCA AAGAACCGCT TGGGCCGGTA GTTCGCCGTG GCGATGATGA ATGGTTCTCG ATTGTACGCT GGACGCTTTT CGCCATGCTG AATGCTGAAG AGATGGGCAT CAATTCCCAG AACGTCGATG AAAAAGCGGC TAATCCGGCA ACGCCTGATA TGGCACATCT GCTGGGTAAA GAAGGCGATT ACGGCAAGGA TCTGAAGCTG GATAATAAAT GGGCTTACAA CATCATCAAA CAGGTGGGTA ACTACTCGGA AATTTTTGAG CGTAACGTAG GTTCAGAAAG CCCGCTGAAA ATTAAACGTG GGCAAAATAA TCTCTGGAAT AACGGCGGAA TTCAGTACGC GCCGCCCGTG CGTTAA
|
Protein sequence | MKKMMIATLA AASVLLAVAN QAHAGATLDA VQKKGFVQCG ISDGLPGFSY ADADGKFSGI DVDICRGVAA AVFGDDTKVK YTPLTAKERF TALQSGEVDL LSRNTTWTSS RDAGMGMAFT GVTYYDGIGF LTHDKAGLKS AKELDGATVC IQAGTDTELN VADYFKANNM KYTPVTFDRS DESAKALESG RCDTLASDQS QLYALRIKLS NPAEWIVLPE VISKEPLGPV VRRGDDEWFS IVRWTLFAML NAEEMGINSQ NVDEKAANPA TPDMAHLLGK EGDYGKDLKL DNKWAYNIIK QVGNYSEIFE RNVGSESPLK IKRGQNNLWN NGGIQYAPPV R
|
| |