Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1838 |
Symbol | |
ID | 5592129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1854155 |
End bp | 1855321 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640920982 |
Product | putative ABC transporter solute-binding protein |
Protein accession | YP_001458534 |
Protein GI | 157161216 |
COG category | [R] General function prediction only |
COG ID | [COG4134] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 70 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCATT GTGGGTGGTT GCTGGGATTG TTATCGCTGT TTTCTCTGGC AACACATGCC AGTGACTGGC AAGAAATTAA AAATGAGGCC AAAGGGCAAA CCGTCTGGTT TAACGCCTGG GGCGGCGATA CCGCAATTAA CCGCTATCTC GACTGGGTGA GCGGCGAGAT GAAAACCCAT TACGCTATAA ACCTGAAGAT TGTCCGTCTG GCGGATGCCG CAGACGCGGT GAAGCGCATT CAGACCGAAG CCGCAGCCGG ACGTAAAACG GGCGGCTCGG TGGATCTGCT CTGGGTGAAC GGCGAAAACT TCCGCACCTT AAAAGAGGCC AATTTATTAC AAACGGGCTG GGCGGAGACT CTGCCCAACT GGCGCTATGT CGACACACAG CTGCCGGTGC GGGAAGATTT TTCTGTGCCG ACACAAGGTG CGGAATCGCC CTGGGGCGGC GCACAACTGA CGTTTATCGC CCGCCGCGAT GTTACGCCAC AGCCACCACA AACGCCGCAA GCCTTACTGG AGTTTGCTAA AGCCAATCCC GGCACGGTTA CCTATCCGCG CCCACCGGAC TTTACCGGCA CGGCGTTTCT TGAACAGTTG CTGATTATGC TGACGCCCGA TCCCGCCGCA TTAAAAGAAG CGCCGGACGA TGCGACTTTC GCCCGTGTCA CTGCTCCCTT GTGGCAATAT CTTGATGTGC TGCATCCGTA TTTGTGGCGC GAAGGAAAGG ATTTCCCGCC TTCACCCGCG CGGATGGATG CTCTGCTGAA AGCCGGCACA TTGCGCCTGT CGCTGACCTT TAACCCCGCG CATGCGCAGC AAAAAATCGC CAGCGGCGAT TTGCCTGCAA GCAGTTACAG TTTTGGCTTT CGCGAGGGGA TGATTGGCAA CGTGCATTTC GTCACCATTC CTGCCAACGC GAATGCCAGT GCTGCGGCGA AGGTAGTTGC CAATTTCCTG CTCTCACCCG ATGCGCAACT GCGTAAAGCA GATCCCGCTG TCTGGGGCGA TCCTTCTGTT CTCGATCCGC AAAAACTGCC TGACGGGCAG CGCGAATCAT TGCAATCAAG AATGCCGCAG GATCTGCCGC CGGTACTGGC TGAACCGCAC GCAGGTTGGG TAAATGCGCT GGAACAAGAA TGGCTACACC GTTACGGTAC GCATTAA
|
Protein sequence | MRHCGWLLGL LSLFSLATHA SDWQEIKNEA KGQTVWFNAW GGDTAINRYL DWVSGEMKTH YAINLKIVRL ADAADAVKRI QTEAAAGRKT GGSVDLLWVN GENFRTLKEA NLLQTGWAET LPNWRYVDTQ LPVREDFSVP TQGAESPWGG AQLTFIARRD VTPQPPQTPQ ALLEFAKANP GTVTYPRPPD FTGTAFLEQL LIMLTPDPAA LKEAPDDATF ARVTAPLWQY LDVLHPYLWR EGKDFPPSPA RMDALLKAGT LRLSLTFNPA HAQQKIASGD LPASSYSFGF REGMIGNVHF VTIPANANAS AAAKVVANFL LSPDAQLRKA DPAVWGDPSV LDPQKLPDGQ RESLQSRMPQ DLPPVLAEPH AGWVNALEQE WLHRYGTH
|
| |