Gene Information |
Locus tag | EcHS_A2170 |
Symbol | |
ID | 5594948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2150112 |
End bp | 2151407 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640921303 |
Product | putative ABC transporter |
Protein accession | YP_001458842 |
Protein GI | 157161524 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component |
TIGRFAM ID | |
|
|
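The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2150112, 2151407)
print(length)                              # 1296
print(expected_protein_length_aa(length))  # 431
```

Both values agree with the record: 1296 bp is 432 codons, of which 431 code for amino acids and one is the stop codon.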
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATCA AAGTTCAGCA CGTCGGCAAG GCGTATAAAT ATTATCCATC TAAATGGAAC CGGGTCATTG AGAAACTACT GCCGGGCGAT AAGCCGCGGC ACAGCAAGAA ATGGGTATTG AAAGATATCA ATTTCAGCAT TGAACCTGGG GAAGCGGTCG GCATTGTTGG GGTGAACGGC GCAGGTAAAA GTACGTTACT GAAGCTGCTG ACTGGCACCA CTCAGCCCAC CAAAGGTAGC ATTGAAATCC AGGGGCGCGT CGCCGCGCTG CTGGAGCTGG GCATGGGCTT CCATCCTGAC TTTACCGGCC GGCAGAACGT GTATATGTCC GGGCTGATGA TGGGCCTGGG CCGGGAAGAG ATCGAGCGCT TAATGCCGGA GATCGAAGCC TTCGCCGATA TCGGCGACTA CATTGAAGAG CCCGTACGCA TCTATTCCAG CGGGATGCAA ATGCGCCTGG CGTTCGCTGT CGCCACGGCC TCCCGGCCGG ATATTCTGAT CGTCGATGAA GCGCTTTCCG TTGGTGATTC CCGTTTTCAA GCGAAGTGCT ATGCCCGTAT TGCGGACTTT AAAAAGCAGG GCACCACGCT GCTGCTGGTC TCACACAGCG CCGGGGATAT CGTCAAACAC TGCGACCGCG CCATTTTCCT CAAAAATGGT GATATCTGTA TGGACGGCAC CGCCCGTGAC GTGACCAACC GTTATCTGGA TGAGCTGTTT GGAAAAGCCG ACAAAAACAG CGCGCCAAAA AGCGAAACGG CAACCTCGTC AGCCAGCGGC GAAAGTCAGA TGTCTCTCGA TGAGATTGAA GATGTGTACC ACACGCGCCC AGGCTATCGT CCTGAAGAAT ACCGTTGGGG GCAGGGGGGC GCAAAAATCA TTGATTATCA CATCCAGAGC GCCGGGGTTG ATTTTCCTCC CTCACTGACG GGCAATCAGC AGACCGATTT TCTGATGAAG GTCGTGTTTG AATACGATTT TGATTGCGTG GTGCCAGGCA TCCTGATTAA GACCCTCGAT GGCTTATTCC TCTACGGAAC CAACTCTTTC CTCGCCTCTG AAGGGCGGGA AAATATTTCG GTTTCCCGCG GCGATGTTCG GGTCTTTAAA TTTAGCCTTC CGGTTGATTT AAATAGTGGC GATTACCTGC TGTCATTTGG TATCTCTGCC GGCAACCCGC AGACGGATAT GACGCCGCTG GACAGACGTT ACGACTCCAT TATTTTACAT GTGACCAAGA GCATGGATTT CTGGGGTGTC ATCGATCTTA AGTCGTCCTT TACTAGCTAC CAATGA
|
Protein sequence | MSIKVQHVGK AYKYYPSKWN RVIEKLLPGD KPRHSKKWVL KDINFSIEPG EAVGIVGVNG AGKSTLLKLL TGTTQPTKGS IEIQGRVAAL LELGMGFHPD FTGRQNVYMS GLMMGLGREE IERLMPEIEA FADIGDYIEE PVRIYSSGMQ MRLAFAVATA SRPDILIVDE ALSVGDSRFQ AKCYARIADF KKQGTTLLLV SHSAGDIVKH CDRAIFLKNG DICMDGTARD VTNRYLDELF GKADKNSAPK SETATSSASG ESQMSLDEIE DVYHTRPGYR PEEYRWGQGG AKIIDYHIQS AGVDFPPSLT GNQQTDFLMK VVFEYDFDCV VPGILIKTLD GLFLYGTNSF LASEGRENIS VSRGDVRVFK FSLPVDLNSG DYLLSFGISA GNPQTDMTPL DRRYDSIILH VTKSMDFWGV IDLKSSFTSY Q
|
| |
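The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used to build the record. It assumes the gene sequence is given in the coding direction (it starts with ATG and ends with TGA, so no reverse complement is applied even though the gene lies on the minus strand) and that whitespace between the 10-base groups should be ignored. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table suffices here. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in the record.
print(translate("ATGAGTATCA AAGTTCAGCA CGTCGGCAAG"))  # MSIKVQHVGK
```

The printed peptide matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.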