Gene Information |
Locus tag | EcSMS35_4553 |
Symbol | alsA |
ID | 6143484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4655056 |
End bp | 4656588 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641619369 |
Product | D-allose transporter ATP-binding protein |
Protein accession | YP_001746481 |
Protein GI | 170683675 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACGC CATATATATC GATGGCGGGG ATCGGCAAGT CCTTTGGTCC GGTTCACGCA TTAAAGTCGG TTAATTTAAC GGTTTATCCT GGTGAAATAC ATGCATTACT AGGAGAAAAT GGTGCAGGTA AATCTACGCT AATGAAAGTG TTATCCGGAA TACATGAGCC GACCAAAGGC ACCATTACCA TTAATAACAT CAACTATAAC AAACTGGATC ATAAATTAGC GGCACAACTC GGTATCGGGA TTATTTATCA GGAACTCAGC GTTATTGATG AATTAACCGT ACTGGAAAAT TTATATATTG GTCGTCATCT GACGAAAAAA ATCTGCGGTG TCAATATTAT CGACTGGCGA GAAATGCGTG TCCGCGCCGC CATGATGTTA TTACGCGTGG GCTTGAAAGT TGATTTAGAT GAGAAAGTGG CGAACTTATC TATCAGCCAC AAGCAGATGC TGGAAATTGC CAAAACGCTG ATGCTCGACG CCAAAGTCAT CATCATGGAT GAACCCACCT CCTCACTCAC CAATAAAGAG GTGGACTATC TGTTTCTGAT CATGAATCAG TTGCGTAAGG AGGGTACGGC TATCGTCTAT ATCTCGCATA AATTGGCGGA AATTCGCCGT ATTTGCGACC GCTATACGGT GATGAAAGAC GGCAGCAGCG TTTGCAGCGG CATGGTGAGC GATGTGTCAA ATGACGATAT CGTCCGTCTG ATGGTAGGCC GCGAACTGCA AAACCGTTTT AACGCGATGA AGGAGAATGT CAGCAACCTT GCGCACGAAA CGGTTTTTGA GGTGCGGAAC GTCACCAGCC GTGACAGAAA AAAGGTCCGG GATATCTCAT TTAGCGTCTG CCGGGGAGAA ATATTAGGCT TTGCCGGACT GGTCGGTTCC GGACGTACTG AACTGATGAA CTGCCTGTTT GGCGTTGATA AACGCGCTGG CGGAGAAATC CGTCTTAATG ACAAAGATAT CTCTCCACGC TCACCCCTGG ATGCCGTGAA AAAAGGGATG GCTTACATCA CTGAAAGCCG CCGGGATAAC GGTTTTTTCC CCAACTTTTC CATCGCTCAG AACATGGCGA TCAGCCGCAG TCTGAAAGAC GGCGGCTATA AAGGCGCGAT GGGCTTGTTT CATGAAGTTG ACGAGCAACG TACCGCTGAA AATCAACGCG AACTGCTGGC GCTGAAATGT CATTCGATAA ACCAGAATAT CACCGAACTC TCCGGGGGGA ATCAGCAGAA AGTCCTGATC TCCAAATGGC TGTGCTGTTG CCCGGAAGTG ATCATTTTCG ATGAACCTAC CCGCGGCATC GACGTTGGCG CGAAAGCCGA AATTTACAAA GTGATGCGCC AACTGGCGGA CGACGGAAAA GTCATCCTGA TGGTGTCATC TGAACTACCT GAAATTATCA CCGTCTGCGA CCGCATCGCC GTGTTCTGCG AAGGACGACT GACGCAAATC CTGACGAATC GCGATGACAT GAGCGAAGAG GAGATTATGG CATGGGCTTT ACCACAAGAG TAA
|
Protein sequence | MATPYISMAG IGKSFGPVHA LKSVNLTVYP GEIHALLGEN GAGKSTLMKV LSGIHEPTKG TITINNINYN KLDHKLAAQL GIGIIYQELS VIDELTVLEN LYIGRHLTKK ICGVNIIDWR EMRVRAAMML LRVGLKVDLD EKVANLSISH KQMLEIAKTL MLDAKVIIMD EPTSSLTNKE VDYLFLIMNQ LRKEGTAIVY ISHKLAEIRR ICDRYTVMKD GSSVCSGMVS DVSNDDIVRL MVGRELQNRF NAMKENVSNL AHETVFEVRN VTSRDRKKVR DISFSVCRGE ILGFAGLVGS GRTELMNCLF GVDKRAGGEI RLNDKDISPR SPLDAVKKGM AYITESRRDN GFFPNFSIAQ NMAISRSLKD GGYKGAMGLF HEVDEQRTAE NQRELLALKC HSINQNITEL SGGNQQKVLI SKWLCCCPEV IIFDEPTRGI DVGAKAEIYK VMRQLADDGK VILMVSSELP EIITVCDRIA VFCEGRLTQI LTNRDDMSEE EIMAWALPQE
|
| |
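The length fields in this record follow from the coordinates and can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (consistent with 1533 = 4656588 - 4655056 + 1) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the stop, which is not translated.
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(4655056, 4656588)
    print(length)                   # 1533
    print(protein_length(length))   # 510
```

Passing the gene sequence above to `gc_percent` (the 10-base spacing is ignored) gives a value that can be compared with the GC content field.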