Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1286 |
Symbol | araG |
ID | 6145114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1274447 |
End bp | 1275961 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616164 |
Product | L-arabinose transporter ATP-binding protein |
Protein accession | YP_001743344 |
Protein GI | 170682274 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00939133 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000000120675 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAACAGT CTACCCCGTA TCTCTCATTT CGCGGCATCG GTAAAACATT TCCCGGCGTT AAGGCGCTGA CGGATATTAG TTTTGATTGC TATGCCGGTC AGGTTCATGC GTTGATGGGT GAAAATGGCG CAGGAAAATC AACTCTCTTA AAAATCCTCA GCGGCAACTA TGCGCCAACC ACGGGTTCTG TAGTGATTAA TGGGCAGGAA ATGTCCTTTT CCGACACGAC CGCAGCACTT AATGCGGGTG TGGCGATTAT TTACCAGGAA CTGCATCTCG TGCCGGAAAT GACCGTCGCG GAAAACATCT ATCTCGGCCA GCTGCCGCAT AAAGGCGGCA TTGTGAATCG CTCATTGCTG AATTATGAGG CGGGTTTACA ACTTAAACAT CTTGGTATGG ATATTGACCC GGACACGCCG CTGAAATATC TCTCCATTGG TCAGTGGCAG ATGGTTGAAA TCGCCAAGGC GCTGGCGCGT AACGCCAAAA TTATCGCCTT TGATGAGCCA ACCAGCTCCC TTTCTGCCCG CGAAATCGAC AATCTTTTCC GCGTTATTCG TGAACTGCGA AAAGAGGGGC GGGTGATCTT ATACGTTTCT CACCGTATGG AAGAAATATT TGCCCTCAGC GATGCCATCA CCGTCTTTAA AGATGGACGT TATGTCAAAA CCTTTACCGA TATGCAGCAG GTTGACCACG ACGCGCTGGT GCAGGCGATG GTCGGGCGCG ACATTGGCGA TATCTACGGC TGGCAACCTC GTAGTTATGG CGAGGAGCGC CTGCGTCTTG ATGCTGTGAA AGCACCAGGC GTGCGTACGC CAATAAGTCT GGCGGTTCGC AGTGGTGAAA TTGTCGGTCT GTTTGGTCTG GTAGGAGCGG GGCGTAGCGA ATTAATGAAA GGCTTGTTTG GCGGGACGCA AATCACCGCC GGTCAGGTTT ATATCGACCA ACAGCCGATC GATATTCGTA AACCGAGCCA CGCCATTGCC GCAGGCATGA TGCTCTGCCC GGAAGATCGC AAAGCCGAAG GCATTATTCC CGTGCACTCC GTTCGCGACA ATATCAACAT CAGTGCCAGA CGTAAACATG TGCTCGGCGG TTGTGTAATC AACAACGGTT GGGAAGAAAA CAATGCCGAT CACCACATTC GTTCGCTCAA CATCAAAACG CCGGGCGCTG AGCAACTGAT CATGAATCTC TCAGGCGGAA ATCAGCAAAA AGCCATTCTG GGCCGCTGGT TATCGGAAGA GATGAAGGTC ATTTTGCTGG ATGAACCTAC GCGCGGCATT GATGTTGGTG CTAAGCATGA AATTTACAAC GTGATTTATG CGCTGGCGGC GCAGGGTGTG GCGGTGCTGT TTGCCTCCAG CGACTTACCT GAAGTCCTCG GCGTTGCCGA CCGGATTGTG GTGATGCGGG AAGGTGAAAT CGCCGGTGAA TTGTTACACG AGCAGGCAGA TGAGCGTCAG GCACTGAGCC TTGCGATGCC TAAAGTCAGC CAGGCAGTTG CCTGA
|
Protein sequence | MQQSTPYLSF RGIGKTFPGV KALTDISFDC YAGQVHALMG ENGAGKSTLL KILSGNYAPT TGSVVINGQE MSFSDTTAAL NAGVAIIYQE LHLVPEMTVA ENIYLGQLPH KGGIVNRSLL NYEAGLQLKH LGMDIDPDTP LKYLSIGQWQ MVEIAKALAR NAKIIAFDEP TSSLSAREID NLFRVIRELR KEGRVILYVS HRMEEIFALS DAITVFKDGR YVKTFTDMQQ VDHDALVQAM VGRDIGDIYG WQPRSYGEER LRLDAVKAPG VRTPISLAVR SGEIVGLFGL VGAGRSELMK GLFGGTQITA GQVYIDQQPI DIRKPSHAIA AGMMLCPEDR KAEGIIPVHS VRDNINISAR RKHVLGGCVI NNGWEENNAD HHIRSLNIKT PGAEQLIMNL SGGNQQKAIL GRWLSEEMKV ILLDEPTRGI DVGAKHEIYN VIYALAAQGV AVLFASSDLP EVLGVADRIV VMREGEIAGE LLHEQADERQ ALSLAMPKVS QAVA
|
| |