Gene Information |
Locus tag | EcHS_A1996 |
Symbol | araG |
ID | 5595247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2003182 |
End bp | 2004696 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640921142 |
Product | L-arabinose transporter ATP-binding protein |
Protein accession | YP_001458690 |
Protein GI | 157161372 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
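The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. Neither assumption is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 2003182
end_bp = 2004696

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the sketch reproduces the listed gene length of 1515 bp and protein length of 504 aa, with no leftover bases.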
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.00548515 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAGT CTACCCCGTA TCTCTCATTT CGCGGCATCG GTAAAACGTT TCCCGGCGTT AAGGCGCTGA CGGATATTAG TTTTGACTGC TATGCCGGTC AGGTTCATGC GTTGATGGGT GAAAATGGCG CAGGAAAATC AACTCTCTTA AAAATCCTCA GCGGCAACTA TGCGCCAACC ACGGGTTCTG TAGTGATTAA TGGGCAGGAA ATGTCCTTTT CCGACACGAC CGCAGCACTT AACGCGGGCG TGGCGATTAT TTACCAGGAA CTGCATCTCG TGCCGGAAAT GACCGTCGCG GAAAACATCT ATCTCGGCCA GCTGCCGCAT AAAGGCGGCA TTGTGAATCG CTCATTGCTG AATTATGAGG CGGGTTTACA ACTTAAACAT CTTGGTATGG ATATTGACCC GGACACGCCG CTGAAATATC TCTCCATTGG TCAGTGGCAG ATGGTTGAAA TCGCCAAAGC GCTGGCGCGT AACGCCAAAA TTATCGCCTT TGATGAGCCA ACCAGCTCCC TCTCTGCCCG TGAAATCGAC AATCTTTTCC GCGTTATTCG TGAACTGCGA AAAGAGGGGC GGGTAATCTT ATACGTTTCT CACCGTATGG AAGAAATATT TGCCCTCAGC GATGCCATTA CTGTCTTTAA AGATGGACGT TATGTCAAAA CCTTTACCGA TATGCAGCAG GTTGACCACG ACGCGCTGGT GCAGGCGATG GTCGGGCGCG ACATTGGCGA TATCTACGGC TGGCAACCGC GTAGTTATGG CGAGGAGCGC CTACGTCTTG ATGCTGTGAA AGCACCAGGC GTGCGTACGC CAATAAGTCT GGCGGTTCGC AGTGGTGAAA TTGTTGGGCT GTTTGGTCTG GTAGGGGCGG GGCGTAGCGA ATTAATGAAA GGCATGTTTG GCGGGACGCA AATCACCGCC GGTCAGGTTT ATATCGACCA ACAGCCGATC GATATTCGTA AACCGAGCCA CGCCATTGCC GCAGGCATGA TGCTCTGCCC GGAAGATCGC AAAGCGGAAG GCATTATTCC CGTGCACTCC GTTCGCGACA ATATCAACAT CAGTGCCAGA CGTAAACATG TGCTCGGCGG TTGTGTAATC AACAACGGTT GGGAAGAAAA CAATGCCGAT CACCACATTC GTTCGCTCAA CATCAAAACG CCGGGCGCGG AGCAACTGAT CATGAATCTC TCAGGCGGAA ATCAGCAAAA AGCCATTCTG GGCCGCTGGT TATCGGAAGA GATGAAGGTC ATTTTGCTGG ATGAACCTAC GCGCGGCATT GATGTTGGCG CTAAGCACGA AATATATAAC GTAATTTATG CGCTGGCGGC GCAGGGCGTG GCGGTGCTGT TTGCCTCCAG CGACTTACCT GAAGTCCTCG GCGTTGCCGA CCGGATTGTG GTGATGCGGG AAGGTGAAAT CGCCGGTGAA TTGTTACACG AGCAGGCAGA TGAGCGTCAG GCACTGAGCC TTGCGATGCC TAAAGTCAGC CAGGCTGTTG CCTGA
|
Protein sequence | MQQSTPYLSF RGIGKTFPGV KALTDISFDC YAGQVHALMG ENGAGKSTLL KILSGNYAPT TGSVVINGQE MSFSDTTAAL NAGVAIIYQE LHLVPEMTVA ENIYLGQLPH KGGIVNRSLL NYEAGLQLKH LGMDIDPDTP LKYLSIGQWQ MVEIAKALAR NAKIIAFDEP TSSLSAREID NLFRVIRELR KEGRVILYVS HRMEEIFALS DAITVFKDGR YVKTFTDMQQ VDHDALVQAM VGRDIGDIYG WQPRSYGEER LRLDAVKAPG VRTPISLAVR SGEIVGLFGL VGAGRSELMK GMFGGTQITA GQVYIDQQPI DIRKPSHAIA AGMMLCPEDR KAEGIIPVHS VRDNINISAR RKHVLGGCVI NNGWEENNAD HHIRSLNIKT PGAEQLIMNL SGGNQQKAIL GRWLSEEMKV ILLDEPTRGI DVGAKHEIYN VIYALAAQGV AVLFASSDLP EVLGVADRIV VMREGEIAGE LLHEQADERQ ALSLAMPKVS QAVA
|
| |
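The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few of the listed 10-base blocks. This is a minimal sketch, not the database's own pipeline. It assumes the listed gene sequence is already in coding orientation (it begins with ATG and ends with TGA, although the record reports the minus strand), and it uses the standard codon assignments, which translation table 11 shares with the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard genetic code in NCBI "TCAG" order. Translation table 11 uses the
# same codon-to-amino-acid assignments (it differs only in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two blocks of the gene sequence listed in this record.
first_blocks = "ATGCAACAGT CTACCCCGTA TCTCTCATTT"
last_blocks = "CAGGCTGTTG CCTGA"

print(translate(first_blocks))
print(translate(last_blocks))
```

The first call yields the opening residues of the listed protein sequence and the second yields its final four residues followed by the stop marker. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.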