Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3237 |
Symbol | |
ID | 6066789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3547162 |
End bp | 3548346 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641602652 |
Product | MFS transport protein AraJ |
Protein accession | YP_001726186 |
Protein GI | 170021232 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.691392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00106159 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAG TCATTTTATC TTTGGCTCTG GGCACGTTTG GTTTGGGGAT GGCCGAATTT GGCATTATGG GCGTGCTCAC GGAGCTGGCG CATAACGTAG GAATTTCGAT TCCTGCCGCC GGGCATATGA TCTCGTATTA TGCACTGGGG GTGGTGGTCG GTGCGCCAAT CATCGCACTC TTTTCCAGCC GCTACTCACT CAAACATATC TTGTTGTTTC TGGTGGCGTT GTGCGTCATT GGCAACGCCA TGTTCACGCT CTCTTCGTCT TACCTGATGC TCGCCATTGG TCGGCTGGTA TCCGGCTTTC CGCATGGCGC ATTTTTTGGC GTCGGAGCGA TCGTGTTATC AAAAATTATC AAACCCGGAA AAGTCACCGC CGCCGTGGCG GGGATGGTTT CCGGGATGAC AGTCGCCAAT TTGCTGGGCA TTCCGCTGGG AACGTATTTA AGTCAGGAAT TTAGCTGGCG TTACACCTTT TTATTGATCG CTGTTTTTAA TATTGCGGTG ATGGCATCGG TCTATTTTTG GGTGCCAGAT ATTCGCGACG AGGCGAAAGG AAATCTGCGC GAACAATTTC ACTTTTTGCG CAGCCCGGCC CCGTGGTTAA TTTTCGCCGC CACGATGTTT GGCAACGCAG GTGTGTTTGC CTGGTTCAGC TACGTAAAGC CATACATGAT GTTTATTTCC GGTTTTTCGG AAACGGCGAT GACCTTTATT ATGATGTTAG TTGGGCTAGG GATGGTGCTG GGAAATATGC TAAGTGGCAG GATTTCAGGA CGTTATTCAC CACTGCGCAT TGCAGCAGTG ACTGACTTTA TAATTGTACT GGCACTGCTG ATGCTCTTTT TCTGCGGCGG CATGAAAACA ACGTCGCTTA TTTTTGCTTT TATTTGTTGC GCGGGATTAT TTGCCCTTTC AGCACCGCTA CAAATATTGT TACTACAAAA CGCCAAAGGC GGAGAGTTAT TAGGTGCCGC AGGTGGGCAA ATAGCGTTTA ACCTCGGTAG CGCCGTCGGC GCATATTGCG GAGGTATGAT GCTGACGCTG GGGCTGGCAT ATAATTACGT GGCGCTGCCT GCCGCCCTGC TTTCGTTTGC TGCGATGTCG TCGTTGCTGC TGTATGGTCG CTATAAGCGC CAGCAAGCGG CGGATACTCC GGTGCTGGCG AAACCACTGG GGTAG
|
Protein sequence | MKKVILSLAL GTFGLGMAEF GIMGVLTELA HNVGISIPAA GHMISYYALG VVVGAPIIAL FSSRYSLKHI LLFLVALCVI GNAMFTLSSS YLMLAIGRLV SGFPHGAFFG VGAIVLSKII KPGKVTAAVA GMVSGMTVAN LLGIPLGTYL SQEFSWRYTF LLIAVFNIAV MASVYFWVPD IRDEAKGNLR EQFHFLRSPA PWLIFAATMF GNAGVFAWFS YVKPYMMFIS GFSETAMTFI MMLVGLGMVL GNMLSGRISG RYSPLRIAAV TDFIIVLALL MLFFCGGMKT TSLIFAFICC AGLFALSAPL QILLLQNAKG GELLGAAGGQ IAFNLGSAVG AYCGGMMLTL GLAYNYVALP AALLSFAAMS SLLLYGRYKR QQAADTPVLA KPLG
|
| |