Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3725 |
Symbol | |
ID | 6064413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4078309 |
End bp | 4079535 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603142 |
Product | major facilitator transporter |
Protein accession | YP_001726662 |
Protein GI | 170021708 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACGTT TTTTTGCCCG CCATGCCGCC ACGCTGTTTT TCCCGATGGC GTTGATTTTG TATGACTTTG CCGCGTACCT GTCGACGGAT CTGATCCAGC CGGGGATCAT TAATGTGGTC CGTGATTTTA ATGCCGATGT CAGTCTGGCC CCGGCTGCCG TCAGTCTCTA TCTCGCTGGC GGTATGGCGT TACAGTGGCT GCTGGGGCCG CTTTCCGACA GGATTGGCCG CAGGCCGGTG CTGATTACCG GGGCGCTTAT TTTTACCCTT GCCTGCGCCG CGACAATGTT CACAACGTCT ATGACACAGT TTCTTATCGC GCGTGCAATT CAGGGTACCA GTATCTGTTT TATTGCTACC GTTGGTTATG TCACGGTGCA GGAAGCCTTT GGGCAGACGA AAGCCATTAA ACTGATGGCG ATTATTACCT CCATTGTGTT GCTGGCCCCC ATAATCGGCC CTCTTTCCGG CGCGGCGTTA ATGCACTTTG TCCACTGGAA AGTGCTTTTT GCCATTATTG CGGCGATGGG GCTTGTCTCG TTTGTCGGCC TGTTGTTGGC GATGCCAGAA ACGGTGAAGC GTGGTGCGGT TCCGTTCAGC GCTGTTAGCG TACTGCGCGA TTTTCGTAAC GTTTTTTGCA ACCGGCTGTT CCTCTTTGGC GCGGCAACCA TCGCCCTCAG CTACATCCCC ATGATGAGTT GGGTTGCGGT GTCACCGGTT ATCCTGATCG ACGCGGGCGG CTTAACGACT TCGCAATTTG CCTGGACACA GGTTCCGGTC TTCGGCGCGG TGATTGTCGC CAACACCATC GTGGCGCGAT TTATCAAAGA TCCGACCGAA CCGCGGTTTA TCTGGCGTGC CGTGCCCATT CAACTGGTCG GCCTCGCGCT ATTGATTGTC GGTAATCTGC TGTCGCCGCA CGTCTGGCTG TGGTCGGTGC TTGGTACCAG CCTGTACGCT TTTGGTATTG GTTTGATTTT CCCGTCGCTA TACCGCTTTA CGCTGTTTTC GAACAACCTG CCGAAGGGCA CCGTGTCGGC ATCGCTGAAT ATCGTGGTTT TAACGGTGAT GTCGGTCTCG GTCGAAATTG GTCGCTGGCT GTGGTTTAAC GGCGGTCGTT TGCCGTTTCA TCTGCTGGCG GTAGTTGCGG GGCTGTTTGT TGTGTTAACG CTGGCGGGGT TACTGAAACG CGTGCAGTTG CATCAGGCAA GCGAACTGGC GGTATAG
|
Protein sequence | MPRFFARHAA TLFFPMALIL YDFAAYLSTD LIQPGIINVV RDFNADVSLA PAAVSLYLAG GMALQWLLGP LSDRIGRRPV LITGALIFTL ACAATMFTTS MTQFLIARAI QGTSICFIAT VGYVTVQEAF GQTKAIKLMA IITSIVLLAP IIGPLSGAAL MHFVHWKVLF AIIAAMGLVS FVGLLLAMPE TVKRGAVPFS AVSVLRDFRN VFCNRLFLFG AATIALSYIP MMSWVAVSPV ILIDAGGLTT SQFAWTQVPV FGAVIVANTI VARFIKDPTE PRFIWRAVPI QLVGLALLIV GNLLSPHVWL WSVLGTSLYA FGIGLIFPSL YRFTLFSNNL PKGTVSASLN IVVLTVMSVS VEIGRWLWFN GGRLPFHLLA VVAGLFVVLT LAGLLKRVQL HQASELAV
|
| |