Gene Information |
Locus tag | EcSMS35_3868 |
Symbol | |
ID | 6146825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3936947 |
End bp | 3938155 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641618694 |
Product | major facilitator family transporter |
Protein accession | YP_001745833 |
Protein GI | 170682261 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00890] Oxalate/Formate Antiporter |
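The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 3936947
end_bp = 3938155
gene_length_bp = 1209
protein_length_aa = 402


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the reported gene length (1209 bp) and protein length (402 aa) agree with the coordinates.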
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCTT CAAATTATCA GCGTACTCGC TGGCTGACAC TCATCGGTAC TATCATTACC CAGTTTGCTC TGGGGTCGGT TTATACCTGG AGTCTGTTTA ACGGTGCGCT TTCTGCCAAG CTGGGGGCTC CTGTAAGCCA GGTTGCTTTC TCTTTCGGCT TGTTAAGTCT GGGGCTGGCA ATTTCATCTT CTGTTGCGGG CAAATTGCAG GAACGTTTTG GTGTTAAACG CGTCACTATG GCTTCCGGCA TTTTGCTGGG ATTAGGTTTC TTCCTGACAG CACATTCCAA CAACCTGATG ATGCTGTGGT TAAGCGCCGG TGTGTTGGTT GGTCTGGCGG ATGGTGCGGG TTACCTGTTG ACGCTCTCTA ACTGCGTGAA GTGGTTTCCG GAGCGTAAGG GACTTATCTC TGCTTTTGCT ATCGGTTCTT ATGGTCTGGG CAGCCTGGGT TTTAAATTTA TCGACACGCA CCTGCTCGAA ACGGTGGGTC TGGAAAAAAC CTTTGTTATT TGGGGCGCGA TTGTACTGGT GATGATTGTC TTTGGCGCAA CGTTAATGAA AGATGCGCCG AAGCAGGAAG TGAAAACCAG CAATGGTGTG GTGGAGAAGG ACTACACCCT GGCAGAGTCG ATGCGTAAAC CGCAGTACTG GATGTTAGCG GTTATGTTCC TGACTGCGTG CATGAGTGGT CTGTATGTGA TTGGTGTAGC GAAAGATATC GCTCAAAGTC TGGCGCATCT TGATGCAATT TCCGCAGCCA ATGCTGTGAC GGTTATTTCC ATCGCCAACC TTTCTGGTCG TCTGGTGCTG GGCATTCTGT CTGACAAAAT CGCCCGTATC CGTGTTATCA CCATTGGTCA GGTGATATCG CTGGTGGGTA TGGCGGCCCT GCTGTTTGCA CCATTGAATG CAGTGACGTT CTTTGCAGCG ATTGCCTGCG TGGCGTTTAA CTTTGGCGGC ACTATCACGG TGTTCCCGTC ACTGGTCAGT GAGTTCTTCG GCCTCAATAA CCTGGCGAAA AACTACGGTG TGATTTATCT CGGTTTCGGT ATCGGCAGCA TTTGTGGGTC GATTATCGCC TCACTGTTTG GCGGCTTCTA TGTGACTTTC TACGTCATTT TTGCCCTGCT GATTCTGTCT CTGGCGCTTT CAACGACCAT TCGCCAGCCA GAGCAGAAAG TATTGCGTGA AGCGCATGGC TCCCTTTAA
Protein sequence | MTPSNYQRTR WLTLIGTIIT QFALGSVYTW SLFNGALSAK LGAPVSQVAF SFGLLSLGLA ISSSVAGKLQ ERFGVKRVTM ASGILLGLGF FLTAHSNNLM MLWLSAGVLV GLADGAGYLL TLSNCVKWFP ERKGLISAFA IGSYGLGSLG FKFIDTHLLE TVGLEKTFVI WGAIVLVMIV FGATLMKDAP KQEVKTSNGV VEKDYTLAES MRKPQYWMLA VMFLTACMSG LYVIGVAKDI AQSLAHLDAI SAANAVTVIS IANLSGRLVL GILSDKIARI RVITIGQVIS LVGMAALLFA PLNAVTFFAA IACVAFNFGG TITVFPSLVS EFFGLNNLAK NYGVIYLGFG IGSICGSIIA SLFGGFYVTF YVIFALLILS LALSTTIRQP EQKVLREAHG SL
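The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation and a basic CDS sanity check. The helper names are hypothetical, and the start-codon test is a simplification: translation table 11 also permits alternative start codons, which this sketch does not handle.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Example using only the first two blocks of the gene sequence above.
fragment = clean_sequence("ATGACACCTT CAAATTATCA")
print(len(fragment), gc_fraction(fragment))
```

Passing the full Gene sequence value through `clean_sequence` and then `gc_fraction` gives a figure that can be compared against the GC content field.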