Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4121 |
Symbol | |
ID | 6144620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4218984 |
End bp | 4220411 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641618946 |
Product | DHA2 family drug:H+ antiporter-1 |
Protein accession | YP_001746084 |
Protein GI | 170679741 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00185105 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.108904 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATA AAAAGAAGCG CAGTATGGCG GGTTTGCCGT GGATCGCGGC GATGGCCTTC TTCATGCAGG CACTTGATGC CACTATTCTG AATACCGCCT TACCCGCAAT CGCTCATAGC CTTAATCGTT CTCCTCTCGC GATGCAATCA GCCATCATCA GTTATACGCT GACGGTGGCG ATGCTTATTC CGGTAAGCGG ATGGCTAGCC GATCGCTTCG GTACGCGTCG CATATTTACC CTTGCCGTGA GTCTGTTCAC GTTGGGTTCT CTGGCCTGCG CACTTTCTAA CTCGCTACCA CAGCTGGTTG TCTTCCGGGT TATTCAAGGG ATAGGCGGCG CAATGATGAT GCCTGTTGCT CGGCTGGCCT TACTGCGCGC TTATCCTCGT AATGAGCTTC TTCCTGTATT GAATTTTGTC GCCATGCCGG GTCTGGTGGG GCCAATTTTA GGCCCCGTTC TTGGCGGCGT GCTGGTCACC TGGGCAACCT GGCACTGGAT ATTTTTAATC AATATCCCCA TAGGTATTGC GGGCCTTCTT TACGCGCGCA AACATATGCC CAATTTCACC ACCGCACGAC GCAGATTCGA TATCACTGGC TTTTTGCTGT TTGGCCTTAG CCTTGTTCTC TTCTCAAGCG GAATAGAGCT ATTCGGGGAA AAGATTGTCG CCAGTTGGAT TGCCTTGACG GTAATTGTCA CCAGCATCGG GTTACTGCTT CTCTATATTC TCCATGCACG ACGCACGCCA AACCCATTAA TTTCATTAGA TTTATTTAAA ACCCGCACTT TCTCGATCGG TATCGTAGGC AATATTGCAA CCCGTCTGGG GACCGGTTGT GTACCGTTCC TTATGCCATT GATGTTACAG GTAGGATTTG GTTATCAGGC GTTTATTGCC GGCTGTATGA TGGCACCGAC AGCGTTAGGT TCCATTATTG CAAAATCGAT GGTTACCCAA GTCTTACGTC GTCTGGGCTA TCGCCATACG TTAGTGGGGA TCACGGTGAT TATTGGGCTA ATGATCGCTC AGTTCTCTTT GCAATCACCG GCAATGGCGA TATGGATGCT GATCTTGCCG TTGTTTATAT TAGGGATGGC TATGTCGACG CAATTTACCG CGATGAATAC CATCACACTT GCCGATCTGA CCGATGACAA CGCCAGCAGC GGTAACAGTG TTCTGGCGGT CACGCAGCAA CTGTCTATCA GTTTAGGCGT TGCTGTAAGT GCGGCCGTCC TTCGCGTTTA TGAAGGGATG GAAGGCACAA CGACTGTCGA ACAATTCCAC TATACGTTTA TCACGATGGG CATTATTACT GTTGCTTCAG CAGCAATGTT CATGCTTCTG AAAACAACCG ATGGTAATAA TTTGATCAAA AGACAGCGTA AATCTAAGCC GAACCGCGTT CCATCAGAAT CGGAGTAA
|
Protein sequence | MSDKKKRSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA MLIPVSGWLA DRFGTRRIFT LAVSLFTLGS LACALSNSLP QLVVFRVIQG IGGAMMMPVA RLALLRAYPR NELLPVLNFV AMPGLVGPIL GPVLGGVLVT WATWHWIFLI NIPIGIAGLL YARKHMPNFT TARRRFDITG FLLFGLSLVL FSSGIELFGE KIVASWIALT VIVTSIGLLL LYILHARRTP NPLISLDLFK TRTFSIGIVG NIATRLGTGC VPFLMPLMLQ VGFGYQAFIA GCMMAPTALG SIIAKSMVTQ VLRRLGYRHT LVGITVIIGL MIAQFSLQSP AMAIWMLILP LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAVS AAVLRVYEGM EGTTTVEQFH YTFITMGIIT VASAAMFMLL KTTDGNNLIK RQRKSKPNRV PSESE
|
| |