Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3587 |
Symbol | |
ID | 6066270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3920782 |
End bp | 3921960 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641603004 |
Product | sugar efflux transporter |
Protein accession | YP_001726528 |
Protein GI | 170021574 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00899] sugar efflux transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000416053 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCTGGA TAATGACGAT GGCTCGCCGT ATGAACGGTG TTTACGCGGC ATTTATGCTG GTCGCTTTTA TGATGGGGGT GGCCGGGGCG CTACAGGCTC CTACATTGAG CTTATTTCTG AGTCGTGAGG TTGGCGCGCA ACCTTTCTGG ATCGGACTCT TTTATACGGT GAATGCTATT GCTGGGATCG GCGTAAGCCT CTGGTTGGCA AAACGTTCTG ACAGTCAGGG CGATCGGCGA AAACTGATTA TATTTTGCTG TTTGATGGCT ATCGGCAATG CGCTATTGTT TGCATTTAAT CGTCATTATC TGACGCTTAT CACCTGTGGT GTGCTTCTGG CATCTCTGGC CAATACGGCA ATGCCACAGT TATTTGCTCT GGCGCGGGAA TATGCGGATA ACTCGGCGCG AGAAGTGGTG ATGTTTAGCT CGGTGATGCG TGCGCAGCTT TCTCTGGCAT GGGTTATCGG TCCACCGTTG GCCTTTATGC TGGCGTTGAA TTACGGCTTT ACGGTGATGT TTTCGATTGC CGCCGGGATA TTCACACTCA GTCTGGTATT GATTGCATTT ATGCTTCCGT CTGTGGCGCG GGTAGAACTG CCGTCGGAAA ATGCTTTATC AATGCAAGGT GGCTGGCAGG ATAGTAACGT ACGGATGTTA TTTGTCGCCT CGACGTTAAT GTGGACCTGC AACACCATGT ACATTATTGA TATGCCGTTG TGGATCAGTA GCGAGTTAGG ATTGCCAGAC AAACTGGCGG GTTTCCTGAT GGGGACGGCA GCTGGACTGG AAATACCAGC AATGATTCTG GCTGGCTACT ATGTCAAACG TTATGGTAAG CGGCGAATGA TGGTCATAGC AGTGGCGGCA GGAGTACTGT TTTACACCGG ATTGATTTTA TTTCATAGCC GTCTGGCGTT GATGACGCTG CAACTTTTTA ACGCTGTATT TATCGGCATT GTTGCGGGTA TTGGGATGCT ATGGTTTCAG GATTTAATGC CTGGAAGAGC GGGGGCAGCT ACCACCTTAT TTACTAACAG TATTTCTACC GGGGTAATTC TGGCTGGCGT TATTCAGGGA GCAATTGCAC AAAGTTGGGG GCACTTTGCT GTCTACTGGA TAATTGCGGT TATTTCTGTT GTCGCATTAT TTTTAACCGC AAAGGTTAAA GACATTTGA
|
Protein sequence | MIWIMTMARR MNGVYAAFML VAFMMGVAGA LQAPTLSLFL SREVGAQPFW IGLFYTVNAI AGIGVSLWLA KRSDSQGDRR KLIIFCCLMA IGNALLFAFN RHYLTLITCG VLLASLANTA MPQLFALARE YADNSAREVV MFSSVMRAQL SLAWVIGPPL AFMLALNYGF TVMFSIAAGI FTLSLVLIAF MLPSVARVEL PSENALSMQG GWQDSNVRML FVASTLMWTC NTMYIIDMPL WISSELGLPD KLAGFLMGTA AGLEIPAMIL AGYYVKRYGK RRMMVIAVAA GVLFYTGLIL FHSRLALMTL QLFNAVFIGI VAGIGMLWFQ DLMPGRAGAA TTLFTNSIST GVILAGVIQG AIAQSWGHFA VYWIIAVISV VALFLTAKVK DI
|
| |