Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3356 |
Symbol | |
ID | 6142707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3433542 |
End bp | 3435005 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618185 |
Product | anion transporter |
Protein accession | YP_001745335 |
Protein GI | 170681701 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000525621 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTT CCACTGAATG GTGGCGATAC CTTGCGCCGC TGGCGGTCAT CGCCATTATT GCTCTAATTC CGGTTCCCGC AGGGCTGGAG AGTCATACCT GGCTCTACTT TGCTGTTTTT ACTGGCGTGA TCGTTGGACT GATCCTCGAA CCCGTGCCGG GTGCCGTGGT GGCGATGGTG GGTATCTCCA TTATCGCCAT TCTCTCTCCC TGGCTGCTGT TCAGCCCGGA GCAGCTCGCC CAGCCAGGTT TTAAGTTCAC CGCCAAATCC CTCTCCTGGG CGGTTTCTGG TTTTTCTAAT TCGGTTATCT GGCTGATTTT CGCCGCCTTT ATGTTTGGCA CAGGCTATGA AAAAACCGGG CTGGGGCGAC GTATCGCGCT GATTCTGGTG AAAAAGATGG GGCATCGCAC GCTGTTTCTC GGCTATGCGG TGATGTTCTC CGAGCTTATC CTGGCACCTG TAACACCGTC CAACTCGGCG CGTGGTGCGG GGATTATCTA TCCCATCATC CGTAACCTGC CACCGCTCTA TCAATCACAA CCAAACGACA GCAGTTCGCG CAGCATTGGC TCGTACATCA TGTGGATGGG GATTGTTGCC GACTGCGTGA CCAGCGCCAT TTTCCTGACG GCGATGGCAC CTAACTTGCT GTTAATTGGC CTGATGAAAA GCGCATCTCA CGCCACACTG AGTTGGGGCG ACTGGTTCCT CGGGATGTTG CCGCTCAGTA TTTTACTGGT TCTGCTGGTT CCCTGGCTGG CTTACGTGCT GTACCCGCCG GTACTGAAGT CTGGTGATCA GGTGCCGCGC TGGGCAGAGA CGGAACTGCA GGCAATGGGC CCGCTCTGTT CGCGTGAAAA ACGGATGCTG GGGCTGATGG TAGGCGCGCT GGTGCTGTGG ATTTTCGGCG GTGATTATAT CGATGCCGCG ATGGTCGGTT ACAGCGTGGT GGCGCTGATG CTGCTTCTGC GCATTATCAG TTGGGACGAC ATTGTCAGTA ATAAAGCCGC GTGGAACGTT TTCTTCTGGC TGGCCTCGCT TATCACCCTC GCTACCGGAC TCAACAACAC CGGTTTTATT AGCTGGTTTG GCAAACTGTT AGCAGGCAGC TTAAGCGGTT ATTCGCCAAC GATAGTGATG GTGGCGTTGA TTGTGGTGTT TTATCTACTG CGCTACTTTT TCGCCAGCGC CACGGCGTAT ACCTCCGCTC TCGCGCCGAT GATGATTGCC GCCGCGCTGG CGATGCCGGA AATCCCGCTG CCGGTATTCT GCCTGATGGT TGGCGCGGCA ATTGGTCTGG GGAGCATTCT TACGCCATAC GCCACCGGAC CCAGCCCGAT TTACTACGGT AGTGGTTATC TGCCAACGGT GGATTACTGG CGACTGGGGG CAATTTTTGG GCTGATATTC CTCGTATTGC TGGTGATTAC CGGCTTACTG TGGATGCCCG TGGTGTTGCT TTAA
|
Protein sequence | MKPSTEWWRY LAPLAVIAII ALIPVPAGLE SHTWLYFAVF TGVIVGLILE PVPGAVVAMV GISIIAILSP WLLFSPEQLA QPGFKFTAKS LSWAVSGFSN SVIWLIFAAF MFGTGYEKTG LGRRIALILV KKMGHRTLFL GYAVMFSELI LAPVTPSNSA RGAGIIYPII RNLPPLYQSQ PNDSSSRSIG SYIMWMGIVA DCVTSAIFLT AMAPNLLLIG LMKSASHATL SWGDWFLGML PLSILLVLLV PWLAYVLYPP VLKSGDQVPR WAETELQAMG PLCSREKRML GLMVGALVLW IFGGDYIDAA MVGYSVVALM LLLRIISWDD IVSNKAAWNV FFWLASLITL ATGLNNTGFI SWFGKLLAGS LSGYSPTIVM VALIVVFYLL RYFFASATAY TSALAPMMIA AALAMPEIPL PVFCLMVGAA IGLGSILTPY ATGPSPIYYG SGYLPTVDYW RLGAIFGLIF LVLLVITGLL WMPVVLL
|
| |