Gene Information |
Locus tag | EcSMS35_4029 |
Symbol | |
ID | 6142883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4115893 |
End bp | 4117239 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641618854 |
Product | sulfate permease family inorganic anion transporter |
Protein accession | YP_001745992 |
Protein GI | 170683707 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
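The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the stop codon counts toward the gene length but not the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 4115893
end_bp = 4117239
reported_gene_length = 1347      # bp
reported_protein_length = 448    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under those assumptions both comparisons hold for this record.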
| |
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.607592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAAA AAATGAATAA TGACAATACC GATTACGTGA GTAATGAATC AGGGACGCTT TCGCGATTAT TTAAACTACC TCAGCATGGG ACCACCGTCC GCACAGAATT GATTGCGGGG ATGACCACTT TTTTAACCAT GGTGTACATC GTTTTTGTGA ACCCGCAAAT CCTCGGCGCG GCACAAATGG ACCCGAAAGT GGTGTTTGTT ACCACCTGTT TGATTGCCGG TATCGGCAGT ATTGCGATGG GGATATTTGC TAACTTACCC GTGGCGCTGG CTCCGGCAAT GGGGCTGAAC GCCTTCTTTG CGTTCGTGGT CGTGGGGGCG ATGGGCATCT CCTGGCAGAC CGGGATGGGC GCGATATTCT GGGGCGCAGT TGGGCTATTT CTGCTCACGC TGTTTCGTAT CCGGTACTGG ATGATCTCCA ACATTCCATT AAGTTTACGT ATTGGTATCA CCAGCGGAAT CGGATTATTT ATCGCCTTAA TGGGATTAAA AAATACTGGC GTTATTGTCG CCAATAAAGA CACCCTGGTG ATGATTGGCG ATTTAAGTTC TCACGGCGTG TTGTTAGGTA TTTTAGGGTT TTTTATTATA ACCGTGTTGT CATCACGTCA TTTTCATGCC GCGGTGCTTG TTTCTATTGT GGTGACGTCT TGCTGTGGAT TATTTTTCGG TGATGTTCAT TTTAGCGGCG TCTATTCCAT TCCGCCTGAT ATTAGCGGCG TCATTGGTGA AGTAGATTTG AGCGGCGCGT TAACACTTGA ACTCGCCGGT ATCATTTTCT CCTTCATGCT GATCAACCTA TTTGATTCAT CAGGAACATT AATTGGTGTA ACTGATAAAG CTGGCTTAAT AGATAGTAAC GGTAAATTCC CCAATATGAA TAAGGCGCTA TATGTTGATA GCGTCAGTTC GGTGGCGGGC GCGTTTATCG GCACCTCGTC TGTTACCGCC TATATTGAAA GTACTTCTGG TGTGGCAGTC GGTGGTCGCA CGGGGCTGAC TGCGGTTGTG GTCGGCGTTA TGTTCCTGTT GGTTATGTTC TTCTCACCGC TGGTGGCGAT GGTTCCTCCT TACGCAACCG CCGGAGCGTT AATCTTTGTT GGCGTGCTGA TGACTTCGAG CCTGGCGCGC GTTAACTGGG ATGATTTTAC CGAATCGGTG CCTGCGTTTA TTACCACGGT GATGATGCCC TTTACTTTCT CGATCACCGA AGGGATTGCA CTCGGCTTTA TGTCGTACTG CATCATGAAA GTATGCACCG GGCGCTGGCG CGATCTGAAC CTGTGTGTGG TGGTGGTCGC AGCTCTGTTT GCACTGAAGA TTATTCTGGT GGATTAG
Protein sequence | MDKKMNNDNT DYVSNESGTL SRLFKLPQHG TTVRTELIAG MTTFLTMVYI VFVNPQILGA AQMDPKVVFV TTCLIAGIGS IAMGIFANLP VALAPAMGLN AFFAFVVVGA MGISWQTGMG AIFWGAVGLF LLTLFRIRYW MISNIPLSLR IGITSGIGLF IALMGLKNTG VIVANKDTLV MIGDLSSHGV LLGILGFFII TVLSSRHFHA AVLVSIVVTS CCGLFFGDVH FSGVYSIPPD ISGVIGEVDL SGALTLELAG IIFSFMLINL FDSSGTLIGV TDKAGLIDSN GKFPNMNKAL YVDSVSSVAG AFIGTSSVTA YIESTSGVAV GGRTGLTAVV VGVMFLLVMF FSPLVAMVPP YATAGALIFV GVLMTSSLAR VNWDDFTESV PAFITTVMMP FTFSITEGIA LGFMSYCIMK VCTGRWRDLN LCVVVVAALF ALKIILVD
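The relationship between the two sequence fields can be sketched in a few lines of Python. This is an illustrative example rather than the tool the database used. It assumes the gene sequence is displayed 5' to 3' in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and it relies on translation table 11 sharing its codon assignments with the standard genetic code, differing only in the permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
# Translation table 11 uses the same codon-to-residue assignments as the
# standard code; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
gene_start = "ATGGACAAAA AAATGAATAA TGACAATACC"
print(translate(gene_start))  # MDKKMNNDNT
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.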