Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4081 |
Symbol | |
ID | 6147165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4171940 |
End bp | 4173277 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618905 |
Product | sulfate permease family inorganic anion transporter |
Protein accession | YP_001746043 |
Protein GI | 170683636 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.578449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAAC AACACACAAC CCAGGCTTCT GGCCAGGGGA TGCTGGAACG CGTGTTTAAA CTGCGCGAAC ATGGCACGAC GGCACGGACC GAAGTGATCG CCGGTTTTAC CACCTTCCTG ACGATGGTTT ACATCGTTTT TGTTAACCCG CAAATTCTTG GCGTTGCTGG CATGGATACC AGCGCCGTCT TCGTCACTAC CTGTCTGATT GCTGCATTCG GCAGTATTAT GATGGGACTG TTTGCTAACC TGCCAGTTGC ATTGGCACCC GCTATGGGCC TGAATGCGTT CTTCGCTTTT GTGGTTGTGC AGGCGATGGG CTTGCCGTGG CAAGTCGGGA TGGGCGCAAT CTTCTGGGGC GCGATTGGTC TACTGTTACT GACGATTTTC CGCGTTCGCT ACTGGATGAT AGCCAACATT CCGGTAAGTC TACGTGTGGG CATCACCAGT GGTATCGGTC TGTTTATTGG CATGATGGGG CTGAAAAACG CAGGTGTGAT TGTCGCTAAC CCGGAAACGC TGGTGAGCAT CGGTAACCTG ACTTCTCACA GCGTGCTGCT GGGGATCCTC GGCTTCTTCA TCATCGCCAT TCTGGCCTCG CGCAATATTC ACGCGGCGGT GCTGGTTTCC ATCGTGGTGA CAACGCTGCT GGGCTGGATG CTGGGTGATG TCCACTACAA CGGCATCGTT TCTGCGCCGC CGAGCGTAAT GACTGTCGTG GGTCATGTAG ATTTAGCCGG GTCGTTTAAC CTCGGGCTGG CAGGGGTGAT TTTCTCTTTC ATGCTGGTCA ACTTGTTTGA CTCCTCCGGT ACGCTGATTG GCGTGACCGA TAAAGCAGGT CTGGCTGATG ATAAAGGTAA ATTCCCGCGC ATGAAGCAGG CGCTGTATGT CGACAGCATC TCTTCGGTGA CCGGTTCGTT TATCGGTACT TCTTCTGTTA CGGCGTATAT TGAATCTTCT TCCGGCGTTT CCGTTGGCGG TCGTACCGGT CTGACGGCAG TGGTTGTTGG TCTGCTGTTC CTGCTGGTTA TCTTCCTGTC GCCGCTGGCA GGGATGGTGC CAGGCTACGC TGCAGCTGGT GCGCTGATTT ACGTTGGCGT GTTGATGACC TCAAGTCTTG CTCGCGTGAA CTGGCAGGAT CTTACTGAAT CTGTTCCGGC GTTTATTACC GCTGTGATGA TGCCGTTCAG CTTCTCGATT ACTGAAGGTA TCGCGCTGGG CTTTATCTCC TACTGCGTGA TGAAGATTGG TACCGGACGT CTGCGTGACC TTAGCCCGTG CGTAATCATC GTTGCGCTGC TGTTTATCCT GAAGATTGTG TTTATCGACG CTCACTAA
|
Protein sequence | MSQQHTTQAS GQGMLERVFK LREHGTTART EVIAGFTTFL TMVYIVFVNP QILGVAGMDT SAVFVTTCLI AAFGSIMMGL FANLPVALAP AMGLNAFFAF VVVQAMGLPW QVGMGAIFWG AIGLLLLTIF RVRYWMIANI PVSLRVGITS GIGLFIGMMG LKNAGVIVAN PETLVSIGNL TSHSVLLGIL GFFIIAILAS RNIHAAVLVS IVVTTLLGWM LGDVHYNGIV SAPPSVMTVV GHVDLAGSFN LGLAGVIFSF MLVNLFDSSG TLIGVTDKAG LADDKGKFPR MKQALYVDSI SSVTGSFIGT SSVTAYIESS SGVSVGGRTG LTAVVVGLLF LLVIFLSPLA GMVPGYAAAG ALIYVGVLMT SSLARVNWQD LTESVPAFIT AVMMPFSFSI TEGIALGFIS YCVMKIGTGR LRDLSPCVII VALLFILKIV FIDAH
|
| |