Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4223 |
Symbol | |
ID | 5588577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4212034 |
End bp | 4213371 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640927839 |
Product | sulfate permease family inorganic anion transporter |
Protein accession | YP_001465198 |
Protein GI | 157156784 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAC AACACACAAC CCAGGCTTCT GGCCAGGGGA TGCTGGAACG CGTGTTTAAA CTGCGCGAAC ATGGCACGAC GGCACGGACC GAAGTGATCG CCGGTTTTAC CACCTTCCTG ACGATGGTTT ACATCGTTTT TGTTAACCCG CAAATTCTTG GCGTTGCTGG CATGGATACC AGCGCCGTCT TCGTTACTAC CTGTCTGATT GCTGCATTCG GCAGTATTAT GATGGGACTG TTTGCTAACC TGCCAGTTGC ACTGGCACCC GCTATGGGCC TGAATGCATT CTTCGCTTTT GTCGTTGTAC AGGCTATGGG CTTGCCGTGG CAAGTCGGGA TGGGCGCAAT CTTCTGGGGC GCGATTGGTC TGCTGTTACT GACGATTTTC CGCGTTCGTT ACTGGATGAT TGCTAACATT CCGGTAAGTC TGCGTGTGGG CATCACCAGC GGTATCGGTC TGTTCATTGG TATGATGGGA CTGAAGAATG CAGGTGTGAT TGTCGCTAAC CCGGAAACGC TGGTCAGCAT CGGTAACCTG ACTTCTCACA GCGTGCTGCT GGGGATCCTC GGCTTCTTCA TCATCGCTAT TCTGGCCTCG CGCAACATTC ACGCGGCGGT GCTGGTTTCC ATCGTGGTGA CAACGCTGCT GGGCTGGATG CTGGGTGATG TCCACTACAA TGGCATCGTT TCTGCGCCGC CGAGCGTAAT GACTGTCGTG GGCCATGTAG ATTTAGCCGG GTCGTTTAAC CTCGGGCTGG CAGGGGTGAT TTTCTCTTTC ATGCTGGTCA ACCTGTTTGA CTCCTCCGGT ACGCTGATTG GCGTGACCGA TAAAGCAGGC CTGGCGGATG AGAAGGGTAA ATTCCCGCGC ATGAAGCAGG CGCTGTATGT CGACAGTATC TCTTCCGTGA CCGGTTCGTT TATCGGTACT TCTTCCGTTA CGGCTTATAT TGAGTCCTCT TCCGGCGTTT CCGTTGGCGG TCGTACCGGT CTGACGGCAG TGGTTGTTGG TCTGCTGTTC CTGCTGGTTA TCTTTCTGTC GCCGCTGGCG GGGATGGTGC CAGGCTACGC TGCAGCTGGT GCGCTGATTT ACGTTGGCGT ACTGATGACT TCCAGCCTGG CACGCGTGAA CTGGCAGGAT CTTACTGAAT CTGTTCCGGC GTTTATTACC GCCGTGATGA TGCCGTTCAG CTTTTCGATT ACCGAAGGTA TTGCGCTGGG CTTTATCTCC TACTGCGTGA TGAAAATCGG TACCGGACGT CTGCGTGACC TTAGCCCGTG CGTAATCATC GTTGCGCTGC TGTTTATCCT GAAGATTGTC TTTATCGACG CTCACTAA
|
Protein sequence | MSQQHTTQAS GQGMLERVFK LREHGTTART EVIAGFTTFL TMVYIVFVNP QILGVAGMDT SAVFVTTCLI AAFGSIMMGL FANLPVALAP AMGLNAFFAF VVVQAMGLPW QVGMGAIFWG AIGLLLLTIF RVRYWMIANI PVSLRVGITS GIGLFIGMMG LKNAGVIVAN PETLVSIGNL TSHSVLLGIL GFFIIAILAS RNIHAAVLVS IVVTTLLGWM LGDVHYNGIV SAPPSVMTVV GHVDLAGSFN LGLAGVIFSF MLVNLFDSSG TLIGVTDKAG LADEKGKFPR MKQALYVDSI SSVTGSFIGT SSVTAYIESS SGVSVGGRTG LTAVVVGLLF LLVIFLSPLA GMVPGYAAAG ALIYVGVLMT SSLARVNWQD LTESVPAFIT AVMMPFSFSI TEGIALGFIS YCVMKIGTGR LRDLSPCVII VALLFILKIV FIDAH
|
| |