Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4208 |
Symbol | |
ID | 6271119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3932558 |
End bp | 3933895 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641728028 |
Product | inorganic anion transporter, sulfate permease (SulP) family |
Protein accession | YP_001882449 |
Protein GI | 187730803 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAC AACACACAAC CCAGGCTTCT GGCCAGGGGA TGCTGGAACG CGTGTTTAAA CTGCGCGAAC ATGGCACGAC GGCACGGACC GAAGTGATCG CCGGTTTTAC CACCTTCCTG ACGATGGTTT ACATCGTTTT TGTTAACCCA CAAATTCTGG GCGTTGCTGG CATGGACACC AGTGCCGTCT TCGTAACAAC CTGTCTGATT GCTGCTTTCG GTAGCATTAT GATGGGATTG TTTGCAAACC TTCCGGTGGC GCTGGCTCCT GCGATGGGGC TTAACGCCTT TTTCGCTTTT GTGGTTGTCC AGGCAATGGG GCTGCCGTGG CAGATCGGTA TGGGCGCAAT CTTCTGGGGC GCTGTCGGAC TGCTTCTTTT GACCATTTTC CGCGTTCGTT ACTGGATGAT TGCTAACATT CCGGTAAGTC TGCGTGTGGG TATTACCAGC GGTATCGGCC TGTTTATCGG CATGATGGGG CTGAAAAACG CAGGTGTGAT TGTCGCTAAC TCGGAAACGC TGGTGAGCAT CGGTAATCTG ACTTCTCACA GCGTACTTCT GGGTATCCTC GGCTTCTTCA TCATTGCTAT TCTGGCCTCG CGCAACATTC ACGCAGCGGT GCTGGTTTCT ATCGTGGTGA CGACGCTGCT GGGCTGGATG CTGGGTGATG TGCACTACAA TGGCATCGTT TCTGCGCCGC CGAGCGTAAT GACAGTTGTG GGTCATGTAG ATTTAGCCGG GTCGTTTAAC CTCGGGCTGG CAGGGGTGAT TTTCTCTTTC ATGTTGGTCA ACTTGTTTGA CTCCTCCGGT ACGCTGATTG GCGTGACCGA TAAAGCAGGT CTGGCGGATG AGAAGGGGAA ATTCCCGCGC ATGAAGCAGG CGCTGTATGT CGACAGTATC TCTTCCGTGA CCGGTTCGTT TATCGGTACT TCTTCCGTTA CGGCTTATAT TGAGTCCTCT TCCGGCGTAT CGGTTGGCGG TCGTACCGGT CTGACGGCAG TGGTTGTTGG TCTGCTGTTC CTGCTGGTTA TCTTTCTGTC GCCGCTGGCG GGAATGGTGC TAGGCTACGC TGCAGCTGGC GCGTTGATCT ACGTTGGCGT GTTGATGACC TCAAGTCTTG CTCGCGTGAA CTGGCAGGAT CTTACTGAAT CTGTTCCGGC GTTTATTACC GCCGTGATGA TGCCGTTCAG CTTTTCGATT ACCGAAGGTA TTGCGCTGGG CTTTATCTCC TACTGCGTGA TGAATATTGG TACCGGGCGT CTGCGTGACC TTAGCCCGTG CGTAATCATC GTTGCGCTGC TGTTTATCCT GAAGATTGTA TTTATCGACG CTCATTAA
|
Protein sequence | MSQQHTTQAS GQGMLERVFK LREHGTTART EVIAGFTTFL TMVYIVFVNP QILGVAGMDT SAVFVTTCLI AAFGSIMMGL FANLPVALAP AMGLNAFFAF VVVQAMGLPW QIGMGAIFWG AVGLLLLTIF RVRYWMIANI PVSLRVGITS GIGLFIGMMG LKNAGVIVAN SETLVSIGNL TSHSVLLGIL GFFIIAILAS RNIHAAVLVS IVVTTLLGWM LGDVHYNGIV SAPPSVMTVV GHVDLAGSFN LGLAGVIFSF MLVNLFDSSG TLIGVTDKAG LADEKGKFPR MKQALYVDSI SSVTGSFIGT SSVTAYIESS SGVSVGGRTG LTAVVVGLLF LLVIFLSPLA GMVLGYAAAG ALIYVGVLMT SSLARVNWQD LTESVPAFIT AVMMPFSFSI TEGIALGFIS YCVMNIGTGR LRDLSPCVII VALLFILKIV FIDAH
|
| |