Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4751 |
Symbol | |
ID | 6268878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4434974 |
End bp | 4435939 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641728508 |
Product | hypothetical protein |
Protein accession | YP_001882903 |
Protein GI | 187733831 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.289249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAGCG GCGTGCTGTA CGCCCTGTTA GCAGGGTTGA TGTGGGGGCT TATTTTTGTC GGGCCGTTGA TCGTACCGGA ATACCCGGCG ATGTTGCAGT CGATGGGGCG TTATCTGGCG TTAGGGTTAA TTGCGCTGCC CATTGCCTGG CTGGGACGCG TGCGTCTGCG TCAGTTGGCG CGTCGCGACT GGCTTACCGC CTTGATGCTC ACAATGATGG GCAACCTCAT CTATTACTTC TGCCTTGCCA GTGCCATTCA ACGTACTGGT GCGCCTGTTT CCACGATGAT TATCGGCACC CTGCCGGTGG TGATTCCCGT TTTTGCCAAT CTGCTTTATA GCCAGCGCGA CGGCAAACTC GTGTGGGGAA AACTCGCCCC GGCACTGGTT TGTATTGGCA TCGGCCTGGC GTGTGTGAAT ATTGCTGAGT TAAACCACGG ACTCCCCGAT TTTGACTGGG CACGTTATAC CTCAGGCATC GTGCTAGCGT TAGTTTCCGT GGTCTGCTGG GCATGGTATG CCCTGCGCAA CGCCCGCTGG CTGCGGGAGA ATCCCGACAA ACATCCGATG ATGTGGGCGA CGGCGCAGGC GCTGGTCACA CTGCCGGTTT CTCTCATCGG CTATCTCGTC GCCTGTTACT GGCTGAATAC GCAAACGCCG GACTTCTCCC TACCTTTTGG CCCCCGTCCG CTGGTGTTTA TTAGTCTGAT GGTTGCGATA GCCGTGCTTT GCTCATGGGT TGGCGCACTC TGCTGGAACG TCGCCAGCCA GCGATTACCG ACAGTGATTC TCGGGCCGCT GATCGTTTTC GAAACACTGG CAGGTTTGCT GTACACCTTT TTGATACGCC AGCAAATGCC GCCGCTGATG ACGCTGAGCG GTATCGCGCT GTTAGTGGTT GGCGTGGTCA TTGCAGTCAG AGCAAAACCG GAAAAGCCTT TAACTGAATC TGTCTCAGAA AGTTGA
|
Protein sequence | MISGVLYALL AGLMWGLIFV GPLIVPEYPA MLQSMGRYLA LGLIALPIAW LGRVRLRQLA RRDWLTALML TMMGNLIYYF CLASAIQRTG APVSTMIIGT LPVVIPVFAN LLYSQRDGKL VWGKLAPALV CIGIGLACVN IAELNHGLPD FDWARYTSGI VLALVSVVCW AWYALRNARW LRENPDKHPM MWATAQALVT LPVSLIGYLV ACYWLNTQTP DFSLPFGPRP LVFISLMVAI AVLCSWVGAL CWNVASQRLP TVILGPLIVF ETLAGLLYTF LIRQQMPPLM TLSGIALLVV GVVIAVRAKP EKPLTESVSE S
|
| |