Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2699 |
Symbol | |
ID | 6271954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2500271 |
End bp | 2501449 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641726664 |
Product | hypothetical protein |
Protein accession | YP_001881144 |
Protein GI | 187731249 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCTG TAAGCCAAAC CGAAACACGA TCTTCTGCCA ATTTTTCGCT CTTCCGCATC GCTTTTGCGG TTTTTCTCAC CTACATGACC GTAGGGCTGC CGTTGCCGGT TATCCCGCTG TTTGTTCATC ATGATCTGGG CTATGGCAAT ACCATGGTCG GGATTGCCGT CGGGATTCAG TTTCTGGCTA CGGTGCTGAC GCGTGGCTAT GCCGGGCGAC TGGCCGATCA ATATGGTGCA AAACGTTCGG CGCTTCAGGG GATGTTAGCT TGTGGTCTGG CTGGCGGCGC GTTGCTGCTG GCGGCGATTT TGCCTGTCTC CGCACCGTTG AAATTTGCCC TGTTGGTCGT CGGGCGTTTG ATTCTTGGCT TTGGTGAAAG CCAGTTACTG ACAGGCGCTC TGACCTGGGG ATTAGGCATC GTAGGGGCAA AACACTCTGG CAAAGTGATG TCATGGAACG GAATGGCGAT TTACGGTGCC CTCGCTGTTG GTGCGCCGCT TGGCCTGTTG ATTCATAGCC ATTACGGTTT TGCCGCACTG GCGATCACCA CAATGGTATT ACCCTTACTG GCGTGGGCCT GTAACGGCAC AGTGCGCAAA GTACCGGCCC TGGCGGGAGA ACGTCCATCG CTGTGGAGCG TTGTCGGGCT TATCTGGAAA CCAGGGTTAG GGCTGGCACT ACAAGGCGTT GGTTTTGCGG TTATCGGGAC TTTCGTTTCG CTCTACTTTG CCAGCAAAGG ATGGGCGATG GCGGGCTTTA CTCTTACCGC GTTTGGCGGC GCATTTGTCG TGATGCGCGT CATGTTTGGC TGGATGCCGG ACCGTTTTGG CGGCGTGAAA GTGGCGATTG TCTCTCTGCT TGTAGAAACG GTGGGCTTGT TGCTGCTCTG GCAAGCCCCA GGTGCATGGG TCGCATTAGC GGGCGCGGCG TTAACCGGAG CCGGATGTTC GCTTATCTTT CCTGCGCTGG GCGTGGAGGT GGTTAAACGC GTCCCCTCAC AAGTTCGCGG CACCGCACTG GGCGGTTACG CCGCGTTTCA GGATATCGCC CTCGGCGTCT CCGGGCCGCT GGCGGGAATG CTGGCGACCA CGTTTGGTTA CTCTTCGGTA TTTCTTGCCG GGGCGATCTC TGCGGTGCTG GGTATTATTG TCACGATACT GTCATTTCGT CGGGGTTAA
|
Protein sequence | MTAVSQTETR SSANFSLFRI AFAVFLTYMT VGLPLPVIPL FVHHDLGYGN TMVGIAVGIQ FLATVLTRGY AGRLADQYGA KRSALQGMLA CGLAGGALLL AAILPVSAPL KFALLVVGRL ILGFGESQLL TGALTWGLGI VGAKHSGKVM SWNGMAIYGA LAVGAPLGLL IHSHYGFAAL AITTMVLPLL AWACNGTVRK VPALAGERPS LWSVVGLIWK PGLGLALQGV GFAVIGTFVS LYFASKGWAM AGFTLTAFGG AFVVMRVMFG WMPDRFGGVK VAIVSLLVET VGLLLLWQAP GAWVALAGAA LTGAGCSLIF PALGVEVVKR VPSQVRGTAL GGYAAFQDIA LGVSGPLAGM LATTFGYSSV FLAGAISAVL GIIVTILSFR RG
|
| |