Gene Information |
Locus tag | SbBS512_E3734 |
Symbol | |
ID | 6269884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3458918 |
End bp | 3461008 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727598 |
Product | integral membrane protein, YccS/YhfK family |
Protein accession | YP_001882033 |
Protein GI | 187733272 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR01667] integral membrane protein, YccS/YhfK family |
|
|
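The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (consistent with the listed Gene Length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3458918
END_BP = 3461008


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2091
print(protein_length(length_bp))  # 696
```

Both results agree with the Gene Length (2091 bp) and Protein Length (696 aa) fields in the table.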
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCGCA GACTGATTTA TCACCCCGAT ATCAACTATG CACTTCGACA AACGCTGGTG CTATGTTTGC CCGTGGCCGT TGGGTTAATG CTTGGCGAAT TACGATTCGG TCTGCTCTTC TCTCTCGTTC CTGCCTGTTG CAATATTGCG GGCCTTGATA CGCCTCATAA ACGTTTTTTC AAACGCTTAA TCATTGGTGC GTCGCTGTTT GCCACCTGTA GCTTACTGAC ACAGCTGCTA CTGGCAAAAG ATGTTCCCCT GCCCTTTTTG CTGACCGGAT TAACGCTGGT ACTTGGCGTC ACTGCTGAGT TGGGGCCATT GCACGCAAAA TTGCTTCCCG CATCGCTGCT GGCAGCCATT TTTACCCTCA GTCTGGCGGG ATACATGCCG GTCTGGGAAC CGTTGCTCAT CTATGCGTTG GGCACTCTCT GGTACGGATT GTTTAACTGG TTTTGGTTCT GGATCTGGCG CGAACAACCG CTGCGCGAGT CACTAAGTCT GCTGTACCGT GAACTGGCAG ATTATTGTGA AGCCAAATAC AGCCTGCTTA CCCAGCACAC CGACCCTGAA AAAGCGCTGC CGCCGCTGCT GGTGCGCCAG CAAAAAGCGG TCGATCTAAT TACCCAGTGC TATCAGCAAA TGCATATGCT TTCCGCGCAA AATAATACTG ACTACAAGCG GATGCTGCGT ATTTTCCAGG AGGCGCTGGA TTTACAGGAA CATATTTCGG TCAGTTTGCA TCAGCCGGAA GAGGTGCAAA AGCTGGTCGA GCGTAGCCAT GCGGAAGAAG TTATCCGCTG GAATGCGCAA ACCGTCGCCG CTCGCCTGCG CGTGCTGGCT GATGACATTC TTTACCATCG CCTGCCAACG CGTTTTACGA TGGAAAAGCA AATTGGCGCA CTGGAAAAAA TCGCCCGCCA GCATCCGGAT AATCCGGTTG GGCAATTCTG CTACTGGCAT TTCAGCCGCA TCGCCCGCGT GCTGCGCACC CAAAAACCGC TCTATGCCCG TGACTTACTG GCCGATAAAC AGCGGCGAAT GCCATTACTT CCGGCGCTGA AAAGTTATCT GTCACTAAAG TCTCCGGCGC TACGCAATGC CGGACGACTC AGTGTGATGT TAAGCGTTGC CAGCCTGATG GGCACCGCGC TGCATCTGCC GAAGTCGTAC TGGATCCTGA TGACGGTATT GCTGGTGACA CAAAATGGCT ATGGCGCAAC CCGTCTGAGG ATTGTGAATC GCTCCGTGGG AACCGTGGTC GGGTTAATCA TTGCGGGCGT GGCGCTGCAC TTTAAAATTC CCGAAGGTTA CACCCTGACG TTGATGCTGA TTACCACCCT CGCCAGCTAC CTGATATTGC GCAAAAACTA CGGCTGGGCG ACGGTCGGTT TTACTATTAC CGCAGTGTAT ACCCTGCAAC TATTGTGGTT GAACGGCGAG CAATACATCC TTCCGCGTCT TATCGATACC ATTATTGGTT GTTTAATTGC TTTCGGCGGT ACTGTCTGGC TGTGGCCGCA GTGGCAGAGC GGGTTATTGC GTAAAAACGC CCATGACGCT TTAGAAGCCT ATCAGGAAGC GATTCGCTTG ATTCTTAGCG AGGATCCGCA ACCTACGCCA CTGGCCTGGC AGCGAATGCG GGTAAATCAG GCACATAACA CTCTGTATAA CTCATTGAAT CAGGCGATGC AGGAACCGGC GTTTAACAGC CATTATCTGG CAGATATGAA ACTGTGGGTA ACGCACAGCC AGTTTATTGT TGAGCATATT AATGCCATGA CCACGCTGGC GCGGGAACAC 
CGGGCATTGC CACCTGAACT GGCACAAGAG TATTTACAGT CTTGTGAAAT CGCCATTCAG CGTTGTCAGC AGCGACTGGA GTATGACGAA CCGGGTAGTT CTGGCGATGC CAATATCATG GATGCGCCGG AGATGCAGCC GCACGAAGGC GCGGCAGGTA CGCTGGAGCA GCATTTACAG CGGGTTATTG GTCATCTGAA CACCATGCAC ACCATTTCGT CGATGGCATG GCGTCAGCGA CCGCATCACG GGATTTGGCT GAGTCGCAAG TTGCGGGATT CGAAGGCGTA A
|
Protein sequence | MWRRLIYHPD INYALRQTLV LCLPVAVGLM LGELRFGLLF SLVPACCNIA GLDTPHKRFF KRLIIGASLF ATCSLLTQLL LAKDVPLPFL LTGLTLVLGV TAELGPLHAK LLPASLLAAI FTLSLAGYMP VWEPLLIYAL GTLWYGLFNW FWFWIWREQP LRESLSLLYR ELADYCEAKY SLLTQHTDPE KALPPLLVRQ QKAVDLITQC YQQMHMLSAQ NNTDYKRMLR IFQEALDLQE HISVSLHQPE EVQKLVERSH AEEVIRWNAQ TVAARLRVLA DDILYHRLPT RFTMEKQIGA LEKIARQHPD NPVGQFCYWH FSRIARVLRT QKPLYARDLL ADKQRRMPLL PALKSYLSLK SPALRNAGRL SVMLSVASLM GTALHLPKSY WILMTVLLVT QNGYGATRLR IVNRSVGTVV GLIIAGVALH FKIPEGYTLT LMLITTLASY LILRKNYGWA TVGFTITAVY TLQLLWLNGE QYILPRLIDT IIGCLIAFGG TVWLWPQWQS GLLRKNAHDA LEAYQEAIRL ILSEDPQPTP LAWQRMRVNQ AHNTLYNSLN QAMQEPAFNS HYLADMKLWV THSQFIVEHI NAMTTLAREH RALPPELAQE YLQSCEIAIQ RCQQRLEYDE PGSSGDANIM DAPEMQPHEG AAGTLEQHLQ RVIGHLNTMH TISSMAWRQR PHHGIWLSRK LRDSKA
|
| |
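The Gene sequence and Protein sequence fields can be cross-checked with a short translation routine. The sketch below is a hedged illustration, not the method used to produce this record: it builds the codon map from the compact NCBI amino-acid string (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons), strips the spaces that split the sequence into blocks of ten, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 until the first stop codon or the end of input."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the Gene sequence field above.
first_blocks = "ATGTGGCGCA GACTGATTTA TCACCCCGAT"
print(translate(first_blocks))  # MWRRLIYHPD
```

The output matches the first ten residues of the Protein sequence field. Passing the full Gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.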