Gene Information |
Locus tag | EcSMS35_3640 |
Symbol | |
ID | 6145814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3698933 |
End bp | 3701023 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618467 |
Product | YccS/YhfK family integral membrane protein |
Protein accession | YP_001745607 |
Protein GI | 170682476 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR01667] integral membrane protein, YccS/YhfK family |
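
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the CDS ends in a single stop codon. The function names `span_length` and `coding_length_aa` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 3698933
END_BP = 3701023
GENE_LENGTH_BP = 2091
PROTEIN_LENGTH_AA = 696


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 2091
print(coding_length_aa(GENE_LENGTH_BP))  # 696
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption for this record.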
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.000149202 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTGGCGCA GACTGATTTA TCACCCCGAT ATTAACTATG CACTGCGACA AACGCTGGTG CTATGTTTGC CCGTGGCCGT TGGGTTAATG CTTGGCGAAT TACGATTCGG TCTGCTCTTC TCCCTCGTTC CTGCCTGTTG CAATATTGCG GGCCTTGATA CGCCTCATAA ACGTTTTTTC AAACGCTTAA TCATTGGTGC GTCGCTGTTT GCCACCTGTA GCTTGCTGAC ACAGGTGCTA CTGGCAAAAG ATGTTCCCCT GCCCTTTTTG CTGACCGGAT TAACGCTGGT ACTTGGCGTC ACTGCTGAGC TGGGGCCATT GCACGCAAAA TTGCTTCCCG CATCGCTGCT CGCCGCCATT TTTACCCTCA GTCTGGCGGG ATACATGCCG GTCTGGGAAC CGTTGCTCAT CTATGCGCTG GGCACTTTGT GGTACGGATT GTTTAACTGG TTTTGGTTCT GGATCTGGCG CGAACAACCG CTACGCGAGT CACTAAGTCT GCTGTACCGT GAACTGGCAG ATTACTGTGA AGCCAAATAC AGCCTGCTTA CCCAGCACAC CGACCCTGAA AAAGCGCTGC CGCCGCTGCT GGTGCGCCAG CAAAAAGCGG TCGATTTAAT TACCCAGTGC TATCAGCAAA TGCATATGCT TTCCGCGCAA AATAATACCG ATTACAAGCG GATGCTGCGT ATTTTCCAGG AGGCGCTGGA TTTGCAGGAA CATATTTCGG TCAGTTTGCA TCAGCCGGAA GAGGTGCAAA AGCTGGTCGA GCGTAGCCAT GCGGAAGAAG TTATCCGCTG GAATGCGCAA ACCGTCGCCG CTCGCCTGCG CGTGCTGGCT GATGACATTC TTTACCATCG CCTGCCGACG CGTTTTACGA TGGAAAAGCA AATTGGCGCA CTGGAAAAAA TCGCCCGCCA ACACCCGGAT AATCCGGTTG GGCAATTCTG CTACTGGCAT TTCAGCCGCA TTGCCCGCGT GCTGCGCACC CAAAAACCGC TCTATGCCCG TGACTTACTG GCCGATAAAC AGCGGCGAAT GCCGTTACTT CCGGCGCTGA AAAGTTATTT GTCACTAAAG TCTCCGGCGC TACGCAATGC CGGACGACTC AGTGTGATGT TAAGCGTTGC CAGCCTGATG GGCACCGCGC TGCATCTGCC GAAGTCGTAC TGGATCCTGA TGACGGTATT GCTGGTGACG CAAAATGGCT ATGGCGCAAC CCGTCTGAGG ATTGTGAATC GCTCCGTGGG AACCGTGGTC GGGTTAATCA TTGCGGGCGT GGCGCTGCAC TTTAAAATTC CCGAAGGTTA CACCCTGACC TTGATGCTGA TTACCACCCT TGCCAGCTAC CTGATATTGC GCAAAAACTA CGGCTGGGCG ACGGTCGGTT TTACCATTAC CGCAGTGTAT ACCCTGCAAC TGTTGTGGCT GAACGGTGAG CAGTACATCC TTCCGCGTCT TATCGATACC ATTATTGGTT GTTTAATTGC TTTCGGCGGT ACTGTCTGGC TGTGGCCGCA GTGGCAGAGC GGGTTATTGC GTAAGAACGC CCATGACGCT TTAGAAGCCT ATCAGGACGC GATTCGCTTG ATTCTGAGCG AGGACCCGCA ACCAACGCCA CTGGCCTGGC AGCGAATGCG GGTGAATCAG GCGCATAACA CTCTGTATAA CTCATTGAAT CAGGCGATGC AGGAACCGGC ATTCAACAGC CATTATCTGG CAGATATGAA ACTGTGGGTA ACGCACAGTC AGTTTATAGT TGAACATATC AATGCCATGA CCACGCTGGC ACGGGAACAC 
CGGGCATTGC CACCTGAACT GGCGCAAGAA TATTTACAGT CTTGTGAAAT CGCCATTCAG CGTTGTCAGC AGCGACTGGA GTATGACGAA CCGGGTAGTT CTGGCGATGC CAATATCATG GATGCGCCGG AGATGCAGCC GCACGAAGGC GCGGCAGGTA CGCTGGAGCA GCATTTACAG CGGGTTATTG GTCATCTGAA CACCATGCAC ACCATTTCGT CGATGGCATG GCGTCAGCGA CCGCATCACG GGATTTGGCT GAGTCGCAAG TTGCGGGATT CGAAGGCGTA A
|
Protein sequence | MWRRLIYHPD INYALRQTLV LCLPVAVGLM LGELRFGLLF SLVPACCNIA GLDTPHKRFF KRLIIGASLF ATCSLLTQVL LAKDVPLPFL LTGLTLVLGV TAELGPLHAK LLPASLLAAI FTLSLAGYMP VWEPLLIYAL GTLWYGLFNW FWFWIWREQP LRESLSLLYR ELADYCEAKY SLLTQHTDPE KALPPLLVRQ QKAVDLITQC YQQMHMLSAQ NNTDYKRMLR IFQEALDLQE HISVSLHQPE EVQKLVERSH AEEVIRWNAQ TVAARLRVLA DDILYHRLPT RFTMEKQIGA LEKIARQHPD NPVGQFCYWH FSRIARVLRT QKPLYARDLL ADKQRRMPLL PALKSYLSLK SPALRNAGRL SVMLSVASLM GTALHLPKSY WILMTVLLVT QNGYGATRLR IVNRSVGTVV GLIIAGVALH FKIPEGYTLT LMLITTLASY LILRKNYGWA TVGFTITAVY TLQLLWLNGE QYILPRLIDT IIGCLIAFGG TVWLWPQWQS GLLRKNAHDA LEAYQDAIRL ILSEDPQPTP LAWQRMRVNQ AHNTLYNSLN QAMQEPAFNS HYLADMKLWV THSQFIVEHI NAMTTLAREH RALPPELAQE YLQSCEIAIQ RCQQRLEYDE PGSSGDANIM DAPEMQPHEG AAGTLEQHLQ RVIGHLNTMH TISSMAWRQR PHHGIWLSRK LRDSKA
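
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons or ambiguous bases; this gene begins with ATG, so that simplification does not affect the result here. The names `CODON_TABLE` and `translate` are hypothetical and chosen only for this example.

```python
# Codon table built from the conventional NCBI layout (base order T, C, A, G).
# Table 11 assigns the same amino acids as the standard code; it differs only
# in which codons may act as start codons, which this sketch ignores.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is removed.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate("ATGTGGCGCA GACTGATTTA TCACCCCGAT"))  # MWRRLIYHPD
print(translate("TTGCGGGATT CGAAGGCGTA A"))           # LRDSKA
```

The two printed fragments match the first ten and last six residues of the protein sequence above. Passing the full gene sequence to the same function would be the natural way to check the whole 696-residue translation.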
|
| |