Gene Information |
Locus tag | EcSMS35_1619 |
Symbol | rspA |
ID | 6142908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1608687 |
End bp | 1609901 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616495 |
Product | starvation-sensing protein RspA |
Protein accession | YP_001743673 |
Protein GI | 170684261 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
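The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the reported gene length matches that convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates.
START_BP = 1608687
END_BP = 1609901


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1215 404
```

Both values agree with the Gene Length (1215 bp) and Protein Length (404 aa) fields.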
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000210785 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGATCG TAAAGGCTGA AGTTTTTGTT ACCTGTCCGG GGCGTAATTT CGTCACATTA AAAATCACCA CTGAGGACGG TATTACGGGC CTTGGGGATG CCACCCTAAA TGGACGTGAG CTTTCCGTGG CCTCTTATTT GCAGGATCAC CTTTGTCCGC AGCTTATTGG TCGCGATGCA CACCGTATCG AAGATATCTG GCAGTTTTTC TATAAAGGTG CTTACTGGCG TCGCGGTCCG GTTACGATGT CGGCCATTTC AGCGGTTGAT ATGGCGCTGT GGGATATTAA AGCCAAAGCT GCCGACATGC CGCTTTACCA GTTACTCGGC GGCGCGTCTC GTGAAGGGGT GATGGTTTAT TGCCATACCA CCGGTCACAG TATTGATGAA GCTCTGGATG ATTATGCCCG TCATCAGGAG CTGGGATTCA AAGCCATCCG CGTGCAGTGC GGAATCCCTG GTATGAAAAC CACCTATGGC ATGTCGAAAG GTAAAGGTCT GGCTTATGAA CCCGCAACCA AAGGACAGTG GCCGGAAGAG CAGCTGTGGT CGACGGAGAA ATACCTCGAT TTCATGCCGA AATTGTTTGA CGCGGTACGT AACAAGTTTG GTTTTAATGA ACATTTGCTT CATGACATGC ACCATCGCTT AACGCCTATT GAAGCGGCGC GCTTTGGTAA AAGCATTGAA GATTATCGCA TGTTCTGGAT GGAAGACCCG ACGCCTGCGG AAAACCAGGA GTGTTTCCGC CTGATTCGCC AACACACCGT CACGCCAATT GCAGTGGGTG AAGTCTTCAA CAGCATCTGG GACTGCAAAC AGCTGATTGA AGAACAACTC ATCGACTATA TCCGCACCAC GCTGACTCAC GCGGGGGGGA TTACCGGTAT GCGCCGGATT GCCGATTTTG CTTCGCTGTA TCAGGTACGT ACTGGCTCAC ACGGTCCTTC CGATTTGTCG CCAGTCTGCA TGGCTGCGGC GCTGCACTTT GATCTGTGGG TCCCCAATTT CGGTGTCCAG GAATACATGG GTTATTCCGA ACAAATGCTT GAAGTCTTCC CGCACAACTG GACTTTCGAT AACGGCTATA TGCATCCGGG AGACAAACCG GGTCTTGGCA TCGAATTCGA TGAAAAGCTG GCGGCGAAAT ATCCCTATGA ACCTGCTTAT CTGCCAGTCG CACGTCTGGA AGATGGCACG CTGTGGAACT GGTAA
Protein sequence | MKIVKAEVFV TCPGRNFVTL KITTEDGITG LGDATLNGRE LSVASYLQDH LCPQLIGRDA HRIEDIWQFF YKGAYWRRGP VTMSAISAVD MALWDIKAKA ADMPLYQLLG GASREGVMVY CHTTGHSIDE ALDDYARHQE LGFKAIRVQC GIPGMKTTYG MSKGKGLAYE PATKGQWPEE QLWSTEKYLD FMPKLFDAVR NKFGFNEHLL HDMHHRLTPI EAARFGKSIE DYRMFWMEDP TPAENQECFR LIRQHTVTPI AVGEVFNSIW DCKQLIEEQL IDYIRTTLTH AGGITGMRRI ADFASLYQVR TGSHGPSDLS PVCMAAALHF DLWVPNFGVQ EYMGYSEQML EVFPHNWTFD NGYMHPGDKP GLGIEFDEKL AAKYPYEPAY LPVARLEDGT LWNW
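The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table by hand rather than relying on a bioinformatics library. Translation table 11 assigns amino acids to codons the same way the standard code does and differs only in which start codons are permitted. This sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The `translate` helper is a hypothetical name, not part of any tool referenced by the record.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (third base varies fastest).
# These assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAAGATCG TAAAGGCTGA AGTTTTTGTT"))  # prints: MKIVKAEVFV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein record up to the terminal TAA stop codon.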