Gene Information |
Locus tag | EcHS_A1654 |
Symbol | rspA |
ID | 5594637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1676062 |
End bp | 1677276 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640920802 |
Product | starvation-sensing protein RspA |
Protein accession | YP_001458358 |
Protein GI | 157161040 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.0315691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCG TAAAGGCTGA AGTTTTTGTT ACCTGTCCGG GGCGTAATTT CGTCACATTA AAAATCACCA CTGAGGACGG TATTACGGGC CTTGGGGATG CCACCCTCAA TGGACGTGAG CTTTCCGTGG CCTCTTATTT GCAGGATCAC CTTTGTCCGC AGCTTATTGG TCGCGATGCG CACCGTATCG AAGATATCTG GCAGTTTTTC TATAAAGGTG CTTACTGGCG TCGCGGTCCG GTTACGATGT CGGCCATTTC AGCGGTTGAT ATGGCGCTGT GGGATATTAA AGCCAAAGCT GCCAACATGC CGCTTTACCA GTTACTCGGC GGCGCGTCTC GTGAAGGGGT GATGGTTTAT TGCCATACCA CCGGTCACAG TATTGATGAA GCTCTGGATG ATTATGCCCG TCATCAAGAG CTTGGATTCA AAGCCATCCG CGTGCAGTGC GGAATCCCTG GTATGAAAAC CACCTACGGC ATGTCGAAAG GTAAAGGTCT GGCTTATGAA CCCGCAACCA AAGGACAGTG GCCGGAAGAG CAGCTGTGGT CGACGGAGAA ATACCTCGAT TTCATGCCGA AATTGTTTGA CGCGGTACGT AACAAGTTTG GTTTTAATGA ACATTTGCTG CATGACATGC ACCATCGCTT AACGCCTATT GAAGCGGCGC GCTTTGGTAA AAGCATTGAA GATTATCGCA TGTTCTGGAT GGAAGACCCG ACGCCTGCGG AAAACCAGGA ATGCTTCCGT CTCATTCGCC AACATACCGT CACACCCATC GCAGTGGGTG AAGTCTTCAA CAGCATCTGG GACTGCAAAC AACTGATTGA AGAGCAACTC ATCGATTATA TCCGCACCAC GCTGACCCAT GCAGGCGGAA TTACCGGTAT GCGCCGGATT GCCGATTTTG CTTCGCTGTA TCAGGTACGT ACTGGCTCAC ACGGTCCTTC CGATTTGTCA CCAGTCTGCA TGGCTGCGGC GCTGCACTTT GATCTGTGGG TCCCCAATTT CGGTGTCCAG GAATACATGG GTTATTCCGA ACAAATGCTC GAAGTCTTCC CGCACAACTG GACTTTCGAT AACGGCTATA TGCATCCGGG AGACAAACCG GGTCTTGGTA TCGAATTCGA TGAAAAGCTG GCGGCGAAAT ATCCCTATGA ACCTGCTTAT CTACCAGTCG CACGTCTGGA AGATGGCACG CTGTGGAACT GGTAA
|
Protein sequence | MKIVKAEVFV TCPGRNFVTL KITTEDGITG LGDATLNGRE LSVASYLQDH LCPQLIGRDA HRIEDIWQFF YKGAYWRRGP VTMSAISAVD MALWDIKAKA ANMPLYQLLG GASREGVMVY CHTTGHSIDE ALDDYARHQE LGFKAIRVQC GIPGMKTTYG MSKGKGLAYE PATKGQWPEE QLWSTEKYLD FMPKLFDAVR NKFGFNEHLL HDMHHRLTPI EAARFGKSIE DYRMFWMEDP TPAENQECFR LIRQHTVTPI AVGEVFNSIW DCKQLIEEQL IDYIRTTLTH AGGITGMRRI ADFASLYQVR TGSHGPSDLS PVCMAAALHF DLWVPNFGVQ EYMGYSEQML EVFPHNWTFD NGYMHPGDKP GLGIEFDEKL AAKYPYEPAY LPVARLEDGT LWNW
|
| |
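The length fields in the record above can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and not part of the source database: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions are consistent with the values listed (1676062 to 1677276, 1215 bp, 404 aa). The GC helper ignores the spaces that separate the 10-base blocks in the sequence field.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content in percent; whitespace between sequence blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    length = gene_length(1676062, 1677276)
    print(length)                           # 1215, matches "Gene Length"
    print(expected_protein_length(length))  # 404, matches "Protein Length"
```

To compare against the reported GC content, pass the full text of the "Gene sequence" field to `gc_percent` and round the result.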