Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4822 |
Symbol | |
ID | 6145041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4909284 |
End bp | 4910387 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641619626 |
Product | hypothetical protein |
Protein accession | YP_001746733 |
Protein GI | 170681364 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAT CTATCGACAT TTCAGAACTT ATTCAATTAG GGAAGAAAAT GTTACCAGAA GGAGTCGATT TTTTTCTGGA TGAATCCCCT ATTGACTTTG ATCCTATAGA TATTGAGTTA TCCACGGGTA AAGAAGTTAG TATCGAAGAT CTTGACCCTG GTAGCGGGCT AATCTCTTAT CATGGCCGCC AGGTTCTTTT ATATATTAGG GACCATTCAG GGCGTTATGA TGCGGCTATC ATCGATGGCG AAAAAGGAAA ACGTTTTCAT ATTGCCTGGT GCAGAACTCT TGATGAAATG CGCCATAAAA ATCGATTTGA AAGGTATCAT GCAACTAACC GCATAGATGG TTTATTCGAA ATTGATGATG GTTCAGGCCG GAGCCAGGAT GTTGATTTAC GGGTATGTAT GAATTGTCTG GAACGACTTA ATTACAAAGG AAGTATTGAT AAACAAAGGA AAAGAGAGAT TTTTAAATCA TTCTCATTAA ATGAGTTTTT TTCAGATTAT AGTACCTGTT TTCGTCATAT GCCTAAGGGT ATCTATGACA AAACAAATAG TGGGTATGTC GAAAACTGGA AAGATATATC AAAATCAATA CGAGAAAAGG CCAAGTATAC TTGTAATGAT TGTGGTGTGA ATTTATCAAC CGCCAAAAAC TTGTGCCATG TCCATCATAA AAATGGCATC AAATATGATA ATCACCATGA AAACCTTCTT GTTCTGTGTA AGGATTGCCA TCGTAAACAG CCCCTCCATG AAGGTATATT CGTTACCCAA GCTGAGATGG CTATCATTCA ACGTTTACGT TCCCAACAAG GGTTATTAAA AGCCGAATCC TGGAATGAAA TATATGACCT GACTGATCCA TCAGTACATG GTGATATTAA TATGATGCAA CATAAAGGCT TTCAACCTCC TGTTCCTGGG TTAGATCTTC AAAACTCAGA ACATGAAATT ATTGCAACCG TAGAAGCAGC ATGGCCAGGC CTTAAAATTG CAGTTAACCT TACTCCCGCC GAAGTCGAAG GATGGAGAAT ATATACCGTG GGTGAGCTGG TTAAAGAAAT ACAAACAGGA GCCTTTACGT CAGCAACGTT GTAA
|
Protein sequence | MKLSIDISEL IQLGKKMLPE GVDFFLDESP IDFDPIDIEL STGKEVSIED LDPGSGLISY HGRQVLLYIR DHSGRYDAAI IDGEKGKRFH IAWCRTLDEM RHKNRFERYH ATNRIDGLFE IDDGSGRSQD VDLRVCMNCL ERLNYKGSID KQRKREIFKS FSLNEFFSDY STCFRHMPKG IYDKTNSGYV ENWKDISKSI REKAKYTCND CGVNLSTAKN LCHVHHKNGI KYDNHHENLL VLCKDCHRKQ PLHEGIFVTQ AEMAIIQRLR SQQGLLKAES WNEIYDLTDP SVHGDINMMQ HKGFQPPVPG LDLQNSEHEI IATVEAAWPG LKIAVNLTPA EVEGWRIYTV GELVKEIQTG AFTSATL
|
| |