Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0825 |
Symbol | |
ID | 6146484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 828262 |
End bp | 829347 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615713 |
Product | hypothetical protein |
Protein accession | YP_001742905 |
Protein GI | 170682957 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAGTG GTCATCGCTT TGATGCTCAG ACGTTGCACA GTTTTATTCA GGCTGTATTT CGTCAGATGG GTAGCGAGGA ACAAGAAGCG AAATTAGTTG CCGATCATTT AATCGCGGCA AACCTGGCAG GGCATGATTC ACATGGTATT GGCATGATCC CAAGCTATGT GCGCTCCTGG AGTCAGGGGC ATCTGCAAAT TAACCATCAT GCCAAAGTCG TTAAAGAGGC GGGGGCGGCG GTCACGCTCG ATGGCGATCG CGCATTTGGT CAGGTCGCGG CACACGAAGC GATGGCGCTG GGGATTGAGA AAGCGCATCA GCACGGCATT GCCGCCGTGG CGCTCCATAA CTCGCATCAT ATCGGCCGTA TCGGTTACTG GGCGGAGCAG TGTGCAGCGG CGGGGTTTGT CTCTATCCAC TTTGTTAGCG TGGTCGGTAT TCCAATGGTC GCGCCGTTCC ACGGTCGCGA CAGCCGCTTT GGCACCAATC CGTTCTGTGT GATTTTCCCT CGTAAAGATA ATTTTCCGCT GTTGCTCGAT TACGCCACCA GCGCCATTGC ATTTGGCAAA ACCCGCGTCG CCTGGCATAA AGGCGTCCCC GTGCCGCCAG GTTGCCTGAT TGACGTTAAC GGCGTGCCGA CGACAAATCC GGCGGTAATG CAGGAGTCGC CGTTGGGTTC GCTGTTGACC TTTGCCGAAC ATAAAGGCTA CGCCCTTGCA GCGATGTGTG AAATTCTTGG CGGGGCGCTT TCTGGCGGTA AAACGACGCA TCAGGAAACG TTACAAACCA GTCCCGATGC CATTCTTAAC TGCATGACCA CTATCATCAT CAACCCGGAA CTGTTCGGCG CGCCGGATTG TAGTGCACAG ACCGAAGCCT TTGCCGAGTG GGTGAAAGCC TCGCCGCATG ATGACGATAA GCCGATTTTG CTACCGGGCG AGTGGGAAGT GAACACGCGT CGCGAACGGC AGGAGCAGGG GATTCCACTG GATGCGGGAA GCTGGCAGGC CATTTGTGAT GCAGCGCGGC AGATTGGTAT GCCGGAAGAG ACGTTGCAGG CTTTCTGTCA GCAGTTAGCC AGCTAA
|
Protein sequence | MESGHRFDAQ TLHSFIQAVF RQMGSEEQEA KLVADHLIAA NLAGHDSHGI GMIPSYVRSW SQGHLQINHH AKVVKEAGAA VTLDGDRAFG QVAAHEAMAL GIEKAHQHGI AAVALHNSHH IGRIGYWAEQ CAAAGFVSIH FVSVVGIPMV APFHGRDSRF GTNPFCVIFP RKDNFPLLLD YATSAIAFGK TRVAWHKGVP VPPGCLIDVN GVPTTNPAVM QESPLGSLLT FAEHKGYALA AMCEILGGAL SGGKTTHQET LQTSPDAILN CMTTIIINPE LFGAPDCSAQ TEAFAEWVKA SPHDDDKPIL LPGEWEVNTR RERQEQGIPL DAGSWQAICD AARQIGMPEE TLQAFCQQLA S
|
| |