Gene Information |
Locus tag | EcSMS35_3652 |
Symbol | |
ID | 6145214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3711854 |
End bp | 3712939 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618479 |
Product | hypothetical protein |
Protein accession | YP_001745619 |
Protein GI | 170682063 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
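The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3711854
end_bp = 3712939

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per amino acid plus one untranslated stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1086 361
```

Under these assumptions the computed values agree with the Gene Length (1086 bp) and Protein Length (361 aa) fields.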
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.164535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0000282332 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGACGT TTCCTCTGCA AAGCCTGACG CTTATTGAGG CGCAGCAAAA GCAGTTTGCG CTGGTGGATA GCATTTGTCG CCATTTTCCC GGCAGCGAGT TTCTTGCTGG CGGTGATTTA GGCTTAACGC CGGGGCTGAA TCAACCGCGC ATTACCCAGC GAGTGGAACA GGTGCTGGCT GATGCATTTC ACGCACAGGC TGCAGCGCTG GTACAGGGCG CGGGTACTGG CGCGATCCGT GCGGCGTTGG CGGCTTTGCT CAAACCGGGG CAGCGTCTTC TGGTGCATGA CGCGCCTGTT TACCCGACGA CTCAGGTCAT TATTGAGCAG ATGGGGCTGA CGCTTATTAC TGCTAATTTC AATGACCTGT CGGCACTTAA GCAGGTCGTC GACGAGCAAC AACCGGATGC GGCGCTGGTG CAGCATACGC GCCAGCAGCC GCAGGACGGC TACATTCTGG CAGATGTGCT GGCAACGTTG CGCTCGGCAG GTGTTCCAGC CTTAACCGAT GATAACTATG CGGTGATGAA GGTGGCGCGC ATTGGCTGTG AATGCGGGGC GAATGTCTCG ACATTTTCCT GCTTCAAGCT ATTTGGGCCA GAGGGTGTTG GCGCGGTGGT AGGCGATGCC GATGTTATTA GCCGGATTCG CGCCACGCTC TATTCCGGCG GCAGCCAGGT TCAGGGCGCG CAGGCGCTGG AAGTCTTGCG TGGTCTGGTG CTCGCGCCAG TGATGCACGC GGTGCAGGCG GGGGTATCTG AACGGTTGCT GGCTTTGCTT AACGGCGGCG CGGTGGCGGA AGTGAAAAGC GCCGTCATTG CCAATGCACA GTCGAAGGTA TTGATTGTGG AGTTTCATCA GCCGATTGCC GCCAGAGTGC TGGAAGAGGC GCAGAAACTC GGTGCCTTGC CTTACCCGGT TGGTGCAGAG TCGAAATATG AAATCCCGCC GCTCTTTTAT CGACTTTCCG GAACGTTTCG CCAGGCGAAT CCGCAGTTAG AACATTGCGC GATTCGCATT AACCCAAATC GCAGCGGTGA AGAGACGGTA TTGCGGATTT TGCGCGAGAG TATTGCCGAC GTTTAA
|
Protein sequence | MKTFPLQSLT LIEAQQKQFA LVDSICRHFP GSEFLAGGDL GLTPGLNQPR ITQRVEQVLA DAFHAQAAAL VQGAGTGAIR AALAALLKPG QRLLVHDAPV YPTTQVIIEQ MGLTLITANF NDLSALKQVV DEQQPDAALV QHTRQQPQDG YILADVLATL RSAGVPALTD DNYAVMKVAR IGCECGANVS TFSCFKLFGP EGVGAVVGDA DVISRIRATL YSGGSQVQGA QALEVLRGLV LAPVMHAVQA GVSERLLALL NGGAVAEVKS AVIANAQSKV LIVEFHQPIA ARVLEEAQKL GALPYPVGAE SKYEIPPLFY RLSGTFRQAN PQLEHCAIRI NPNRSGEETV LRILRESIAD V
|
| |
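The sequence fields are printed in space-separated blocks of ten letters. The sketch below shows one way to strip that spacing, compute a GC percentage, and translate a coding sequence with the amino acid assignments of NCBI translation table 11. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and chosen for illustration. The sketch assumes the sequence is given in the coding direction, as it is here (it begins with ATG), and it does not handle the alternative start codons that table 11 allows.

```python
# Illustrative helpers; none of these names come from the record itself.
BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that splits the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGAAGACGT TTCCTCTGCA AAGCCTGACG"
print(translate(prefix))  # MKTFPLQSLT
```

The translated prefix matches the first block of the protein sequence field. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.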