Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3406 |
Symbol | |
ID | 6145506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3485569 |
End bp | 3486879 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618235 |
Product | hypothetical protein |
Protein accession | YP_001745384 |
Protein GI | 170681817 |
COG category | [S] Function unknown |
COG ID | [COG3681] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGATT CGACTTTAAA TCCGTTATGG CAGCGTTACA TCCTCGCCGT TCAGGAGGAA GTAAAACCGG CGCTGGGATG TACTGAACCG ATTTCACTGG CGCTGGCGGC GGCGGTTGCT GCGGCAGAAC TGGAAGGTCC GGTTGAACGT GTAGAAGCCT GGGTTTCGCC AAATCTGATG AAGAACGGTC TGGGCGTCAC CGTTCCCGGC ACGGGAATGG TGGGGCTGCC GATTGCGGCG GCGCTGGGAG CGTTAGGTGG AAATGCTAAC GCCGGGCTGG AAGTGCTGAA AGATGCAACT GCGCAGGCAA TTGCCGATGC CAAAGCACTG CTGGCGGCGG GGAAAGTCTC AGTTAAGATC CAGGAACCTT GCGATGAAAT CCTCTTCTCA CGCGCCAAAG TCTGGAACGG TGAGAAGTGG GCGTGTGTCA CCATTGTCGG CGGGCATACC AACATTGTGC ATATTGAGAC GCACAATGGT GTGGTGTTTA CCCAGCAGGC GTGTGTGACA GAGGGCGAGC AAGAGTCGCC GCTGACGGTG CTTTCCAGGA CGACGCTGGC TGAGATCCTG AAGTTCGTCA ATGAAGTCCC GTTTGCGGCG ATCCGCTTTA TTCTCGATTC CGCGAAGTTA AATTGCGCGT TATCGCAGGA AGGCTTGAGC GGTAACTGGG GGCTGCATAT TGGCGCGACG CTGGAAAAAC AGTGCGCGCG CGGCTTGCTG GCGAAAGATC TCTCTTCATC CATTGTGATT CGTACCAGCG CGGCATCCGA TGCGCGTATG GGCGGCGCTA CGCTTCCGGC AATGAGTAAC TCCGGCTCGG GTAACCAGGG GATCACTGCA ACAATGCCTG TGGTGGTGGT AGCAGAACAC TTCGGGGCCG ATGATGAACG GCTGGCGCGT GCGCTGATGC TTTCGCATTT GAGCGCGATT TACATCCATA ACCAGTTACC GCGTTTGTCT GCGCTTTGTG CCGCAACGAC CGCAGCAATG GGGGCCGCCG CCGGGATGGC ATGGCTGGTG GATGGGCGTT ATGAAACCAT TTCGATGGCG ATCAGCAGTA TGATCGGCGA TGTCAGCGGC ATGATTTGCG ATGGTGCGTC GAACAGCTGC GCGATGAAGG TTTCGACCAG TGCTTCGGCT GCGTGGAAAG CGGTGTTAAT GGCGCTGGAT GATACCGCAG TGACCGGCAA TGAAGGGATT GTGGCGCATG ATGTTGAGCA GTCGATTGCC AACCTGTGTG CGTTAGCAAG CCATTCGATG CAGCAAACGG ATCGGCAGAT TATCGAGATT ATGGCGAGCA AGGCCAGATA A
|
Protein sequence | MFDSTLNPLW QRYILAVQEE VKPALGCTEP ISLALAAAVA AAELEGPVER VEAWVSPNLM KNGLGVTVPG TGMVGLPIAA ALGALGGNAN AGLEVLKDAT AQAIADAKAL LAAGKVSVKI QEPCDEILFS RAKVWNGEKW ACVTIVGGHT NIVHIETHNG VVFTQQACVT EGEQESPLTV LSRTTLAEIL KFVNEVPFAA IRFILDSAKL NCALSQEGLS GNWGLHIGAT LEKQCARGLL AKDLSSSIVI RTSAASDARM GGATLPAMSN SGSGNQGITA TMPVVVVAEH FGADDERLAR ALMLSHLSAI YIHNQLPRLS ALCAATTAAM GAAAGMAWLV DGRYETISMA ISSMIGDVSG MICDGASNSC AMKVSTSASA AWKAVLMALD DTAVTGNEGI VAHDVEQSIA NLCALASHSM QQTDRQIIEI MASKAR
|
| |