Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4825 |
Symbol | |
ID | 6145629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4912969 |
End bp | 4916328 |
Gene Length | 3360 bp |
Protein Length | 1119 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641619629 |
Product | hypothetical protein |
Protein accession | YP_001746736 |
Protein GI | 170684274 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTTGTGA AAACTCCAGT TAACCCACTT TTGCAATGGC TTAACATGTT TTTTAGTCGC CGTTCATTAT CCGGAGCTGA TGGACGGGCG TTATATGCTT ACCGTTGTAC TGATACAGAG TACGAAAGTT TGGCAGAACT ACTGCGTACA TATGCCCCCC GTAGTTATCC AAGAACGATA TTCATTTCCT ATAGCGATGT TCTATTTAGC ATATATGCCG CAGAGTTTAT CCGCCGGACT CATACGGTCG GACACCCTAA ATGGGACACA ATTTTAGATT CTATTGACTG GAAAGTTCCG TATGTTCATC GACAGAAGCT GGTTAATGAT GGCATTCGCT ACTGGAAAAG AAAGATAAGA AACCTGGGGC AAGCTTCGGG TTATCTGCAT ACACTGGCTT GTGAAGGTGG TTTACCTATC CGCATGATTG AAAATGAGAG CGGTTATTTG ATTACCTATT TCAGAAGGAT ATATCAGGCG CTTCGTGGGC AAGCATCTCA ATATCCTGCC GCTAAAATTG CACAAGAATT AGGCGATACG ATTCCTGTCA CAATGCAAAA CGAACTGGTC TATGAAATAG CCGGTGAATT CTGCGAGACT CTCTGCCGAT TACTTAGTGA GCATCCACCT CAAAGCAGTG ATCCCGTTTC TGCATTACGT AAGCTCTCTC CAGACTGGCA CCTTCAACTT CCACTCGTTC TCCCCGAAGC GAATGCTGCA GAGATTGTAA GGCGATTACT TTCTCAATCT TCTGAAATAC GCAGTGCAAG TAGTCTTCAG GTTGAAAGAA TCTGGGTCGA TGTTGATGAC AGTTGGTATT GTGATGCCCG GTTCCGATTC CCGGCCACAA TGCGTACAGA ACAGTTAACC TCTTTGTTTG AATGCAATAT TCAGCCGGAG CAGACCCGAC TCATCATTTC AGGAAAATGG AGAAATGGCG GGGCAAGGTT GGCAATGCTA AGCCGTTATG AGCAGCAAGA TTGGCGGGTC GAGTTATTAC CTATTGCGAT GCAAAAGCTC TCTGGTGCAG ATGCAATGGC CGAAATCTCG TTGTCGCTTC ATGAAGGTCC AATTCTGTTA GGCCACACAA TTCCAAAAGG CGGTTATGAA CTTACTGAAG AACTCCCCTG GGTTTTTGAA GCGATGAATG AGAGTGAATC GCAGCTAAAA CTTGTCGGTA TGGGGTCTGT GAGTTCCAGG CTAAATGCTC TGTTTATTTC GCTACCGAAA AATAGCCATT TAGATATTAG TGGAGAAGGT GAGTTTGATA TCCCAAGGTT GCTGAAAAAC AGTGAACGCA GCTTAACTAA AATAAGTGGA GTATTTAGCG TTGTTTTACA TGATGGTGCA GTTTGTACAA TTCGCACCCA GCAACTTTAT GATTCTGCGA TTGAATATTA TATTAAATCG ACAGAAGTTG AACTGGTTAA ATCGGATTAT CCTGTCCATC GAGCCTGGCC GAAGATCGGT TGGAAAAAGG ATCTGCAATA TGGCATTGTA CCGGAGAAAG AACTTTTTTG GCGTTCCATT CGTTCAGGTA ACAATGCCTG GTATTCTGTC GCATCAAAAA TGCCAAAAGG ACAGATAGAA GTCAGACGAA TAGTTAATGA TGAGGTTTTA TTTAGCGGTA AAGTTGTAGT TTTACCAGCT GACTTCGATA TTAATATTAT ACCTGAGAGT GCTCAGCAAG GCATCATAAT GCTTTCTGGT ATAACGGATA CTAGAATTGA CAAATATTCA AACAATGAAA AAGTAACACT TAAGTCCGAT TATTCACAAA ACGAGTGTGC TATTTATTAT AATTCATCGG AGATGCTGGA AAATACCGTT GATTTACGGG TCTCCTGGAA AGATGGTTCT AATCTTAAAC TATTATTACC TAAGCCGGTT AGCGGTGGCC GATTTGTAAC TCATGATGGT TCTGTTCATT TTGATGGTGT GGCATCTATA GCGCATTTGC ATGGAATAGA TGCTGAGTTG TTAACCATAT CTTGTGCTGG AAGAGGATAT CTTAATATCG AGTTGTTGGA TGAAAATCCA GTAGCTGAAA AATTCCGTTA TTTACATGCC GACCTTCCTC TTTTATCAGG ATGCAATGAC AAATTAAAAC AAATTTCACT TTATGAGAAC TATAATTTGC TAAATGCTAT GCTGGCATGT GCATGGAGCA GCAATAGTAC ATTATGTGTT GATTTTTACT CTGACAGATT TGGAAAGGAT AAAGCAACAC TCAGTATTAA ACGCTACGAT GGTAGTTTTA TTGAACACGA TCAAGGGTTA CTGGTTGATA TAAAAAATTC TGTTGTTTTT CCTGCAAATA GAATAGACGA GCTGGTTGTT GATGCTATTT CTCTTAAGGA TCCTGGCTTA CACATATCGT TGCTAAAAAA AGATGAATTT GCTTATGATC TTTCAGCTCT GAATGTTCAG GATAGCCCAT GGTTAATTGT GGGGAAACTT GATGGTACAG CTCGCATTGC ACCAGTAATT AAATGGGGGC TACCTGTATT ACAGACAAAT GATTTATTAC TTAATGCTCT ATGCGAAGCG GATTCCGAAC AGCGAAAAAA ATATTTTAAT GAGCTAATTT TTGAAATAGA TAACAATCCT TTGCAAAATT ATTGCTGTTT ATTAACGGAG TATATTAAGA AATACAAAAT GAATAATGGC TTATCCTTGC TGGATCTGGA TTTGTTCAGA GGTATTTCGA GTAATTACCG CGTAGTTGTT CAGTTGTTAA TATCATCATG TCTTTCTGGT GATAGCGATA CGATTTATGA TATACAGGAA GAATTACCCT TTTCATGGGG ATGGATTCCC GTTTCAATCT GGAAAGATGT TTTTCAAAAG TGTTGGGCTT ATCTGGAAAA ACAGATTAAC GATAAAACAT TAGCATTACA TATATTGCAA CCCTTTATTG CTTTTATGAA CCATCGTGCA CATATCGATC GTCGTCTGGC TCCCATTGCG AATATGTTAC TTACATATAG CAAACTCCTA CCAGCCGGTT GTAATGTCTT GCCAACTGTT AGTCGTGAGC AGTTTAATGA AGCTAAACAG ATGCTATTAA GGAACCCCGA CAGCTTTGGG CGTATCAGTA TCTTCCCTAA AGAACTTTGG TCTAGTGCGA TTACTCCAGA GTTAAAATCT GTTTTTAATA AGCTTTGGAT TAAAAATAAA TATCACTCAC GGCTTGAAAA ACGTTTTAAT TTGATGTTAG TCGCAGCGCT GTTAACCCAA AAAGATAATA ACTTGATACA TCAACTGTCT GCGCTTTTTG AATTCCACTA TCAGCAAGCC CCGCAGCAAT TAGGGGTAAT CTATCAATAT TATTTTGAAC AAGCAGGAGT ATGTCATTGA
|
Protein sequence | MLVKTPVNPL LQWLNMFFSR RSLSGADGRA LYAYRCTDTE YESLAELLRT YAPRSYPRTI FISYSDVLFS IYAAEFIRRT HTVGHPKWDT ILDSIDWKVP YVHRQKLVND GIRYWKRKIR NLGQASGYLH TLACEGGLPI RMIENESGYL ITYFRRIYQA LRGQASQYPA AKIAQELGDT IPVTMQNELV YEIAGEFCET LCRLLSEHPP QSSDPVSALR KLSPDWHLQL PLVLPEANAA EIVRRLLSQS SEIRSASSLQ VERIWVDVDD SWYCDARFRF PATMRTEQLT SLFECNIQPE QTRLIISGKW RNGGARLAML SRYEQQDWRV ELLPIAMQKL SGADAMAEIS LSLHEGPILL GHTIPKGGYE LTEELPWVFE AMNESESQLK LVGMGSVSSR LNALFISLPK NSHLDISGEG EFDIPRLLKN SERSLTKISG VFSVVLHDGA VCTIRTQQLY DSAIEYYIKS TEVELVKSDY PVHRAWPKIG WKKDLQYGIV PEKELFWRSI RSGNNAWYSV ASKMPKGQIE VRRIVNDEVL FSGKVVVLPA DFDINIIPES AQQGIIMLSG ITDTRIDKYS NNEKVTLKSD YSQNECAIYY NSSEMLENTV DLRVSWKDGS NLKLLLPKPV SGGRFVTHDG SVHFDGVASI AHLHGIDAEL LTISCAGRGY LNIELLDENP VAEKFRYLHA DLPLLSGCND KLKQISLYEN YNLLNAMLAC AWSSNSTLCV DFYSDRFGKD KATLSIKRYD GSFIEHDQGL LVDIKNSVVF PANRIDELVV DAISLKDPGL HISLLKKDEF AYDLSALNVQ DSPWLIVGKL DGTARIAPVI KWGLPVLQTN DLLLNALCEA DSEQRKKYFN ELIFEIDNNP LQNYCCLLTE YIKKYKMNNG LSLLDLDLFR GISSNYRVVV QLLISSCLSG DSDTIYDIQE ELPFSWGWIP VSIWKDVFQK CWAYLEKQIN DKTLALHILQ PFIAFMNHRA HIDRRLAPIA NMLLTYSKLL PAGCNVLPTV SREQFNEAKQ MLLRNPDSFG RISIFPKELW SSAITPELKS VFNKLWIKNK YHSRLEKRFN LMLVAALLTQ KDNNLIHQLS ALFEFHYQQA PQQLGVIYQY YFEQAGVCH
|
| |