Gene Information |
Locus tag | EcSMS35_4828 |
Symbol | |
ID | 6146965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4922866 |
End bp | 4925724 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619632 |
Product | helicase family protein |
Protein accession | YP_001746739 |
Protein GI | 170683792 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
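The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record: the function names are hypothetical, and it assumes inclusive 1-based coordinates, an unspliced CDS, and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for locus tag EcSMS35_4828.
length_bp = gene_length(4922866, 4925724)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

Under those assumptions the computed values agree with the Gene Length (2859 bp) and Protein Length (952 aa) fields listed in the record.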
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAGC AAAATTATGC ACCTGGTATG CGGGTGGTTA TTCGTGATGC CGAATGGCGT ATTCGCCGGG CGGATGACAG TGGTGATGGT GGGTATTTGC TGACCTGCGA TGGTATTTCA GAGCTTGTGC GCGGTAAAGA AGGGTTATTT CTGACCAAAC TGGAACAAAA AGTAGAGATC CTGGATCCCG CAAAAACGCA TTTGGTGGAA GATGAATCCG CCAATTATCA GGCGGCGCAG TTGTATATCG AAAGTCAGTT ACGCCAACGC GTACCGACAG ACAGTAAAGT CCATTTTGGT CATCTGGCGG CGATGGATTC CATGCCGTTT CAGCTCGACC CTACGCGGAT GGCTCTGGCA CAGCCGCGCC AGCGGATACT CATTGCCGAC GCTGTCGGTC TGGGTAAAAC GCTGGAAGCC GGTATTCTGG TTTCGGAGCT GATCCGTCGT GGGCGCGGAA AACGTATTCT GGTTCTGGCG GTTAAATCGA TGCTGACACA ATTCCAGAAG GAGTTCTGGA GCCGTTTTGC GATCCCATTG ACCCGCCTGG ATTCTGCTGG TTTACAAAAG GTGCGTAACC GCATCCCTAC CAACCACAAC CCGTTTCACT ATTTTGACAA GACCATTATC TCCATCGATA CGCTGAAACA AGACATTGAA TATCGTCACC ATCTGGAAAA TGCCTGGTGG GATATTATTG TCATTGATGA GGCCCATAAC GTCGCAGAAC GTGGAACCAG TTCGTTACGC AGTAAACTGG CAAAACTGCT CGCCGGGCGT AGCGATACGC TGATTATGCT TTCAGCGACG CCGCATGATG GTAAGGCTGA AAGTTTTGCC AGCCTGATGA ACATGCTTGA CCCGACGGCG ATCGCCAACC CGAAAGAGTA TGAATACGCC GACTTTGCCG ATAAAAATCT GGTTGTCCGT CGCTTCAAAA AAGACGTGAA AGATCAGATG TCCGGCGAGT TCCCCGAGCG CAATATTGTC AAACTGACTC GCCTGGCATC CGGTGCCGAA GAAGAAGCAT ACCGTCGCCT GGTTGAGAGT CAGTTCCGTG ATGATGACGA TGAACAGGCC CAGTCTAACA AAGGTCGTCT GTTCAAGATA ACCCTCGAAA AAGCGCTATT TTCCAGCCCG ATGGCGTGTG CCAGCGTAGT GGCTAATCGT CTGAAGCGTC TTGAGAGCCG TAAAGATCAT AATAGCCAGA GCCAGATCAA CGAACTGGAG TCATTGCTGT TGGCGCTGAA TAACATTGAT GCCAGCCAGT TTAGCAAATA CCAGTTGCTG CTCGACACCA TCCGAAAAGA CCTCGCCTGG AAAGCCAATA ACACGGAAGA TCGCCTGGTG ATCTTCACTG AAAGCATTAA AACGCTGGAG TTCCTCGAAC AGCAACTGCG AGCGGATCTG AAATTGAAAG ATGATCAGAT CGCCACGCTG CGCGGCGATC AGGGCGATAC GGTGTTGATG GAAACCGTAG AAGCCTTTGG TAAAACGCAG TCGCCGTTGC GTCTTCTGGT TTGTTCGGAT GTGGCATCTG AAGGGATCAA CCTGCATCAC CTTAGCCACA AAATGATTCA CTTTGATATT CCGTGGTCGT TGATGGTATT CCAGCAACGT AACGGGCGTA TTGACCGCTA TGGGCAAAAA CATCAGCCTC AAATCCGCTA TCTGCTTACC GAGGCCAGCG AACCGCAGAT CAATGGCGAT ATGCGCGTAC TGGAAGTGTT GATCAACAAA GACGAACAGG CGCAGAAGAA CATTGGGGAC AGTTCGGAGT TTACCGGCAA ATTTACCCAG 
GAAGAGGAAG AAGAGCAGGT TGCAGAGTTC ATGATGCAGG ATGATGGTGC CAGCCTGTTT GATCAACTGC TGAACAGCAA CGTCTCGGAA AGCGCCGAAC ATGATCTGTT TGGCGAAATA TGCAGTGCGG TTTCCAGCGA TGCCTCAATG GTCACCGAAA CAGACACCAG CTTGTTTGCC AGCGAACAGG CGTACTGCGA AAGAGCATTG GGTTACCTGA AAGCCAGTGG TCAAACTATC CAGTATGAAA CCTTGCCTGA TAATACCTTG TCGCTGGTGG CGCCGGAGGA GTTACGCCGC CGCTTCAATC AGCTACCGCC TGAAATCGCG CCGGAGAACT GGCAGCTCTA TTTAAGTCAG GATAAAACCG TCATCACCGA CGCGATTGCA CGCGCTCGCG GTGAGCAACA TGCCTGGCCC GATGTGCAGT ATCTCTGGCA AATCAACCCG GTAGTGCAGT GGCTGGACGA TAAAATCTCT TCGGCTTTTG GTCGTCATCA GGCACCGGTT ATCCGTCTGC CATACCTGCT TGAACCCGAT GAAGATCACT TCATTCTTTC CGGGTTATTC CCGAACCGTA AATCACATCC GATGGTGAAC CCGTGGATAG TGGTGAGCTT TAACCGTGAA TCGCTAATCG GCAGTCAGCC ATTTGCTGAG TTTTTACAAC GTCATCCGCA GTTGAGTAAC AAGCTGACTA ACAGCGGCGG TAAAGATCGT AATCACCAGC GCCAGCAGGA TTTACTGGAA GCGGCTATTG CTCATGCCCG GGAGGTATTC ATCCATGATC GGAATGCGTT TGAAACGCAC ATTAACCAGC AACTGAATGA GCATCTGCAA AAGCTGGACG TTTTGCGTGG GCGGCAGTTG AGCCAACTTG AGCTGGATTT TGCCGATAAC AAACAGCAAT TGTCAGTTAA GCAGAGCCGT AAAGAGCAGA GACAACGCGA AATCGAACAC AATTTTGACA GCTATATCGA ATGGATTGAG GACACCATGA CGACTGAAAA AGAACCCTAC ATTCAGGTAA TTGCTGTTAT CACCGGAGCG GAGGGTTAA
|
Protein sequence | MNKQNYAPGM RVVIRDAEWR IRRADDSGDG GYLLTCDGIS ELVRGKEGLF LTKLEQKVEI LDPAKTHLVE DESANYQAAQ LYIESQLRQR VPTDSKVHFG HLAAMDSMPF QLDPTRMALA QPRQRILIAD AVGLGKTLEA GILVSELIRR GRGKRILVLA VKSMLTQFQK EFWSRFAIPL TRLDSAGLQK VRNRIPTNHN PFHYFDKTII SIDTLKQDIE YRHHLENAWW DIIVIDEAHN VAERGTSSLR SKLAKLLAGR SDTLIMLSAT PHDGKAESFA SLMNMLDPTA IANPKEYEYA DFADKNLVVR RFKKDVKDQM SGEFPERNIV KLTRLASGAE EEAYRRLVES QFRDDDDEQA QSNKGRLFKI TLEKALFSSP MACASVVANR LKRLESRKDH NSQSQINELE SLLLALNNID ASQFSKYQLL LDTIRKDLAW KANNTEDRLV IFTESIKTLE FLEQQLRADL KLKDDQIATL RGDQGDTVLM ETVEAFGKTQ SPLRLLVCSD VASEGINLHH LSHKMIHFDI PWSLMVFQQR NGRIDRYGQK HQPQIRYLLT EASEPQINGD MRVLEVLINK DEQAQKNIGD SSEFTGKFTQ EEEEEQVAEF MMQDDGASLF DQLLNSNVSE SAEHDLFGEI CSAVSSDASM VTETDTSLFA SEQAYCERAL GYLKASGQTI QYETLPDNTL SLVAPEELRR RFNQLPPEIA PENWQLYLSQ DKTVITDAIA RARGEQHAWP DVQYLWQINP VVQWLDDKIS SAFGRHQAPV IRLPYLLEPD EDHFILSGLF PNRKSHPMVN PWIVVSFNRE SLIGSQPFAE FLQRHPQLSN KLTNSGGKDR NHQRQQDLLE AAIAHAREVF IHDRNAFETH INQQLNEHLQ KLDVLRGRQL SQLELDFADN KQQLSVKQSR KEQRQREIEH NFDSYIEWIE DTMTTEKEPY IQVIAVITGA EG
|
| |