Gene EcSMS35_4828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4828 
Symbol 
ID6146965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4922866 
End bp4925724 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content50% 
IMG OID641619632 
Producthelicase family protein 
Protein accessionYP_001746739 
Protein GI170683792 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGC AAAATTATGC ACCTGGTATG CGGGTGGTTA TTCGTGATGC CGAATGGCGT 
ATTCGCCGGG CGGATGACAG TGGTGATGGT GGGTATTTGC TGACCTGCGA TGGTATTTCA
GAGCTTGTGC GCGGTAAAGA AGGGTTATTT CTGACCAAAC TGGAACAAAA AGTAGAGATC
CTGGATCCCG CAAAAACGCA TTTGGTGGAA GATGAATCCG CCAATTATCA GGCGGCGCAG
TTGTATATCG AAAGTCAGTT ACGCCAACGC GTACCGACAG ACAGTAAAGT CCATTTTGGT
CATCTGGCGG CGATGGATTC CATGCCGTTT CAGCTCGACC CTACGCGGAT GGCTCTGGCA
CAGCCGCGCC AGCGGATACT CATTGCCGAC GCTGTCGGTC TGGGTAAAAC GCTGGAAGCC
GGTATTCTGG TTTCGGAGCT GATCCGTCGT GGGCGCGGAA AACGTATTCT GGTTCTGGCG
GTTAAATCGA TGCTGACACA ATTCCAGAAG GAGTTCTGGA GCCGTTTTGC GATCCCATTG
ACCCGCCTGG ATTCTGCTGG TTTACAAAAG GTGCGTAACC GCATCCCTAC CAACCACAAC
CCGTTTCACT ATTTTGACAA GACCATTATC TCCATCGATA CGCTGAAACA AGACATTGAA
TATCGTCACC ATCTGGAAAA TGCCTGGTGG GATATTATTG TCATTGATGA GGCCCATAAC
GTCGCAGAAC GTGGAACCAG TTCGTTACGC AGTAAACTGG CAAAACTGCT CGCCGGGCGT
AGCGATACGC TGATTATGCT TTCAGCGACG CCGCATGATG GTAAGGCTGA AAGTTTTGCC
AGCCTGATGA ACATGCTTGA CCCGACGGCG ATCGCCAACC CGAAAGAGTA TGAATACGCC
GACTTTGCCG ATAAAAATCT GGTTGTCCGT CGCTTCAAAA AAGACGTGAA AGATCAGATG
TCCGGCGAGT TCCCCGAGCG CAATATTGTC AAACTGACTC GCCTGGCATC CGGTGCCGAA
GAAGAAGCAT ACCGTCGCCT GGTTGAGAGT CAGTTCCGTG ATGATGACGA TGAACAGGCC
CAGTCTAACA AAGGTCGTCT GTTCAAGATA ACCCTCGAAA AAGCGCTATT TTCCAGCCCG
ATGGCGTGTG CCAGCGTAGT GGCTAATCGT CTGAAGCGTC TTGAGAGCCG TAAAGATCAT
AATAGCCAGA GCCAGATCAA CGAACTGGAG TCATTGCTGT TGGCGCTGAA TAACATTGAT
GCCAGCCAGT TTAGCAAATA CCAGTTGCTG CTCGACACCA TCCGAAAAGA CCTCGCCTGG
AAAGCCAATA ACACGGAAGA TCGCCTGGTG ATCTTCACTG AAAGCATTAA AACGCTGGAG
TTCCTCGAAC AGCAACTGCG AGCGGATCTG AAATTGAAAG ATGATCAGAT CGCCACGCTG
CGCGGCGATC AGGGCGATAC GGTGTTGATG GAAACCGTAG AAGCCTTTGG TAAAACGCAG
TCGCCGTTGC GTCTTCTGGT TTGTTCGGAT GTGGCATCTG AAGGGATCAA CCTGCATCAC
CTTAGCCACA AAATGATTCA CTTTGATATT CCGTGGTCGT TGATGGTATT CCAGCAACGT
AACGGGCGTA TTGACCGCTA TGGGCAAAAA CATCAGCCTC AAATCCGCTA TCTGCTTACC
GAGGCCAGCG AACCGCAGAT CAATGGCGAT ATGCGCGTAC TGGAAGTGTT GATCAACAAA
GACGAACAGG CGCAGAAGAA CATTGGGGAC AGTTCGGAGT TTACCGGCAA ATTTACCCAG
GAAGAGGAAG AAGAGCAGGT TGCAGAGTTC ATGATGCAGG ATGATGGTGC CAGCCTGTTT
GATCAACTGC TGAACAGCAA CGTCTCGGAA AGCGCCGAAC ATGATCTGTT TGGCGAAATA
TGCAGTGCGG TTTCCAGCGA TGCCTCAATG GTCACCGAAA CAGACACCAG CTTGTTTGCC
AGCGAACAGG CGTACTGCGA AAGAGCATTG GGTTACCTGA AAGCCAGTGG TCAAACTATC
CAGTATGAAA CCTTGCCTGA TAATACCTTG TCGCTGGTGG CGCCGGAGGA GTTACGCCGC
CGCTTCAATC AGCTACCGCC TGAAATCGCG CCGGAGAACT GGCAGCTCTA TTTAAGTCAG
GATAAAACCG TCATCACCGA CGCGATTGCA CGCGCTCGCG GTGAGCAACA TGCCTGGCCC
GATGTGCAGT ATCTCTGGCA AATCAACCCG GTAGTGCAGT GGCTGGACGA TAAAATCTCT
TCGGCTTTTG GTCGTCATCA GGCACCGGTT ATCCGTCTGC CATACCTGCT TGAACCCGAT
GAAGATCACT TCATTCTTTC CGGGTTATTC CCGAACCGTA AATCACATCC GATGGTGAAC
CCGTGGATAG TGGTGAGCTT TAACCGTGAA TCGCTAATCG GCAGTCAGCC ATTTGCTGAG
TTTTTACAAC GTCATCCGCA GTTGAGTAAC AAGCTGACTA ACAGCGGCGG TAAAGATCGT
AATCACCAGC GCCAGCAGGA TTTACTGGAA GCGGCTATTG CTCATGCCCG GGAGGTATTC
ATCCATGATC GGAATGCGTT TGAAACGCAC ATTAACCAGC AACTGAATGA GCATCTGCAA
AAGCTGGACG TTTTGCGTGG GCGGCAGTTG AGCCAACTTG AGCTGGATTT TGCCGATAAC
AAACAGCAAT TGTCAGTTAA GCAGAGCCGT AAAGAGCAGA GACAACGCGA AATCGAACAC
AATTTTGACA GCTATATCGA ATGGATTGAG GACACCATGA CGACTGAAAA AGAACCCTAC
ATTCAGGTAA TTGCTGTTAT CACCGGAGCG GAGGGTTAA
 
Protein sequence
MNKQNYAPGM RVVIRDAEWR IRRADDSGDG GYLLTCDGIS ELVRGKEGLF LTKLEQKVEI 
LDPAKTHLVE DESANYQAAQ LYIESQLRQR VPTDSKVHFG HLAAMDSMPF QLDPTRMALA
QPRQRILIAD AVGLGKTLEA GILVSELIRR GRGKRILVLA VKSMLTQFQK EFWSRFAIPL
TRLDSAGLQK VRNRIPTNHN PFHYFDKTII SIDTLKQDIE YRHHLENAWW DIIVIDEAHN
VAERGTSSLR SKLAKLLAGR SDTLIMLSAT PHDGKAESFA SLMNMLDPTA IANPKEYEYA
DFADKNLVVR RFKKDVKDQM SGEFPERNIV KLTRLASGAE EEAYRRLVES QFRDDDDEQA
QSNKGRLFKI TLEKALFSSP MACASVVANR LKRLESRKDH NSQSQINELE SLLLALNNID
ASQFSKYQLL LDTIRKDLAW KANNTEDRLV IFTESIKTLE FLEQQLRADL KLKDDQIATL
RGDQGDTVLM ETVEAFGKTQ SPLRLLVCSD VASEGINLHH LSHKMIHFDI PWSLMVFQQR
NGRIDRYGQK HQPQIRYLLT EASEPQINGD MRVLEVLINK DEQAQKNIGD SSEFTGKFTQ
EEEEEQVAEF MMQDDGASLF DQLLNSNVSE SAEHDLFGEI CSAVSSDASM VTETDTSLFA
SEQAYCERAL GYLKASGQTI QYETLPDNTL SLVAPEELRR RFNQLPPEIA PENWQLYLSQ
DKTVITDAIA RARGEQHAWP DVQYLWQINP VVQWLDDKIS SAFGRHQAPV IRLPYLLEPD
EDHFILSGLF PNRKSHPMVN PWIVVSFNRE SLIGSQPFAE FLQRHPQLSN KLTNSGGKDR
NHQRQQDLLE AAIAHAREVF IHDRNAFETH INQQLNEHLQ KLDVLRGRQL SQLELDFADN
KQQLSVKQSR KEQRQREIEH NFDSYIEWIE DTMTTEKEPY IQVIAVITGA EG