Gene EcolC_3885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3885 
Symbol 
ID6064350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4252940 
End bp4255798 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content50% 
IMG OID641603299 
Producthelicase domain-containing protein 
Protein accessionYP_001726814 
Protein GI170021860 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.929535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGC AAAATTATGC ACCTGGTATG CGGGTGGTTA TTCGTGATGC CGAATGGCGT 
ATTCGCCGGG CGGATGACAG TGGTGATGGT GGGTATTTGC TGACCTGCGA TGGTATTTCA
GAGCTTGTGC GCGGTAAAGA AGGGTTATTT CTGACCAAAC TGGAACAAAA AGTAGAGATC
CTTGATCCCG CAAAAACGCA TTTGGTGGAA GATGAATCCG CCAATTATCA GGCGGCACAG
TTGTATATTG AAAGTCAGTT ACGCCAACGC GTACCGACAG ACAGTAAAGT CCATTTTGGA
CATCTGGCGG CGATGGATTC CATGCCGTTT CAGCTCGATC CTACGCGGAT GGCGCTGGCA
CAACCGCGTC AGCGGATACT CATTGCCGAC GCTGTCGGTC TGGGTAAAAC GCTGGAAGCG
GGTATTCTGG TTTCGGAACT GATCCGTCGT GGGCGCGGAA AACGTATTCT GGTTCTGGCG
GTTAAATCGA TGCTGACACA ATTCCAGAAG GAGTTCTGGA GCCGTTTTGC GATCCCATTG
ACCCGCCTGG ATTCTGCTGG TTTACAAAAA GTGCGTAACC GCATCCCTAC CAACCACAAC
CCGTTCCACT ATTTTGACAA GACCATTATC TCCATCGATA CGCTGAAACA AGATATTGAA
TATCGTCACC ATCTGGAAAA TGCCTGGTGG GATATTATTG TCATTGATGA GGCCCATAAC
GTCGCAGAAC GTGGAACCAG TTCGTTACGC AGTAAACTGG CAAAACTGCT CGCCGGGCGT
AGCGATACGC TGATTATGCT TTCAGCGACG CCGCATGATG GTAAGGCTGA AAGTTTTGCC
AGCCTGATGA ACATGCTTGA CCCGACGGCG ATCGCCAACC CGAAAGAGTA TGAATACGCC
GACTTTGCCG ATAAAAATCT GGTTGTCCGT CGCTTCAAAA AAGACGTGAA AGATCAGATG
TCCGGCGAGT TCCCCGAGCG CAATATTGTC AAACTGACTC GCCTGGCATC TGGTGCCGAA
GAAGAAGCAT ACCGTCGCCT GGTTGAGAGC CAGTTCCGTG ATGATGACGA TGAACAGGCC
CAGTCTAACA AAGGTCGTCT GTTCAAGATA ACCCTCGAAA AAGCGCTATT TTCCAGCCCG
ATGGCGTGTG CCAGCGTAGT GGCGAATCGT CTGAAGCGTC TTGAGAGCCG TAAAGATCAT
AATAGCCAGA GCCAGATCAA CGAACTGGAG TCATTGCTGT TGGCGCTGAA TAACATTGAT
GCCAGCCAGT TTAGCAAATA CCAGTTGCTG CTCGACACCA TCCGAAAAGA CCTCGCCTGG
AAAGCCAATA ACACGGAAGA TCGCCTGGTG ATCTTCACCG AGAGCATTAA AACGCTGGAG
TTCCTCGAAC AGCAACTGCG AGCAGATCTG AAATTGAAAG ATGACCAGAT CGCCACGCTG
CGCGGCGATC AGGGCGATAC GGTGTTGATG AAAACCGTAG AAGACTTTGG TAAAACGCAG
TCGCCGTTGC GTCTTTTGGT TTGTTCGGAT GTGGCATCTG AAGGGATCAA CTTGCATCAC
CTTAGCCACA AAATGATTCA CTTTGATATT CCGTGGTCGT TGATGGTATT CCAGCAACGT
AACGGGCGTA TTGACCGCTA TGGGCAAAAA CATCAGCCTC AAATCCGCTA TCTGCTTACC
GAGGCCAGCG AACCACAGAT CAATGGCGAT ATGCGCGTAC TGGAAGTGTT GATCAACAAA
GACGAACAGG CGCAGAAGAA CATTGGGGAC AGTTCGGAGT TTACCGGCAA ATTTACCCAG
GAAGAGGAAG AAGAGCAGGT TGCAGAGTTC ATGATGCAGG ATGATGGTGC CAGCCTGTTT
GATCAACTGC TGAACAGCAA CGTCTCGGAA AGCGCCGAAC ACGATCTGTT TGGCGAAATA
TGCAGTGCGG TTTCCAGCGA TGCCTCAATG GTCATCGAAA CAGACACCAG CTTGTTTGCC
AGCGAACAGG CGTACTGTGA AAGAGCATTG GGTTACCTGA AAGCCAGTGG TCAAACTATC
CAGTATGAAA CCTTGCCTGA TAATACCTTG TCGCTGGTGG CACCGGAGGA GTTACGCCGC
CGCTTCAACC AGCTACCGCC TGAAATCGCC CCGGAGAACT GGCAGCTCTT TTTAAGTCAG
GATAAAACCG TCATCACCGA CGCGATTGCA CGCGCTCGCG GTGAGCAACA TGCCTGGCCC
GATGTGCAGT ATCTCTGGCA AATCAACCCG GTAGTGCAGT GGCTGGACGA TAAAATCTCT
TCGGCTTTTG GTCGTCATCA GGCACCGGTT ATCCGTCTGC CATACCTGCT TGAACCCGAT
GAAGATCACT TCATTCTTTC CGGGTTATTC CCGAACCGTA AATCACATCC GATGGTGAAC
CCGTGGATAG TGGTGAGCTT TAACCGTGAA TCGCTAATCG GCAGTCAGCC ATTCGCTGAG
TTTTTACAAC GTCATCCGCA GTTGAGTAAC AAGCTGACTA ACAGCGGCGG TAAAGATCGT
AATCACCAGC GCCAGCAGGA TTTACTGGAA GCGGCTATTG CGCATGCCCG GGAGGTATTC
ATCCATGATC GGAATGCGTT TGAAACGCAC ATTAACCAGC AACTGAATGA GCATCTGCAA
AAGCTGGACG TTTTGCGTGG GCGGCAGTTG AGCCAACTTG AGCTGGATTT TGCCGATAAC
AAACAGCAAT TGTCAGTTAA GCAGAGCCGT AAAGAGCAGA GACAACGCGA AATCGAACAC
AATTTTGACA GCTATATCGA ATGGATTGAG GACACCATGA CGACTGAAAA AGAACCCTAC
ATTCAGGTAA TTGCTGTTAT CACCGGAGCG GAGGGTTAA
 
Protein sequence
MNKQNYAPGM RVVIRDAEWR IRRADDSGDG GYLLTCDGIS ELVRGKEGLF LTKLEQKVEI 
LDPAKTHLVE DESANYQAAQ LYIESQLRQR VPTDSKVHFG HLAAMDSMPF QLDPTRMALA
QPRQRILIAD AVGLGKTLEA GILVSELIRR GRGKRILVLA VKSMLTQFQK EFWSRFAIPL
TRLDSAGLQK VRNRIPTNHN PFHYFDKTII SIDTLKQDIE YRHHLENAWW DIIVIDEAHN
VAERGTSSLR SKLAKLLAGR SDTLIMLSAT PHDGKAESFA SLMNMLDPTA IANPKEYEYA
DFADKNLVVR RFKKDVKDQM SGEFPERNIV KLTRLASGAE EEAYRRLVES QFRDDDDEQA
QSNKGRLFKI TLEKALFSSP MACASVVANR LKRLESRKDH NSQSQINELE SLLLALNNID
ASQFSKYQLL LDTIRKDLAW KANNTEDRLV IFTESIKTLE FLEQQLRADL KLKDDQIATL
RGDQGDTVLM KTVEDFGKTQ SPLRLLVCSD VASEGINLHH LSHKMIHFDI PWSLMVFQQR
NGRIDRYGQK HQPQIRYLLT EASEPQINGD MRVLEVLINK DEQAQKNIGD SSEFTGKFTQ
EEEEEQVAEF MMQDDGASLF DQLLNSNVSE SAEHDLFGEI CSAVSSDASM VIETDTSLFA
SEQAYCERAL GYLKASGQTI QYETLPDNTL SLVAPEELRR RFNQLPPEIA PENWQLFLSQ
DKTVITDAIA RARGEQHAWP DVQYLWQINP VVQWLDDKIS SAFGRHQAPV IRLPYLLEPD
EDHFILSGLF PNRKSHPMVN PWIVVSFNRE SLIGSQPFAE FLQRHPQLSN KLTNSGGKDR
NHQRQQDLLE AAIAHAREVF IHDRNAFETH INQQLNEHLQ KLDVLRGRQL SQLELDFADN
KQQLSVKQSR KEQRQREIEH NFDSYIEWIE DTMTTEKEPY IQVIAVITGA EG