Gene EcE24377A_4903 details

Gene Information

Locus tag: EcE24377A_4903
Symbol:
ID: 5586601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4887680
End bp: 4890571
Gene Length: 2892 bp
Protein Length: 963 aa
Translation table: 11
GC content: 44%
IMG OID: 640928504
Product: type III restriction enzyme, res subunit
Protein accession: YP_001465831
Protein GI: 157158933
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
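
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(4887680, 4890571)
    print(length, protein_length(length))
```

Under those assumptions the values agree with the record: 2892 bp corresponds to 964 codons, one of which is the stop codon, leaving 963 residues.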


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.891445
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGGTA CAACACCTTA TCATGCGAAA TATTTTGCGC ATGAACTTTC CATTCTCCAT 
TCAAACAATG GGGTGGACCG TCTGTCTCAA TCGCTGTTTG ATGCAAGCGT AGATCTAAAT
CCTCATCAAA TTGAGGCGGC TCTTTTTGCT ATTGAAAATC CGCTAAGCAA AGGTGTTGTG
CTTGCCGATG AAGTTGGTTT AGGTAAAACC ATTGAAGCGG GGTTAGTGCT TTGCCAGTTG
TGGGCTGAAA GAAAACGTAA ACTGTTGATC ATTTGTCCTG CTTCACTGCG TCGCCAATGG
GCGAGCGAGT TACAGGAAAA GTTCAATCTA CCTTCTCAGG TGTTGGACGC GAAAACGTAT
AGCCAGCTTC AAAAAGAAGG GATCCATAAT CCCTTAAATA ATAAATCGAT CACTATCATG
TCATACCACT ATGCTGCTCG GTTAGAAGAA AAATTGGTGG CTGAAATGTG GGATCTGGTT
GTTATCGACG AAGCCCATAA ACTGCGTAAT GCTCATCGAG AAAGTAACAA AATGGGGCAA
GCGTTAAAGC GTGCGCTTGA TGGGCGGAAA AAATTACTAC TGACCGCAAC GCCTTTACAG
AATTCGTTAA TGGAACTGTA TGGCATGTCG ACGCTTATAG ATGAACATAC CTTCGGCGAA
GTTAAAGCGT TCAGAAAGCA ATATATGCAG GCAGATAGCG ATATTGCGGA GTTGAAGGGG
CGATTGTCGC GTTTTATCAA ACGAACTTTA CGCAAGAATG TGCTGGAGTA CATCAAGTAT
ACAGAACGTA AAGCGATCAC TATTCCTTTC TATCCGAGCG AACAGGAGCA GGATCTTTAT
GAACGCGTAC AAAAATTGCT TGAACGTGAG GATAGCTATG CCTTACCAAA ACGACACCGG
CATCTGACGG GGCTTATTCT CAGAAAGCTA CTTTCCTCAA GTACGAAAGC TGTATTGAAT
AACTTGCAGA TCCTTAAGAG TCGTTTAGAA AGGTTAAAGC TGGAAGGTAT CGTTGAAGAT
GATATGAACA TCATCCAACA GATCATAATG GATGACGATC TTGAAGATGA CATCGTGGAA
GATGCCGAAT CAGTGGCTTT AGATACCGAG TGCAAAGTTG TCGATTCAGA CGCGCTTCAG
GCAGAAATCA ATGAGCTGGA ATCACTCATT GTCAAAGCAG AACAGATTGG CACTGATACC
AAATCAAAAG AGTTGCTTTC TGGCCTGGAG CAAGGTTTTG CACAGCTTGC CGAAATGGGG
GCTGCAAAAA AAGTCATCAT CTTCACGGAG TCAATGCGAA CTCAACAGTA TCTTGCACAT
TTTCTGGAAA ACAATGGCTA CCAGGGGAAA GTAGTAACCT TTAGTGGCAC AAACAATACA
CCGCAGGCAA ACAAGATCTA CCAGCAATGG CGCGAGGAAT ATCAGGGATC TTCTCGTATT
ACCGGGTCGG CGCAAATTGA TAAACGCTCG GCTTTGATCG ACCACTTCAA AGATCATGCT
GAAATCATGA TAGCTACAGA AGCTGCAGCT GAAGGGGTGA ACCTGCAATT TTGTTCGCTA
CTGATCAATT ACGATTTACC CTGGAACCCA CAACGCATTG AACAGCGTAT TGGACGTTGT
CACCGATACG GGCAAAAATT TGATGTGGTT GTTATTAACT TCCTCAATCA GCGAAACCAG
GCCGATCAAC GCGTTCTTGA ACTCTTAACT GAAAAGTTCA GCCTGTTTGA TGGCGTGTTT
GGCGCATCGG ATAGTGTGCT TGGCAGCATA GAAAATGGTG TGGATTTTGA AAAAAGGATC
CAGTCAATTT ATGAGTCATG CCGTACCCCC GAGGAGATAG AGCTGGCTTT TGAAAGGTTG
CAGAAAGAGC TGGAAGATGA AATCAATCTC AAGATGCAGA AGACGCAAGA ACAGTTGCTT
GAGCATTTTG ATGAAGATAT TCACGATATC CTTAAAATAA GGCTTGATGA AGCTCGGGAG
CGTCTGGATA AAGTCGGTCG TTGGTTTTGG GCCGTGACCT GTCAGCAGCT CAAACAATAT
GCTGATTTTG ATCATGAGCA TCATCGTTTC ACTTTACGTG AATCAGTAAA TGAACTGCCT
GTCGGGCAGT ACCAGCTGTT GCGCAAAGAT GGTAGGCAAA ATAGTGCGTT ACAAATGCAG
GATCACCACA GTTTTGCATA TCGGATCAAT CATCCGCTGG GTGAGTACGT CATTGAGGGG
GCTAAGGTCC AGGAGACGAA AACAGCTGAA GTCATCTTCG ATTACAGTAA CCACACAACG
AAAGTCTCGG TGGCTGAGCA GCTGAAAGGG CAATCAGGGT GGTTAACACT AAATTTGCTT
ACTGTTGATG CGTTTAACAG GCAGCAATTT TTAGTCTTTA CCGCAAAAAC TGACAACGGA
CGTGTGGTAG ATGGTGAAGC TTGTAAACGT CTGTTCAACT TGTCTGCAAT AACCAACGAA
AAGCTGGATA ATTCGGCAAG TTCAGATGCA TTACCTGATG ACTTGATCTT ACTCAAGAAC
AGGCAAATTG ACGCCAGACT TGCGGAAGTT CTGGAGCAGA ACAATGTCCT TTTTGAAGCT
GAAAGAGATA AGCTGGAAAA ATGGGCTGAA GATATGATCT TTGCGGCGGA AGAAGCATTA
CGCGATACCA AAATGCAAAT CAAGTCGTTA AAGCGAGAAG CCAGACTGGC GCAATCGATG
GAAGAACAAA AGCAAAACCA GGAAAAGCTG AAGCTACTGG AACGGCAGCA GAAACGCCAA
CGTATGGAAA TTTTTGACAT TGAAGATGAG ATCGCTGACA AGCGGGATGA ACTCATCAGT
GCCCTTGAAG AGCGGATGAA GCAAAAAACG GATATCACCG AGCTTTTTTC CATCCGCTGG
AAAGTTATTT AA
 
Protein sequence
MIGTTPYHAK YFAHELSILH SNNGVDRLSQ SLFDASVDLN PHQIEAALFA IENPLSKGVV 
LADEVGLGKT IEAGLVLCQL WAERKRKLLI ICPASLRRQW ASELQEKFNL PSQVLDAKTY
SQLQKEGIHN PLNNKSITIM SYHYAARLEE KLVAEMWDLV VIDEAHKLRN AHRESNKMGQ
ALKRALDGRK KLLLTATPLQ NSLMELYGMS TLIDEHTFGE VKAFRKQYMQ ADSDIAELKG
RLSRFIKRTL RKNVLEYIKY TERKAITIPF YPSEQEQDLY ERVQKLLERE DSYALPKRHR
HLTGLILRKL LSSSTKAVLN NLQILKSRLE RLKLEGIVED DMNIIQQIIM DDDLEDDIVE
DAESVALDTE CKVVDSDALQ AEINELESLI VKAEQIGTDT KSKELLSGLE QGFAQLAEMG
AAKKVIIFTE SMRTQQYLAH FLENNGYQGK VVTFSGTNNT PQANKIYQQW REEYQGSSRI
TGSAQIDKRS ALIDHFKDHA EIMIATEAAA EGVNLQFCSL LINYDLPWNP QRIEQRIGRC
HRYGQKFDVV VINFLNQRNQ ADQRVLELLT EKFSLFDGVF GASDSVLGSI ENGVDFEKRI
QSIYESCRTP EEIELAFERL QKELEDEINL KMQKTQEQLL EHFDEDIHDI LKIRLDEARE
RLDKVGRWFW AVTCQQLKQY ADFDHEHHRF TLRESVNELP VGQYQLLRKD GRQNSALQMQ
DHHSFAYRIN HPLGEYVIEG AKVQETKTAE VIFDYSNHTT KVSVAEQLKG QSGWLTLNLL
TVDAFNRQQF LVFTAKTDNG RVVDGEACKR LFNLSAITNE KLDNSASSDA LPDDLILLKN
RQIDARLAEV LEQNNVLFEA ERDKLEKWAE DMIFAAEEAL RDTKMQIKSL KREARLAQSM
EEQKQNQEKL KLLERQQKRQ RMEIFDIEDE IADKRDELIS ALEERMKQKT DITELFSIRW
KVI
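
The relationship between the gene sequence and the protein sequence can be checked by translating codons. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns sense codons the same way as the standard genetic code (the tables differ in which start codons are permitted), and that the spaces and line breaks in the sequence blocks are formatting only. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 shares these sense-codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGATTGGTA CAACACCTTA TCATGCGAAA"))
```

Applied to the first three 10-base groups of the gene sequence, `translate` yields `MIGTTPYHAK`, which matches the first block of the protein sequence. The same `clean` and `gc_percent` helpers could be run on the full gene sequence to compare against the GC content reported in the record.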