Gene EcE24377A_0287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0287 
Symbol 
ID5589822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp309338 
End bp310753 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content41% 
IMG OID640924012 
Producttype I restriction modification DNA specificity domain-containing protein 
Protein accessionYP_001461441 
Protein GI157156744 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCTG AGCCAAAGGA ATACTGCCTT ACTGATATAT GTGATGATGT CAGCTACGGT 
TATACTGCCA GCGCAAATGA GCAATGCATT GGGCCGAAAT TCCTAAGGAT TACTGATATC
CAAGGGGGAC TATGCAATTG GAATGCAGTG CCTTATTGTA ATATTGATGC CAAAAACAAA
AGTAAGTACA ACCTTGAAAT TGGTGATATT GTTATTGCCA GAACGGGTAA TAGTACAGGT
GAAAATTATA TTATCCAAGA CGACATTGAC TCAGTTTTTG CATCATACTT GATACGCTAT
CGTATTAATA AATCAATCGC TGACCCTTAT TTCGTATGGT TGAATCTTAG AACTGATAAT
TGGTGGAGTT ATGTAAATGG AGCAAAAACA GGTTCCGCTC AAGCAGGTGC AAATGCAAAA
GTTTTAGGGA GTTACCCTTT ATCTCTGCCT TCACTTACTA GGCAAGTAGG TATTTCAAAA
CTATTTAAAA TTATTAATGG GAAAATTTTT GAAAATACAA AAATCAACCA AACCCTAGAA
CAAATGGCGC AAGCCCTGTT CAAAAGCTGG TTTGTTAATT TTGAACCCGT AAAGGCCAAA
ATGGCAGTGC TGGAAGCGGG CGGTTCGCAG GAAGACGCAA CGCTTGCCGC AATGACCGCC
ATTTCCGGGA AAAATGCGGA TGCGCTAGCG GTTTTTGAGC GTGAACATCC TGAGCAGTAT
GCCGAATTAA AAGCCACGGC AGAGCTGTTT CCGTTGGCGA TGCAGGACAG TGAGTTGGGG
GAAATTCCGG AAGGGTGGAC TCTATCTGAA ATTGGAGCGC AAATCGATAT TGCTGGTGGG
GCAACGCCAT CAACTAAAAC ACCTGATTTT TGGGATAATG GAGATATTCA CTGGACTACA
CCTAAAGATT TATCCAACGT TAAAGACAAA ATTTTACTGC ATACAGAAAG AAAAATAACA
AAAGCAGGAT TGGGCAAAAT TTCCTCTGGT TTACTACCTG TTAATACCGT GTTGATGTCA
TCACGTGCAC CTGTGGGATA CCTGGCAATT GCAAAAGTCC CTGTGGCGAT TAATCAGGGA
TATATTGCCA TGAAGTGTAA TAAAGAGTTA AGCCCTGAAT TCGTGTTACA GTGGTGTTCT
GCAAACATGC CAGAAATTAT ATCTCGCGCA AGTGGTACCA CCTTTGCTGA AATTAGTAAG
AAAAATTTCA ATCCAATTCC TCTTGTGAAG CCGCCACTTG AGCTTGTTAA AAATTATACC
AAGCAAGTAA GTGCAATCTA CTCGTTGATT GAAAATACTA TGCGGGAAAA TAATAGCCTA
ACCGAACTCC GCGACACCCT TCTCCCCAAA CTTCTCTCGG GTGAAATTAC CCTGCCGGAA
GCAGAACAGG CAGTCAGTGA GGTGGAAAAT GTATAA
 
Protein sequence
MSSEPKEYCL TDICDDVSYG YTASANEQCI GPKFLRITDI QGGLCNWNAV PYCNIDAKNK 
SKYNLEIGDI VIARTGNSTG ENYIIQDDID SVFASYLIRY RINKSIADPY FVWLNLRTDN
WWSYVNGAKT GSAQAGANAK VLGSYPLSLP SLTRQVGISK LFKIINGKIF ENTKINQTLE
QMAQALFKSW FVNFEPVKAK MAVLEAGGSQ EDATLAAMTA ISGKNADALA VFEREHPEQY
AELKATAELF PLAMQDSELG EIPEGWTLSE IGAQIDIAGG ATPSTKTPDF WDNGDIHWTT
PKDLSNVKDK ILLHTERKIT KAGLGKISSG LLPVNTVLMS SRAPVGYLAI AKVPVAINQG
YIAMKCNKEL SPEFVLQWCS ANMPEIISRA SGTTFAEISK KNFNPIPLVK PPLELVKNYT
KQVSAIYSLI ENTMRENNSL TELRDTLLPK LLSGEITLPE AEQAVSEVEN V