Gene EcE24377A_2910 details


Gene Information

Locus tag: EcE24377A_2910
Symbol:
ID: 5589342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2914257
End bp: 2915378
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 37%
IMG OID: 640926563
Product: putative type I restriction-modification system, S subunit
Protein accession: YP_001463945
Protein GI: 157159064
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
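
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1           # stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record above.
print(cds_lengths(2914257, 2915378))  # (1122, 373)
```

The result matches the reported gene length of 1122 bp and protein length of 373 aa.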


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCAGT GGATAAAAAC TAAACTTGGT GAGATTGTCA TTCTTAATTA TGGTAAAGCA 
TTAAAGGCAC AGGATAGGAA TGCTGGCTCT ATACCTGTAT ATAGTTCAGG TGGATTAACT
GGTTGGCATA ACAAAGCATT AATAAACGAA CAAGGAATTA TTATTGGTCG TAAAGGAACT
GTTGGGAAAG CGTACTTAAC TTATGGACCA TTTTGGTGTA TTGATACTGC TTATTATATT
TTACCTAACC CTTCGAAATA TGATTTTGTT TTTTTATTCT ATTTGTTGAA AACATTAGGG
CTGGAAGAAC TGAACGAAGA TAGCGCTGTT CCTGGACTAA ACAGGGATAC CGCTTATAGT
CAAGAGATCC TATTACCTTC ACTGCCAGAG CAAAAAACCA TCGCCTCTGT GCTGTCCTCT
CTTGATGACA AAATAGACTT GCTTCATCGC CAGAATAAAA CTCTGGAATC CATGGCCGAA
ACCCTGTTTA GGCAGTGGTT TATTTTAGAT AGTACTGGTG TATCAGTTAG TATAGATCAG
ATAATTGACT TTAATCCAAA GCGAACTTTA ATTAAAAGCC AAGACGCTAC TTATTTGGAC
ATGGCTGGAC TGAGCACTGT TATTTTTCGT GCAAATGGAT ATTACAGACG TCCATTTTCT
TCAGGAACTA AATTTACTAA ACGAGATACC TTGCTAGCCA GAATTACCCC ATGTTTAGAA
AATGGGAAAG CGGCCTATAT TGATTTTTTA GACGACAATG AAACTGGATG GGGTTCTACC
GAATTCATAG TTATGCGTCC TAAAAAAGAA ATCCATCCTT TTATCTCATA TATCATGTGC
CGAAACCCGG ACTTCAAAGA ATATGCAGAA AGCTGTATGG AAGGTTCAAC TGGCAGACAA
CGAGTTAACT TGGATCATCT TAAGAAGTTC AATGTTAATC TGCCAACAGA AGCTTCGCTA
CGTATAATTA ATGAGCTATT AGATTCATTT GAAAGTAAAC TTATTAACAA CTCAAAACAA
ATTGATAGCT TAGAAAAACT CCGCGATACC TTACTTCCCA AACTGATGAG CGGTGAAGTA
CGGGTTCAGT ATGCAGAAGA AGCAATCGCA TCAGTAGCAT AA
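
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length. The page does not state how it rounds, and the helper name `gc_content` is hypothetical.

```python
def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input to show the calculation: 2 of 4 bases are G or C.
print(gc_content("AT GC"))  # 50.0
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the reported 37%.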
 
Protein sequence
MSQWIKTKLG EIVILNYGKA LKAQDRNAGS IPVYSSGGLT GWHNKALINE QGIIIGRKGT 
VGKAYLTYGP FWCIDTAYYI LPNPSKYDFV FLFYLLKTLG LEELNEDSAV PGLNRDTAYS
QEILLPSLPE QKTIASVLSS LDDKIDLLHR QNKTLESMAE TLFRQWFILD STGVSVSIDQ
IIDFNPKRTL IKSQDATYLD MAGLSTVIFR ANGYYRRPFS SGTKFTKRDT LLARITPCLE
NGKAAYIDFL DDNETGWGST EFIVMRPKKE IHPFISYIMC RNPDFKEYAE SCMEGSTGRQ
RVNLDHLKKF NVNLPTEASL RIINELLDSF ESKLINNSKQ IDSLEKLRDT LLPKLMSGEV
RVQYAEEAIA SVA
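
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed in this record.
first_row = "ATGAGTCAGT GGATAAAAAC TAAACTTGGT GAGATTGTCA TTCTTAATTA TGGTAAAGCA"
print(translate(first_row))  # MSQWIKTKLGEIVILNYGKA
```

The output matches the first 20 residues of the protein sequence above.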