Gene EcHS_A0330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0330 
Symbol 
ID5590907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp335287 
End bp336495 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content33% 
IMG OID640919516 
ProductDNA methylase family protein 
Protein accessionYP_001457102 
Protein GI157159784 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.00332873 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGTA ACTATATGCC ATTCACACAA CGAGATAGTC TCGGAAGATA CTATACAAAA 
GAATCCATAA GTGCATTATT GGTTTCTCAG ATGAAAGCTG AAAAAGTCAA TAATATTATT
GATTTAGCTT CTGGTGAGGG GAGCTTAACC TATGCTGCTT TAGACCGATG GAAGAATGCA
GAAGCATATT CTTTGGATAT AGAATCTCGG ATGTCAAAAA AAGTATGTGA TAATCTTACT
CATATTGTAA CAGATGCCCT TGTTCATTCA TTTCCTGAAA TGCTTGCCCG ACATCAAGGA
AATTTTGATG TTGCAGTATG TAATCCTCCA TTCACTCTCC CTGAATGGAG GGATGATTAT
TTTAAAATCA TTAGTGAGAT TGGCGCAGAT AAATATATAT CTGTCTCAAA ATATGTCCCA
GCAGAGATAA TATTTATATC TCAAGTCATC AGGTTTCTTA AAAAGGGCGG TGAGGCTGGA
ATAATTTTAC CTGATGGTAT ATTTACAGCA AGAAAGTTTA TAGGTTTAAG ACGATATTTA
TTGAATGAAC ATTCAATTAC AAAAGTCATT GAATTGCCTA GGAATATCTT CAAAAGGACA
GAGGCTAAAA CACATATTTT AATTTTTAAT AAAAAAATTA TGCCTCATCA TAAAATACAA
TTACATTGTA TAACTAAAGA TGGGGAATTG TCGCCTCCTG TTTTAATTAG AAAAGAAGAT
GCGGTTGAGA GAATGGACTA CTCTTATCAT TATAATAAAA ATGAAGGTAA AGGGTTTAGC
ACAATAGGGA TGCTTAAAAA TATTTCAATT TTTAGAGGTA GGTTTAATTC AAAGGAAATT
ACGGAACATG TTTTTCATAC GACAAAATTT AGTGGTGATG AAAAGTACAT TAAATTCCAC
TGCAACTCTG TAGAAGAATT GAAGCCATCA AAATTAGATG TCATTGCTAA GCCTGGTGAT
ATATTAATAG CAAGAGTTGG GCGAAATTTT CATAAAAAAA TATTGTTTGT TGAGAGCGGC
TATTCTTATA TCAGTGACTG TATTTTTCTG ATACGAGCCT CTGGTGGAGA TAAGAAAAAA
CTATTTGATT TTCTTTGTTC TCAAGATGGG CAAGAGGAAT TATCTCGAGC GAGTAGTGGT
GTAGCCGCAC AACATATTAC AATGGATGCA TTAAAAAAAA TACATCTTGT AAGGATTAAA
CATGACTGA
 
Protein sequence
MNSNYMPFTQ RDSLGRYYTK ESISALLVSQ MKAEKVNNII DLASGEGSLT YAALDRWKNA 
EAYSLDIESR MSKKVCDNLT HIVTDALVHS FPEMLARHQG NFDVAVCNPP FTLPEWRDDY
FKIISEIGAD KYISVSKYVP AEIIFISQVI RFLKKGGEAG IILPDGIFTA RKFIGLRRYL
LNEHSITKVI ELPRNIFKRT EAKTHILIFN KKIMPHHKIQ LHCITKDGEL SPPVLIRKED
AVERMDYSYH YNKNEGKGFS TIGMLKNISI FRGRFNSKEI TEHVFHTTKF SGDEKYIKFH
CNSVEELKPS KLDVIAKPGD ILIARVGRNF HKKILFVESG YSYISDCIFL IRASGGDKKK
LFDFLCSQDG QEELSRASSG VAAQHITMDA LKKIHLVRIK HD