Gene ECH74115_2677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2677 
Symbol 
ID6968950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2518271 
End bp2519335 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content46% 
IMG OID643386539 
Producttransposase InsI for insertion sequence element IS30B/C/D 
Protein accessionYP_002271021 
Protein GI209397386 
COG category[L] Replication, recombination and repair 
COG ID[COG2826] Transposase and inactivated derivatives, IS30 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.151074 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0236339 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGGACC TCGGGGGAAA AACGTGTATT TTTAGTCATC CTGTTTACCT CTTTCTCAGG 
GAGTTTAGTC TCCAGGATTC CCGGGGCGGT TCACTGTCTG AGCGCGAGGA GATACGAGCT
GGTTTGTCAG CCAAAATGAG CATTCGTGCG ATAGCTACTG CGCTGAATCG CAGTCCTTCG
ACGATCTCAC GTGAAGTTCA GCGTAATCGG GGCAGACGCT ATTACAAAGC TGTTGATGCT
AATAACCGAG CCAACAGAAT GGCGAAAAGG CCAAAACCGT GCTTACTGGA TCAAAATTTA
CCATTGCGAA AGCTTGTTCT GGAAAAGCTG GAGATGAAAT GGTCTCCAGA GCAAATATCA
GGATGGTTAA GGCGAACAAA ACCACGTCAA AAAACGCTGC GAATATCACC TGAGACAATT
TATAAAACGC TGTACTTTCG TAGCCGTGAA GCGCTACACC ACCTGAATAT ACAGCATCTG
CGACGGTCGC ATAGCCTTCG CCATGGCAGG CGTCATACCC GCAAAGGCGA AAGAGGTACG
ATTAACATAG TGAACGGAAC ACCAATTCAC GAACGTTCCC GAAATATCGA TAACAGACGC
TCTCTAGGGC ATTGGGAGGG CGATTTAGTC TCAGGTACAA AAAACTCTCA TATAGCCACA
CTTGTAGACC GAAAATCACG TTATACGATC ATCCTCAGAC TCAGGGGCAA AGATTCTGTC
TCAGTAAATC AGGCTCTTAC CGACAAATTC CTGAGTTTAC CGTCAGAACT CAGAAAATCA
CTGACATGGG ACAGAGGAAT GGAACTGGCC AGACATCTAG AATTTACTGT CAGCACCGGC
GTTAAAGTTT ACTTCTGCGA TCCTCAGAGT CCTTGGCAGC GGGGAACAAA TGAGAACACA
AATGGGCTAA TTCGGCAGTA CTTTCCTAAA AAGACATGTC TTGCCCAATA TACTCAACAT
GAACTAGATC TGGTTGCTGC TCAGCTAAAC AACAGACCGA GAAAGACACT GAAGTTCAAA
ACACCGAAAG AGATAATTGA AAGGGGTGTT GCATTGACAG ATTGA
 
Protein sequence
MTDLGGKTCI FSHPVYLFLR EFSLQDSRGG SLSEREEIRA GLSAKMSIRA IATALNRSPS 
TISREVQRNR GRRYYKAVDA NNRANRMAKR PKPCLLDQNL PLRKLVLEKL EMKWSPEQIS
GWLRRTKPRQ KTLRISPETI YKTLYFRSRE ALHHLNIQHL RRSHSLRHGR RHTRKGERGT
INIVNGTPIH ERSRNIDNRR SLGHWEGDLV SGTKNSHIAT LVDRKSRYTI ILRLRGKDSV
SVNQALTDKF LSLPSELRKS LTWDRGMELA RHLEFTVSTG VKVYFCDPQS PWQRGTNENT
NGLIRQYFPK KTCLAQYTQH ELDLVAAQLN NRPRKTLKFK TPKEIIERGV ALTD