Gene ECH74115_2287 details


Gene Information

Locus tag: ECH74115_2287
Symbol:
ID: 6970575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2158749
End bp: 2160131
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 49%
IMG OID: 643386166
Product: exonuclease family protein
Protein accession: YP_002270650
Protein GI: 209397980
COG category:
COG ID:
TIGRFAM ID:
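
The start and end coordinates, gene length, and protein length listed above agree with one another, which a short check can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2158749, 2160131)
print(length, protein_length(length))  # prints: 1383 460
```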


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.118532
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000000000465845
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGAAACGG CAATTCCAGA CAACGAAAAA ACCGAATGCA AAGTGGAAGT CGAACCATCT 
GTAGAACGTG AGGGGCCGTT CTACTTCCTC TTCACCGACA AGGATGGCGA AAAATACGGT
CGCGCAAACA AACTTTCTGG TCTGGATAAG GCGCTGGCTG CCGGGGCTAC TGAAATCACA
AAAGAAGAAT ATTTTGCCCG AAAAAATGGC ACATACACAG GCTTACCGCA AAATGCAAAT
ACCGCACAAA ATTCTGAACA ACCAGAACCG GTAAAAGTTA CCGCTGACGA AGTAAAGAAA
ATTATGCAGG CAGCCAATAT CAGCCAGCCT GACGCCGAGG AACTGCTTGC AGTATCACGT
GGTGAATTTG TTGAAGGGAT TAGCGACCCG AATGATCCGA AATGGGTTAA GGGGATCCAG
ACCCGCGATT CTGTGAACCA GAACCAGCAA GAAACGGAAC AGAACGACCA GAAAGCGGAA
CAAAACAGCC CAAATACGCA ACAAAACGAG CCAGAAACGA AACAACCTGA ACCAGTAGTG
CAACAGGAAC CGGAAAAGAT CTGCACCGCC TGCGGTCAGA GCGGTGGCGG CAACTGCCCT
GATTGTGGTG CGGTGATGGG TGACGCAACA TACCAGGAAA TATTCGATGG AGAGAATCAG
CCTGAAGTTC AGGAAAATGA TCCGGAGGAA ATGGAAGGTA CTGCACATCA GCACAAGGAG
AACACTGGCG GCAATCAGCA TCATGCCAGC GATAGTGAAA CTGGCGAGGC GTCAGATCCC
TTAATTAAGG CGAACGGTCA TCATAATCTC ACATCCACCA GCAGAGCGGG GATTCATCTG
ATGATCGATC TTGAAACCAT GGGAAAAAAT CCCGATGCCC CGATTATCTC AATAGGTGCA
ATATTTTTCG ATCCGCAAAC CGGAGATATG GGACCGGAAT TTAGTAAGAC TATCGATCTG
GATACTGCTG GCGGAGTCAT TGATCGGGAC ACCATGAAAT GGTGGCTTAA ACAATCACGC
GAAGCGCAAT CTGCCATTAT GACCGATGAA ATCCCGTTAG ATGATGCACT GTTACAATTG
CGGGAATTTA TCGACGAAAA CTCCGGTGAA TTTTTTGTTC ATGTCTGGGG AAATGGAGCC
AACTTCGACA ACACGATTTT GCGCCGTTCA TACGAACGGC AGGGGAGCCC CTGCCCGTGG
CGTTACTACA ACGATCGCGA TGTACGCACA ATCGTTGAGC TGGGGAAAGC CATAGACTTC
GATGCCAGAA CGGCTATTCC ATTCGAAGGT GAGCGCCATA ATGCACTTGA TGACGCCCGT
TACCAGGCAA AATACGTTTC AGCTATCTGG CAAAAACTGA TCCCGAGTCA GGCTGATTCT
TAA
 
Protein sequence
METAIPDNEK TECKVEVEPS VEREGPFYFL FTDKDGEKYG RANKLSGLDK ALAAGATEIT 
KEEYFARKNG TYTGLPQNAN TAQNSEQPEP VKVTADEVKK IMQAANISQP DAEELLAVSR
GEFVEGISDP NDPKWVKGIQ TRDSVNQNQQ ETEQNDQKAE QNSPNTQQNE PETKQPEPVV
QQEPEKICTA CGQSGGGNCP DCGAVMGDAT YQEIFDGENQ PEVQENDPEE MEGTAHQHKE
NTGGNQHHAS DSETGEASDP LIKANGHHNL TSTSRAGIHL MIDLETMGKN PDAPIISIGA
IFFDPQTGDM GPEFSKTIDL DTAGGVIDRD TMKWWLKQSR EAQSAIMTDE IPLDDALLQL
REFIDENSGE FFVHVWGNGA NFDNTILRRS YERQGSPCPW RYYNDRDVRT IVELGKAIDF
DARTAIPFEG ERHNALDDAR YQAKYVSAIW QKLIPSQADS
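
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG is one of the alternative initiation codons and is read as methionine when it starts a CDS. The sketch below illustrates this on the first three 10-base blocks of the gene sequence. It is a hedged example, not the database's own method: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the input is assumed to begin at the annotated start codon, and the initiator set is the one NCBI lists for table 11.

```python
# Codon table built from the conventional TCAG ordering; table 11 assigns
# the same amino acids as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, assuming it starts at the annotated start codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_blocks = clean("GTGGAAACGG CAATTCCAGA CAACGAAAAA")
print(translate(first_blocks))  # prints: METAIPDNEK
```

The same helpers can be applied to the full gene sequence after pasting it into `clean`, and `gc_percent` gives a figure to compare with the GC content reported in the record.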