Gene ECH74115_4021 details

Gene Information

Locus tag: ECH74115_4021
Symbol:
ID: 6967326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3717172
End bp: 3718443
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 55%
IMG OID: 643387788
Product: pyridine nucleotide-disulphide oxidoreductase family protein
Protein accession: YP_002272231
Protein GI: 209397050
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
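
The length fields above can be cross-checked from the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Coordinates copied from the record above.
start_bp = 3717172
end_bp = 3718443

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the record.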


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value: 0.953433
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGACG ACTGCGACAT TATTATTATT GGTGCCGGTA TTGCAGGCAC CGCTTGCGCG 
TTACGCTGCG CGCGAGCGGG TTTATCCGTT TTGTTACTGG AACGCGCTGA AATCCCCGGC
AGCAAAAATC TTTCCGGCGG GCGGTTATAT ACCCATGCAC TCGCGGAACT CCTCCCTCAA
TTTCATCTGA CCGCGCCTCT TGAACGACGC ATCACTCACG AAAGCCTTTC CCTGTTAACG
CCGGATTGCG CAACGACGTT TTCCAGCTTA CAGCCCGGCG GTGAATCCTG GAGTGTATTA
CGTGCACGGT TCGATCCGTG GCTGGTTGCC GAAGCCGAAA AAGAAGGTGT CGAATGCATC
CCTGGTGCGA CGGTGGATGC ACTGTATGAA GAAAACGGCA GGGTGTGTGG TGTCATTTGT
GGTGACGATA TTCTCCGCGC CCGTTATGTG GTGCTGGCAG AAGGTGCCAA CAGCGTCCTG
GCTGAACGTC ACGGGTTAGT GACTCGTCCT GCTGGCGAAG CGATGGCGTT GGGGATCAAA
GAAGTGCTGT CGCTGGAAAC ATCCGCTATT GAAGAACGTT TTCATCTGGA GAATAACGAA
GGCGCAGCGT TGCTGTTCAG CGGCGGAATC TGTGATGACT TACCCGGCGG CGCATTTCTT
TATACTAATC AACAAACGCT CTCGTTAGGG ATTGTTTGCC CACTCTCTTC CCTTACGCAA
AGTCGTGTTC CGGCAAGCGA GCTGCTTGCT CGCTTTAAAA CGCATCCGGC AGTGCGCCCG
CTTATCAAAA ACACGGAATC ACTGGAGTAT GGTGCGCATC TGGTGCCAGA AGGTGGCTTG
CACAGTATGC CGGTACAATA CGCCGGTAAC GGCTGGCTGC TGGTGGGCGA TGCGTTGCGC
AGTTGCGTCA ATACCGGAAT TTCTGTGCGC GGCATGGATA CGGCGCTAAC TGGCGCGCAG
GCGGCGGCAC AAACACTGAT AAGCGCCTGC CAGCACCGCG AGCCGCAAAA TCTGTTTCCG
CTTTATCATC ACAACGTAGA GCGCAGCCTG CTGTGGGATG TTCTACAGCG TTATCAGCAT
GTTCCGGTGC TTTTGCAACG CCCGGGATGG TACCGTACGT GGCCTGCGTT AATGCAGGAT
ATTTCCCGCG ATTTATGGGA TCAGGGTGAT AAACCTGTTC CACCGCTGCG CCAGTTACTC
TGGCGTCATT TACGTCGTCA TGGCTTGTGG AATCTGGCGG GCGATGTTAT CAGGAGTGTT
CGATGTCTGT AG
 
Protein sequence
MEDDCDIIII GAGIAGTACA LRCARAGLSV LLLERAEIPG SKNLSGGRLY THALAELLPQ 
FHLTAPLERR ITHESLSLLT PDCATTFSSL QPGGESWSVL RARFDPWLVA EAEKEGVECI
PGATVDALYE ENGRVCGVIC GDDILRARYV VLAEGANSVL AERHGLVTRP AGEAMALGIK
EVLSLETSAI EERFHLENNE GAALLFSGGI CDDLPGGAFL YTNQQTLSLG IVCPLSSLTQ
SRVPASELLA RFKTHPAVRP LIKNTESLEY GAHLVPEGGL HSMPVQYAGN GWLLVGDALR
SCVNTGISVR GMDTALTGAQ AAAQTLISAC QHREPQNLFP LYHHNVERSL LWDVLQRYQH
VPVLLQRPGW YRTWPALMQD ISRDLWDQGD KPVPPLRQLL WRHLRRHGLW NLAGDVIRSV
RCL
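
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. This is a minimal example, not the database's own method: it builds the codon table from the standard 64-letter amino-acid string (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons), strips the spaces used to group the sequence into blocks of ten, and translates until the first stop codon. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = clean(
    "ATGGAAGACG ACTGCGACAT TATTATTATT GGTGCCGGTA TTGCAGGCAC CGCTTGCGCG"
)
last_line = clean("CGATGTCTGT AG")

first_protein = translate(first_line)
last_protein = translate(last_line)
first_gc = gc_content(first_line)

print(first_protein)
print(last_protein)
```

Applying `clean` and `translate` to the full gene sequence in the same way should reproduce the full protein sequence, and `gc_content` on the full sequence can be compared against the GC content field. The GC fraction of the first line alone is not expected to match the whole-gene figure.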