Gene ECH74115_5901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5901 
Symbol 
ID6970887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5553524 
End bp5555212 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content57% 
IMG OID643389516 
Productunknown domain/lipoate-protein ligase A fusion protein 
Protein accessionYP_002273907 
Protein GI209399317 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0095] Lipoate-protein ligase A
[COG3726] Uncharacterized membrane protein affecting hemolysin expression 
TIGRFAM ID[TIGR00545] lipoyltransferase and lipoate-protein ligase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGCA CAAAACTGAA ATTCCGGCTG CATCGGGCAG TGATTGTCCT GTTCTGTCTT 
GCCTTGTTAG TGGCACTGAT GCAGGGGGCG TCATGGTTTA GTCAAAACCA CCAGCGACAG
CGTAATCCAC AGCTGGAAGA ACTGGCCCGC ACCCTGGCGC GTCAGGTGAC GCTGAACGTT
GCACCGCTGA TGCGTACCGA CTCACCGGAT GAAAAACGCA TTCAGGCGAT CCTCGATCAG
TTAACGGATG AAAGCCGTAT CCTCGACGCG GGTGTGTATG ACGAACAAGG CGATCTTATC
GCACGTTCTG GCGAAAGCGT CGAAGTGCGC GATCGGCTGG CGCTCGACGG TAAAAAAGCA
GGCGGCTATT TTAACCAGCA GATTGTCGAG CCAATTGCCG GGAAAAACGG ACCGCTCGGC
TATCTGCGCC TGACACTCGA CACCCATACG CTCGCCACCG AAGCCCAACA GGTGGATAAC
ACCACTAACA TTTTACGCCT GATGTTGCTG CTCTCACTGG CAATCGGCGT AGTGCTGACC
CGCACGCTGC TACAGGGTAA ACGCACCCGC TGGCAGCAAT CGCCCTTCCT GTTAACCGCC
AGCAAACCGG TGCCGGAAGA GGAAGAAAGC GAGAAAAAAG AGTGCCCCAT TACTACAAGA
AAGGAAATCG TTATGTCCAC ATTACGCCTG CTCATCTCTG ACTCTTACGA CCCGTGGTTT
AACCTGGCGG TGGAAGAGTG TATTTTTCGC CAAATGCCCG CCACGCAGCG CGTTCTGTTT
CTCTGGCGCA ATGCCGACAC GGTAGTAATT GGTCGCGCGC AGAACCCGTG GAAAGAGTGT
AATACCCGGC GGATGGAAGA AGATAACGTC CGCCTGGCGC GGCGCAGTAG CGGTGGCGGC
GCGGTGTTCC ACGATCTCGG CAATACCTGC TTTACCTTTA TGGCTGGCAA GCCGGAGTAC
GATAAAACCA TCTCCACGTC GATTGTGCTC AATGCACTGA ATGCGCTCGG CGTCAGTGCC
GAAGCGTCCG GACGTAACGA TCTGGTGGTG AAAACCGCCG AAGGCGACCG CAAAGTCTCA
GGATCGGCCT ATCGCGAAAC CAAAGATCGT GGCTTCCACC ACGGCACCTT GCTGCTCAAT
GCCGACCTTA GCCGCCTGGC AAACTATCTC AATCCGGATA AAAAGAAACT GGCGGCGAAA
GGCATTACTT CAGTGCGTTC CCGCGTGACC AACCTCACCG AGCTGCTGCC GGGGATCCCC
CATGAGCAGG TTTGCGAGGC CATAACCGAG GCCTTTTTCG CCCATTATGG CGAGCGCGTG
GAAGCGGAAA TCATCTCCCC GGACAAAACG CCAGACTTGC CAAACTTCGC CGAAACCTTT
GCCCGCCAGA GTAGCTGGGA GTGGAACTTC GGTCAGGCTC CGGCATTCTC GCATCTGCTG
GATGAACGCT TTAGCTGGGG CGGCGTGGAA CTGCATTTCG ACGTTGAAAA AGGCCATATC
ACCCGCGCCC AGGTGTTTAC CGACAGCCTC AACCCCGCGC CGCTGGAAGC CCTCGCCGGG
CGACTGCAAG GCTGCCTGTA CCGCGCGGAT ATGCTGCAAC AAGAGTGCGA AGCGCTGTTG
GTTGACTTCC CGGACCAGGA AAAAGAGCTA CGGGAGTTGT CGACGTGGAT AGCGGGGGCG
GTAAGGTAA
 
Protein sequence
MARTKLKFRL HRAVIVLFCL ALLVALMQGA SWFSQNHQRQ RNPQLEELAR TLARQVTLNV 
APLMRTDSPD EKRIQAILDQ LTDESRILDA GVYDEQGDLI ARSGESVEVR DRLALDGKKA
GGYFNQQIVE PIAGKNGPLG YLRLTLDTHT LATEAQQVDN TTNILRLMLL LSLAIGVVLT
RTLLQGKRTR WQQSPFLLTA SKPVPEEEES EKKECPITTR KEIVMSTLRL LISDSYDPWF
NLAVEECIFR QMPATQRVLF LWRNADTVVI GRAQNPWKEC NTRRMEEDNV RLARRSSGGG
AVFHDLGNTC FTFMAGKPEY DKTISTSIVL NALNALGVSA EASGRNDLVV KTAEGDRKVS
GSAYRETKDR GFHHGTLLLN ADLSRLANYL NPDKKKLAAK GITSVRSRVT NLTELLPGIP
HEQVCEAITE AFFAHYGERV EAEIISPDKT PDLPNFAETF ARQSSWEWNF GQAPAFSHLL
DERFSWGGVE LHFDVEKGHI TRAQVFTDSL NPAPLEALAG RLQGCLYRAD MLQQECEALL
VDFPDQEKEL RELSTWIAGA VR