Gene Information |
Locus tag | ECH74115_5561 |
Symbol | uvrA |
ID | 6970559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5199396 |
End bp | 5202218 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643389201 |
Product | excinuclease ABC subunit A |
Protein accession | YP_002273598 |
Protein GI | 209398765 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
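The length fields above can be cross-checked from the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed 2823 bp, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
# Values copied from the Gene Information table.
START_BP = 5199396
END_BP = 5202218


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Codon count minus one stop codon, for an unspliced CDS."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2823
print(protein_length_aa(length))  # 940
```

Both results agree with the Gene Length and Protein Length fields. The strand ("-") does not affect this arithmetic, since the coordinates are given in ascending order on the replicon.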
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.543966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAACAT CAACCTCGTT ATCCCCCGCG ACAAGCTCAT TGTCGTGACC GGGCTTTCGG GTTCTGGCAA ATCCTCGCTC GCTTTCGACA CCTTATATGC CGAAGGGCAG CGCCGTTACG TTGAATCCCT TTCCGCCTAC GCGCGGCAGT TTCTGTCACT GATGGAAAAG CCGGACGTCG ATCATATTGA GGGGCTTTCT CCTGCCATCT CAATTGAGCA GAAATCGACG TCTCATAACC CGCGTTCTAC GGTGGGGACA ATCACCGAAA TCCACGACTA TTTGCGTTTG TTGTACGCCC GCGTCGGCGA ACCGCGCTGT CCGGACCACG ACGTCCCGCT GGCGGCGCAA ACCGTCAGCC AGATGGTGGA TAACGTGCTG TCGCAGCCGG AAGGCAAGCG TCTGATGCTG CTCGCGCCAA TCATTAAAGA GCGCAAAGGC GAACACACCA AAACGCTGGA GAACCTGGCA AGCCAGGGTT ACATCCGTGC TCGTATTGAT GGCGAAGTCT GCGATCTTTC CGATCCGCCG AAACTGGAAC TGCAAAAGAA ACACACTATT GAAGTGGTGG TTGATCGCTT CAAGGTGCGT GACGATCTTA CCCAACGTCT TGCCGAGTCG TTTGAAACCG CGCTGGAGCT TTCCGGTGGT ACCGCGGTAG TGGCGGATAT GGACGACCCG AAAGCGGAAG AGCTGCTGTT TTCCGCCAAC TTCGCCTGCC CAATTTGCGG CTACAGTATG CGTGAACTGG AGCCGCGACT GTTTTCGTTT AACAACCCGG CAGGTGCCTG CCCGACCTGC GACGGCCTTG GCGTACAGCA ATATTTCGAT CCTGACCGCG TGATCCAGAA TCCGGAACTG TCGCTGGCAG GTGGTGCGAT CCGTGGCTGG GATCGCCGCA ACTTCTATTA TTTCCAGATG CTGAAATCGC TGGCAGATCA CTATAAGTTC GACGTCGAAG CGCCGTGGGG CAGCCTGAGC GCGAACGTGC ATAAAGTGGT GTTGTACGGT TCTGGCAAAG AAAACATTGA ATTCAAATAC ATGAACGATC GTGGCGATAC CTCCATCCGT CGTCATCCGT TCGAAGGCGT GCTGCACAAT ATGGAGCGCC GTTATAAAGA GACAGAATCC AGTGCGGTAC GTGAAGAATT AGCCAAGTTC ATCAGCAATC GCCCGTGCGC CAGCTGCGAA GGGACGCGTC TGCGTCGGGA AGCGCGCCAC GTGTATGTCG AGAATACGCC GCTGCCCGCC ATCTCCGACA TGAGCATTGG TCATGCGATG GAATTCTTCA ACAATCTCAA ACTCGCAGGT CAGCGGGCGA AGATTGCGGA AAAAATTCTT AAAGAGATCG GCGATCGTTT GAAATTCCTC GTTAACGTCG GCCTGAATTA CCTGACGCTT TCCCGCTCGG CAGAAACGCT TTCCGGCGGT GAAGCCCAGC GTATCCGTCT GGCGAGCCAG ATTGGTGCGG GCCTGGTTGG CGTTATGTAC GTGCTGGATG AGCCGTCTAT CGGCCTGCAC CAGCGCGATA ACGAGCGCCT GTTGGGTACG CTTATCCATC TGCGCGATCT CGGTAATACC GTGATTGTGG TGGAGCACGA CGAAGACGCG ATTCGCGCCG CTGATCATGT GATCGATATC GGTCCTGGTG CAGGTGTACA TGGCGGTGAA GTGGTCGCGG AAGGTCCGCT GGAAGCGATT ATGGCGGTGC CGGAGTCGTT GACCGGACAG TACATGAGCG GTAAACGCAA GATTGAAGTG 
CCGAAGAAAC GCGTTCCGGC GAATCCGGAA AAAGTACTTA AGCTGACTGG CGCACGCGGC AACAACCTGA AAGACGTGAC GCTAACGCTG CCAGTCGGTC TGTTTACCTG CATCACAGGG GTTTCAGGTT CCGGTAAATC GACGCTGATT AACGACACAC TGTTCCCGAT TGCCCAACGC CAGTTGAATG GTGCGACCAT CGCCGAACCG GCACCGTATC GCGATATTCA GGGACTGGAG CATTTCGACA AAGTAATCGA TATCGACCAA AGCCCAATTG GTCGTACTCC GCGTTCTAAC CCGGCGACCT ATACCGGCGT GTTTACGCCT GTGCGCGAAC TGTTTGCGGG CGTACCGGAA TCCCGTGCGC GTGGTTATAC GCCAGGACGT TTCAGCTTTA ACGTCCGTGG CGGACGCTGC GAGGCCTGTC AGGGCGATGG CGTGATCAAA GTGGAGATGC ACTTCCTGCC GGACATTTAC GTGCCGTGCG ACCAGTGCAA AGGTAAACGC TATAACCGTG AAACGCTGGA GATTAAGTAC AAAGGCAAAA CCATCCACGA AGTGCTGGAT ATGACCATCG AAGAGGCGCG TGAGTTCTTT GATGCGGTGC CAGCTCTGGC GCGTAAGCTG CAAACGTTGA TGGACGTTGG TCTGACGTAC ATTCGCTTGG GGCAGTCCGC AACCACACTT TCTGGTGGTG AAGCCCAGCG CGTGAAGCTG GCGCGTGAGC TGTCAAAACG CGGCACCGGG CAGACGCTGT ATATTCTCGA CGAGCCGACC ACCGGTCTGC ACTTCGCCGA TATTCAGCAA CTGCTCGACG TACTGCATAA ACTGCGCGAT CAGGGCAACA CCATTGTGGT GATTGAGCAC AATCTCGACG TGATTAAAAC CGCTGACTGG ATTGTCGACC TGGGGCCGGA AGGCGGCAGT GGCGGCGGCG AGATCCTCGT CTCCGGTACG CCAGAAACCG TCGCGGAGTG CGAAGCTTCG CATACGGCAC GCTTCCTCAA GCCGATGCTG TAA
|
Protein sequence | MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LYARVGEPRC PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID GEVCDLSDPP KLELQKKHTI EVVVDRFKVR DDLTQRLAES FETALELSGG TAVVADMDDP KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPEL SLAGGAIRGW DRRNFYYFQM LKSLADHYKF DVEAPWGSLS ANVHKVVLYG SGKENIEFKY MNDRGDTSIR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLRREARH VYVENTPLPA ISDMSIGHAM EFFNNLKLAG QRAKIAEKIL KEIGDRLKFL VNVGLNYLTL SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRDLGNT VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV PKKRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE SRARGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHKLRD QGNTIVVIEH NLDVIKTADW IVDLGPEGGS GGGEILVSGT PETVAECEAS HTARFLKPML
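The relationship between the two sequences above can be illustrated by translating the ends of the gene sequence. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 uses the same amino-acid assignments as the standard code (the tables differ only in which codons may act as start codons). Only the first 30 and last 15 bases of the gene sequence are copied here, and the names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        for i in range(0, usable, 3)
    )


first_30 = "ATGGATAAGA TCGAAGTTCG GGGCGCCCGC"  # start of the gene sequence
last_15 = "AAGCCGATGCTGTAA"                    # end of the gene sequence

print(translate(first_30))  # MDKIEVRGAR
print(translate(last_15))   # KPML*
```

The outputs match the first ten and last four residues of the protein sequence, with the terminal TAA read as the stop codon. The gene sequence is shown on the coding strand (it begins with ATG), so no reverse complement is needed despite the "-" strand annotation.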