Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4610 |
Symbol | uvrA |
ID | 5587828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4610133 |
End bp | 4612955 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640928226 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001465558 |
Protein GI | 157157192 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAACAT CAACCTCGTT ATCCCCCGCG ACAAGCTCAT TGTCGTGACC GGGCTTTCGG GTTCTGGCAA ATCCTCGCTC GCTTTCGACA CCTTATATGC CGAAGGGCAG CGCCGTTACG TTGAATCCCT TTCCGCCTAC GCGCGGCAGT TTCTGTCACT GATGGAAAAG CCGGACGTCG ATCATATTGA GGGGCTTTCT CCTGCCATCT CAATTGAGCA GAAATCGACG TCTCATAACC CGCGTTCTAC GGTGGGGACA ATCACCGAAA TCCACGACTA TTTGCGTTTG TTGTTCGCCC GCGTCGGCGA ACCGCGCTGT CCGGACCACG ACGTCCCGCT GGCGGCGCAA ACCGTCAGCC AGATGGTGGA TAACGTGCTG TCGCAGCCGG AAGGCAAGCG TCTGATGCTG CTCGCGCCAA TCATTAAAGA GCGCAAAGGC GAACACACCA AAACGCTGGA GAACCTGGCA AGCCAGGGTT ACATCCGTGC TCGTATTGAT GGCGAAGTCT GCGATCTTTC CGATCCGCCG AAACTGGAAC TGCAAAAGAA ACATACCATT GAAGTGGTGG TTGATCGCTT CAAGGTGCGT GACGATCTTA CCCAACGTCT TGCGGAGTCG TTTGAAACCG CGCTGGAGCT TTCCGGTGGT ACAGCGGTAG TGGCGGATAT GGACGACCCG AAAGCGGAAG AGCTGCTGTT CTCCGCCAAC TTCGCCTGCC CAATTTGCGG CTACAGTATG CGTGAACTGG AGCCGCGACT GTTTTCGTTT AACAACCCGG CAGGTGCCTG CCCGACCTGT GACGGCCTTG GCGTACAGCA ATATTTCGAT CCTGACCGCG TGATCCAAAA CCCCGAGCTG TCACTGGCTG GCGGTGCGAT CCGTGGCTGG GATCGCCGCA ACTTCTATTA CTTCCAGATG CTGAAATCGC TGGCAGATCA CTATAAGTTC GACGTCGAAG CGCCGTGGGG CAGCCTGAGC GCGAACGTGC ATAAAGTGGT GTTGTACGGT TCTGGCAAAG AAAACATTGA ATTCAAATAC ATGAACGATC GTGGCGATAC CTCCATCCGT CGTCATCCGT TCGAAGGCGT GCTGCACAAT ATGGAGCGCC GTTATAAAGA GACAGAATCC AGTGCGGTAC GTGAAGAATT AGCCAAGTTT ATCAGCAATC GCCCATGCGC CAGCTGCGAA GGGACGCGTC TGCGTCGGGA AGCGCGCCAC GTTTATGTCG AGAATACGCC GCTGCCTGCT ATCTCCGACA TGAGCATCGG TCATGCGATG GAATTCTTCA ACAATCTCAA ACTCGCTGGT CAACGGGCGA AGATTGCGGA AAAAATCCTT AAAGAGATCG GCGATCGTTT GAAATTCCTC GTTAACGTCG GCCTGAATTA CCTGACGCTT TCCCGCTCGG CAGAAACGCT TTCCGGCGGT GAAGCCCAGC GTATCCGTCT GGCGAGCCAG ATTGGTGCGG GCCTGGTTGG CGTTATGTAC GTGCTGGACG AGCCATCTAT CGGCCTGCAC CAGCGCGATA ACGAGCGCCT GTTGGGTACG CTTATCCATC TGCGCGATCT CGGTAATACC GTGATTGTGG TGGAGCACGA CGAAGACGCA ATTCGCGCCG CTGACCATGT GATCGACATT GGCCCGGGCG CAGGTGTTCA CGGCGGTGAA GTGGTCGCAG AAGGTCCGCT GGAAGCGATT ATGGCGGTGC CGGAGTCGTT GACCGGGCAG TACATGAGCG GCAAACGCAA GATTGAAGTG CCGAAGAAAC GCGTTCCGGC GAATCCGGAA AAAGTGCTGA AGCTGACAGG CGCACGCGGC AACAACCTGA AGGACGTGAC GCTGACGCTG CCGGTGGGTC TGTTTACCTG CATCACCGGG GTTTCAGGTT CCGGTAAATC GACGCTGATT AACGACACAC TGTTCCCGAT TGCCCAACGC CAGTTGAATG GGGCGACCAT CGCCGAACCA GCACCGTATC GCGATATTCA GGGGCTGGAG CATTTCGATA AAGTGATCGA TATCGACCAA AGCCCAATTG GTCGTACTCC ACGTTCTAAC CCGGCGACCT ATACCGGCGT GTTTACGCCT GTGCGCGAAC TGTTTGCGGG CGTACCGGAA TCCCGTGCGC GTGGTTATAC GCCAGGACGT TTCAGCTTTA ACGTCCGTGG CGGGCGCTGC GAAGCCTGTC AGGGCGACGG TGTGATCAAA GTAGAGATGC ACTTCCTGCC GGATATCTAC GTGCCGTGCG ATCAGTGTAA AGGTAAACGC TATAACCGTG AAACGCTGGA AATTAAGTAC AAAGGCAAAA CCATCCACGA AGTGCTGGAT ATGACCATCG AAGAGGCGCG TGAGTTCTTT GATGCCGTAC CTGCACTGGC GCGTAAGCTG CAAACGTTGA TGGACGTTGG CCTGACGTAC ATTCGCCTGG GGCAGTCCGC AACCACACTT TCTGGTGGTG AAGCCCAGCG CGTGAAGCTG GCGCGTGAGC TGTCAAAACG CGGCACCGGG CAGACGCTGT ATATTCTCGA CGAGCCGACC ACCGGTCTGC ACTTCGCCGA TATTCAGCAA CTGCTCGACG TACTGCATAA ACTGCGCGAT CAGGGCAACA CCATTGTGGT GATTGAGCAC AATCTCGACG TGATCAAAAC CGCTGACTGG ATTGTCGACC TGGGACCAGA AGGCGGCAGT GGCGGCGGCG AAATCCTCGT CTCCGGTACG CCAGAAACCG TCGCGGAGTG CGAAGCTTCG CATACGGCAC GCTTCCTCAA GCCGATGCTG TAA
|
Protein sequence | MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID GEVCDLSDPP KLELQKKHTI EVVVDRFKVR DDLTQRLAES FETALELSGG TAVVADMDDP KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPEL SLAGGAIRGW DRRNFYYFQM LKSLADHYKF DVEAPWGSLS ANVHKVVLYG SGKENIEFKY MNDRGDTSIR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLRREARH VYVENTPLPA ISDMSIGHAM EFFNNLKLAG QRAKIAEKIL KEIGDRLKFL VNVGLNYLTL SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRDLGNT VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV PKKRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE SRARGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHKLRD QGNTIVVIEH NLDVIKTADW IVDLGPEGGS GGGEILVSGT PETVAECEAS HTARFLKPML
|
| |