Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4520 |
Symbol | uvrA |
ID | 6144389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4618636 |
End bp | 4621458 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619336 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001746448 |
Protein GI | 170684169 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAACAT CAACCTCGTT ATCCCCCGCG ACAAGCTCAT TGTCGTGACC GGGCTTTCGG GTTCTGGCAA ATCCTCGCTC GCTTTCGACA CCTTATATGC CGAAGGGCAG CGCCGTTACG TTGAATCACT TTCCGCCTAC GCGCGGCAGT TTCTGTCACT GATGGAAAAG CCGGACGTCG ATCATATTGA GGGGCTTTCT CCTGCCATCT CAATTGAGCA GAAATCAACG TCTCATAACC CGCGTTCTAC GGTGGGGACA ATCACCGAAA TCCACGACTA TTTGCGTTTG TTGTTCGCCC GCGTCGGCGA GCCGCGCTGT CCGGACCACG ACGTCCCGCT GGCGGCGCAA ACCGTCAGCC AGATGGTGGA TAACGTGCTG TCGCAGCCGG AAGGCAAGCG TCTGATGCTG CTCGCGCCAA TCATTAAAGA GCGCAAAGGC GAACACACCA AAACGCTGGA GAACCTGGCA AGTCAGGGTT ACATCCGTGC TCGTATTGAT GGCGAAGTCT GCGATCTTTC CGATCCGCCG AAACTGGAAC TGCAAAAGAA ACATACCATT GAAGTGGTGG TTGATCGCTT CAAGGTGCGT GACGATCTTA CCCAACGTCT TGCCGAGTCG TTTGAAACCG CGCTGGAGCT TTCCGGTGGT ACCGCGGTAG TTGCGGATAT GGACGACCCG AAAGCGGAAG AGTTGCTGTT CTCCGCTAAT TTCGCCTGCC CAATTTGCGG CTACAGTATG CGTGAACTGG AGCCGCGACT GTTTTCGTTT AACAACCCGG CAGGGGCCTG CCCGACCTGT GACGGCCTTG GCGTGCAGCA ATATTTCGAT CCTGACCGCG TGATCCAGAA TCCGGAACTG TCGCTGGCTG GCGGTGCGAT CCGTGGCTGG GATCGCCGCA ACTTCTATTA TTTCCAGATG CTGAAATCGC TGGCAGATCA CTATAAGTTC GACGTCGAAG CGCCGTGGGG CAGCCTGAGC GCGAACGTGC ATAAAGTGGT GTTGTACGGT TCTGGCAAAG AAAACATTGA ATTCAAATAC ATGAACGATC GTGGCGATAC CTCCATTCGT CGTCATCCGT TTGAAGGCGT GCTGCACAAT ATGGAGCGCC GTTATAAAGA GACGGAGTCC AGCGCGGTAC GCGAAGAATT AGCCAAGTTC ATCAGCAATC GCCCGTGCGC CAGCTGCGAA GGGACGCGTC TGCGTCGGGA AGCGCGCCAC GTGTATGTCG AGAATACGCC GCTGCCTGCT ATCTCCGACA TGAGCATTGG TCATGCGATG GAATTCTTCA ACAATCTCAA ACTCGCAGGT CAGCGGGCGA AGATTGCGGA AAAAATTCTT AAAGAGATCG GCGATCGTTT GAAATTCCTC GTTAACGTCG GCCTGAATTA CCTGACGCTT TCCCGCTCGG CAGAAACGCT TTCTGGCGGT GAAGCCCAGC GTATTCGTCT GGCGAGCCAG ATTGGTGCGG GCCTGGTTGG CGTTATGTAC GTGCTGGATG AGCCGTCTAT CGGCCTGCAC CAGCGCGATA ACGAGCGCCT GTTGGGTACG CTTATCCATC TGCGCGATCT CGGTAATACC GTGATTGTGG TGGAGCACGA CGAAGACGCG ATTCGCGCGG CTGACCATGT GATCGATATC GGTCCTGGTG CGGGTGTTCA CGGCGGTGAA GTGGTCGCGG AAGGTCCACT GGAAGCGATT ATGGCGGTGC CGGAGTCGTT GACCGGACAG TACATGAGCG GTAAACGCAA GATTGAAGTG CCGAAGAAAC GCGTTCCGGC GAATCCGGAA AAAGTGCTGA AGCTGACGGG TGCACGCGGT AATAACCTGA AAGACGTGAC GCTAACGCTG CCAGTCGGTC TGTTTACCTG CATCACAGGG GTTTCAGGTT CCGGTAAATC GACACTGATT AACGACACAC TGTTCCCGAT TGCCCAACGC CAGTTGAATG GTGCGACCAT CGCCGAACCG GCACCGTATC GCGATATTCA GGGACTGGAG CATTTCGATA AAGTGATCGA TATCGACCAA AGCCCAATTG GTCGTACTCC GCGTTCTAAC CCAGCGACCT ATACCGGCGT GTTTACGCCT GTGCGCGAAC TGTTTGCGGG CGTACCGGAA TCCCGTGCGC GCGGCTATAC GCCGGGACGT TTCAGCTTTA ACGTCCGTGG CGGACGCTGC GAAGCCTGTC AGGGCGACGG CGTGATCAAA GTGGAGATGC ACTTCCTGCC GGACATTTAC GTACCGTGCG ACCAGTGTAA AGGTAAACGC TATAACCGTG AAACGCTGGA GATTAAGTAC AAAGGCAAAA CCATCCACGA AGTGCTGGAT ATGACCATCG AAGAGGCGCG TGAGTTCTTT GATGCGGTGC CAGCTCTGGC GCGTAAGCTG CAAACGCTGA TGGACGTTGG TCTGACGTAC ATTCGCCTGG GGCAGTCCGC AACCACCCTT TCAGGCGGTG AAGCCCAGCG CGTGAAGCTG GCGCGTGAGC TGTCAAAACG CGGCACCGGG CAGACACTGT ATATTCTCGA CGAGCCGACC ACCGGTCTGC ACTTCGCCGA TATTCAGCAA CTGCTCGACG TGCTGCATAA ACTGCGCGAT CAGGGCAATA CCATTGTGGT GATTGAACAC AATCTCGACG TGATCAAAAC CGCTGACTGG ATTGTCGACC TGGGACCGGA AGGCGGCAGT GGCGGCGGCG AGATCCTCGT CTCCGGTACG CCAGAAACCG TCGCGGAGTG CGAAGCTTCG CATACGGCAC GCTTCCTCAA GCCGATGCTG TAA
|
Protein sequence | MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID GEVCDLSDPP KLELQKKHTI EVVVDRFKVR DDLTQRLAES FETALELSGG TAVVADMDDP KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPEL SLAGGAIRGW DRRNFYYFQM LKSLADHYKF DVEAPWGSLS ANVHKVVLYG SGKENIEFKY MNDRGDTSIR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLRREARH VYVENTPLPA ISDMSIGHAM EFFNNLKLAG QRAKIAEKIL KEIGDRLKFL VNVGLNYLTL SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRDLGNT VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV PKKRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE SRARGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHKLRD QGNTIVVIEH NLDVIKTADW IVDLGPEGGS GGGEILVSGT PETVAECEAS HTARFLKPML
|
| |