Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3969 |
Symbol | uvrA |
ID | 6064504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4359981 |
End bp | 4362803 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641603382 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001726897 |
Protein GI | 170021943 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0893236 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAACAT CAACCTCGTT ATCCCCCGCG ACAAGCTCAT TGTCGTGACC GGGCTTTCGG GTTCTGGCAA ATCCTCGCTC GCTTTCGACA CCTTATATGC CGAAGGGCAG CGCCGTTACG TTGAATCCCT TTCCGCCTAC GCGCGGCAGT TTCTGTCACT GATGGAAAAG CCGGACGTCG ATCATATTGA GGGGCTTTCT CCTGCCATCT CAATTGAGCA GAAATCGACG TCTCATAACC CGCGTTCTAC GGTGGGGACA ATCACCGAAA TCCACGACTA TTTGCGTTTG TTGTTCGCCC GCGTTGGCGA GCCGCGCTGT CCGGACCACG ACGTCCCGCT GGCGGCGCAA ACCGTCAGCC AGATGGTGGA TAACGTGCTG TCGCAGCCGG AAGGCAAGCG TCTGATGCTA CTCGCGCCAA TCATTAAAGA GCGCAAAGGC GAACACACCA AAACGCTGGA GAACCTGGCA AGCCAGGGCT ACATCCGTGC TCGTATTGAT GGCGAAGTCT GCGATCTTTC CGATCCGCCA AAACTGGAAC TGCAAAAGAA ACATACCATT GAAGTGGTGG TTGATCGCTT CAAGGTGCGT GACGATCTTA CCCAACGTCT TGCCGAGTCA TTTGAAACCG CGCTGGAGCT TTCCGGTGGT ACCGCGGTAG TGGCGGATAT GGACGACCCG AAAGCGGAAG AGCTGCTGTT CTCCGCCAAC TTCGCCTGCC CAATTTGCGG CTACAGTATG CGTGAACTGG AGCCGCGACT GTTTTCGTTT AACAACCCGG CGGGGGCCTG CCCGACCTGC GACGGCCTTG GCGTACAGCA ATATTTCGAT CCTGATCGAG TGATCCAGAA TCCGGAACTG TCGCTGGCTG GTGGTGCGAT CCGTGGCTGG GATCGCCGCA ACTTCTATTA TTTCCAGATG CTGAAATCGC TGGCAGATCA CTATAAGTTC GACGTCGAAG CGCCGTGGGG CAGCCTGAGC GCGAACGTGC ATAAAGTGGT GTTGTACGGT TCTGGCAAAG AAAACATTGA ATTCAAATAC ATGAACGATC GTGGCGATAC CTCCATTCGT CGTCATCCGT TCGAAGGCGT GCTGCATAAT ATGGAGCGCC GCTATAAAGA GACGGAATCC AGCGCGGTAC GCGAAGAATT AGCCAAGTTT ATCAGTAATC GTCCGTGCGC CAGCTGCGAA GGGACGCGTC TGCGTCGGGA AGCGCGCCAC GTGTATGTCG AGAATACGCC GCTGCCTGCT ATCTCCGACA TGAGCATTGG TCATGCGATG GAATTCTTCA ACAATCTCAA ACTCGCAGGT CAGCGGGCGA AGATTGCAGA AAAAATCCTT AAAGAGATCG GCGATCGTCT GAAATTCCTC GTTAACGTCG GCCTGAATTA CCTGACGCTT TCCCGCTCGG CAGAAACGCT TTCTGGCGGT GAAGCACAGC GTATCCGTCT GGCGAGCCAG ATTGGTGCGG GCCTGGTTGG CGTTATGTAC GTGCTGGACG AGCCGTCTAT CGGCCTGCAC CAGCGTGATA ACGAGCGCCT GTTGGGTACG CTTATCCATC TGCGCGATCT CGGTAATACC GTGATTGTGG TGGAGCACGA CGAAGACGCA ATTCGCGCCG CTGACCATGT GATCGACATT GGCCCGGGCG CAGGTGTTCA CGGCGGTGAA GTGGTCGCAG AAGGTCCGCT GGAAGCGATT ATGGCGGTGC CGGAGTCGTT GACCGGGCAG TACATGAGCG GCAAACGCAA GATTGAAGTG CCGAAGAAAC GCGTTCCGGC GAATCCGGAA AAAGTGCTGA AGCTGACAGG CGCACGCGGC AACAACCTGA AGGACGTGAC GCTGACGCTG CCGGTGGGTC TGTTTACCTG CATCACCGGG GTTTCAGGTT CCGGTAAATC GACGCTGATT AACGACACGC TGTTCCCTAT TGCCCAACGC CAGTTGAATG GTGCGACCAT CGCCGAACCG GCACCGTATC GCGATATTCA GGGGCTGGAG CATTTCGACA AAGTGATCGA TATCGACCAA AGCCCAATTG GTCGTACTCC GCGTTCTAAC CCGGCGACCT ATACCGGCGT GTTTACACCT GTGCGCGAAC TTTTTGCGGG CGTACCGGAA TCCCGTGCGC GTGGTTATAC GCCAGGACGT TTCAGCTTTA ACGTCCGTGG CGGACGCTGC GAAGCCTGTC AGGGCGACGG TGTGATCAAA GTGGAGATGC ACTTCCTGCC GGACATTTAC GTACCGTGCG ACCAGTGTAA AGGTAAACGC TATAACCGTG AAACGCTGGA AATTAAGTAC AAAGGCAAAA CCATCCACGA AGTGCTGGAT ATGACCATCG AAGAGGCGCG TGAGTTCTTT GATGCCGTAC CTGCACTGGC GCGTAAGCTG CAAACGTTGA TGGACGTTGG CCTGACGTAC ATTCGCCTGG GGCAGTCCGC AACCACCCTT TCTGGTGGTG AAGCCCAGCG CGTGAAGCTG GCGCGTGAGC TGTCAAAACG CGGCACCGGG CAGACACTGT ATATTCTCGA CGAGCCGACC ACCGGTTTGC ACTTCGCCGA TATTCAGCAA CTGCTCGACG TGCTGCATAA ACTGCGCGAT CAGGGCAATA CCATTGTGGT AATTGAGCAC AATCTCGACG TGATTAAAAC CGCTGACTGG ATTGTCGACC TGGGACCGGA AGGCGGCAGT GGCGGCGGCG AGATCCTCGT CTCCGGTACG CCAGAAACCG TCGCGGAGTG CGAAGCTTCG CATACGGCGC GCTTCCTCAA GCCGATGCTG TAA
|
Protein sequence | MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID GEVCDLSDPP KLELQKKHTI EVVVDRFKVR DDLTQRLAES FETALELSGG TAVVADMDDP KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPEL SLAGGAIRGW DRRNFYYFQM LKSLADHYKF DVEAPWGSLS ANVHKVVLYG SGKENIEFKY MNDRGDTSIR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLRREARH VYVENTPLPA ISDMSIGHAM EFFNNLKLAG QRAKIAEKIL KEIGDRLKFL VNVGLNYLTL SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRDLGNT VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV PKKRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE SRARGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHKLRD QGNTIVVIEH NLDVIKTADW IVDLGPEGGS GGGEILVSGT PETVAECEAS HTARFLKPML
|
| |