Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4572 |
Symbol | uvrA |
ID | 6272340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4269421 |
End bp | 4272243 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641728352 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001882750 |
Protein GI | 187730738 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAACAT CAACCTCGTT ATCCCCCGCG ACAAGCTCAT TGTCGTGACC GGGCTTTCGG GTTCTGGCAA ATCCTCGCTC GCTTTCGACA CCTTATATGC CGAAGGGCAG CGCCGTTACG TTGAATCCCT TTCCGCCTAT GCGCGGCAGT TTCTGTCACT GATGGAAAAG CCGGACGTCG ATCATATTGA GGGGCTTTCT CCTGCCATCT CAATTGAGCA GAAATCGACG TCTCATAACC CGCGTTCTAC GGTGGGGACA ATCACCGAAA TCCACGACTA TTTGCGTTTG TTGTTCGCCC GCGTCGGCGA ACCGCGCTGT CCGGACCACG ACGTCCCGCT GGCGGCGCAA ACCGTCAGCC AGATGGTGGA TAACGTGCTG TCGCAGCCGG AAGGCAAGCG TCTGATGCTA CTCGCGCCAA TCATTAAAGA GCGCAAAGGC GAACACACCA AAACGCTGGA GAACCTAGCA AGCCAGGGTT ACATCCGTGC TCGTATTGAT GGCGAAGTCT GCGATCTTTC CGATCCGCCG AAACTGGAAC TGCAAAAGAA ACACACTATT GAAGTGGTGG TTGATCGCTT CAAGGTGCGT GACGATCTTA CCCAACGTCT TGCGGAGTCG TTTGAAACCG CGCTGGAGCT TTCCGGTGGT ACAGCGGTAG TGGCGGATAT GGACGACCCG AAAGCGGAAG AGCTGCTGTT CTCCGCCAAC TTCGCCTGCC CAATTTGCGG CTACAGTATG CGTGAACTGG AGCCACGACT GTTTTCGTTT AACAACCCGG CAGGTGCCTG CCCAACCTGT GACGGCCTTG GCGTACAGCA ATATTTCGAT CCTGACCGCG TGATCCAAAA CCCCGAGCTG TCACTGGCTG GCGGTGCGAT CCGTGGCTGG GATCGCCGCA ACTTCTATTA CTTCCAGATG CTGAAATCGC TGGCAGATCA CTATAAGTTC GACGTCGAAG CGCCGTGGGG CAGCCTGAGC GCGAACGTGC ATAAAGTGGT GTTGTACGGT TCTGGCAAAG AAAACATTGA ATTCAAATAC ATGAACGATC GTGGCGATAC CTCCATCCGT CGTCATCCGT TCGAAGGCGT GCTGCACAAT ATGGAGCGCC GTTATAAAGA GACAGAATCC AGTGCGGTAC GTGAAGAATT AGCCAAGTTT ATCAGCAATC GCCCATGCGC CAGCTGCGAA GGGACGCGTC TGCGTCGGGA AGCGCGCCAC GTTTATGTCG AGAATACGCC GCTGCCTGCT ATCTCCGACA TGAGCATCGG TCATGCGATG GAATTCTTCA ACAATCTCAA ACTAGCTGGT CAGCGGGCGA AGATTGCAGA AAAAATCCTT AAAGAGATCG GCGATCGTCT GAAATTCCTC GTTAACGTCG GCCTGAATTA CCTGACACTT TCCCGCTCGG CAGAAACACT TTCCGGCGGT GAAGCCCAGC GTATCCGTCT GGCGAGCCAG ATTGGTGCGG GCCTGGTTGG CGTTATGTAT GTACTGGACG AGCCGTCTAT CGGCCTGCAC CAGCGCGATA ACGAGCGCCT GTTGGGTACG CTTATCCATC TGCGCGATCT CGGTAATACC GTGATTGTGG TGGAGCATGA CGAAGACGCA ATTCGCGCCG CTGACCATGT GATCGATATC GGTCCTGGTG CGGGTGTTCA CGGCGGTGAA GTGGTCGCAG AAGGTCCGCT GGAAGCGATC ATGGCGGTGC CTGAATCGTT GACCGGGCAG TACATGAGCG GTAAACGCAA GATTGAAGTG CCGAAGAAAC GCGTTCCGGC GAATCCAGAA AAAGTGCTGA AGCTGACAGG TGCACGCGGC AACAACCTGA AGGACGTGAC GCTCACGCTG CCAGTCGGTC TGTTTACCTG CATCACAGGG GTTTCAGGTT CCGGTAAATC GACGCTGATT AACGACACAC TGTTCCCGAT TGCCCAACGC CAGTTGAATG GTGCGACCAT CGCCGAACCG GCACCGTATC GCGATATTCA GGGGCTGGAG CATTTCGATA AAGTGATCGA TATCGACCAA AGCCCAATTG GTCGTACTCC ACGTTCTAAC CCGGCGACCT ATACCGGCGT GTTTACACCT GTGCGCGAAC TGTTTGCGGG CGTACCGGAA TCCCGTGCGC GTGGTTATAC GCCAGGACGT TTCAGCTTTA ACGTCCGTGG CGGGCGCTGC GAAGCCTGTC AGGGCGACGG TGTGATCAAA GTGGAGATGC ACTTCCTGCC GGATATCTAC GTGCCGTGCG ATCAGTGCAA AGGTAAACGC TATAACCGTG AAACACTGGA AATTAAGTAC AAAGGCAAAA CCATCCACGA AGTGCTGGAT ATGACCATCG AAGAGGCGCG TGAGTTCTTT GATGCGGTGC CAGCTCTGGC GCGTAAGCTG CAAACGTTGA TGGACGTTGG TCTGACGTAC ATTCGCCTGG GGCAGTCCGC AACCACACTT TCTGGTGGTG AAGCCCAGCG CGTGAAGCTG GCGCGTGAGC TGTCAAAACG TGGCACCGGG CAGACGCTGT ATATTCTCGA CGAGCCGACC ACCGGTCTGC ACTTCGCCGA TATTCAGCAA CTCCTCGACG TGCTGCATAA ACTGCGCGAT CAGGGCAATA CCATTGTGGT GATTGAGCAC AATCTCGACG TGATCAAAAC CGCTGACTGG ATTGTCGACC TGGGGCCAGA AGGCGGCAGT GGCGGCGGCG AAATCCTCGT CTCCGGTACG CCAGAAACCG TCGCGGAGTG CGAAGCTTCG CATACGGCAC GCTTCCTCAA GCCGATGCTG TAA
|
Protein sequence | MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID GEVCDLSDPP KLELQKKHTI EVVVDRFKVR DDLTQRLAES FETALELSGG TAVVADMDDP KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPEL SLAGGAIRGW DRRNFYYFQM LKSLADHYKF DVEAPWGSLS ANVHKVVLYG SGKENIEFKY MNDRGDTSIR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLRREARH VYVENTPLPA ISDMSIGHAM EFFNNLKLAG QRAKIAEKIL KEIGDRLKFL VNVGLNYLTL SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRDLGNT VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV PKKRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE SRARGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHKLRD QGNTIVVIEH NLDVIKTADW IVDLGPEGGS GGGEILVSGT PETVAECEAS HTARFLKPML
|
| |