Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3107 |
Symbol | uvrA |
ID | 5200350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 3416276 |
End bp | 3419242 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640582655 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001263594 |
Protein GI | 148556012 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.378595 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACTC ATATTTCGGT GCGCGGCGCG CGCGAGCACA ACCTCAAGGG CGTCGATGTC GAGCTGCCCC GCGACCGGCT GATCGTGATC ACCGGCCTGT CCGGATCGGG CAAGTCGAGT CTCGCCTTCG ACACCATCTA TGCCGAGGGG CAACGCCGCT ACGTCGAGTC GCTGTCGGCC TATGCGCGCC AGTTCCTCGA GCTGATGCAG AAGCCCGACG TCGACCATAT CGAGGGCCTC TCCCCCGCCA TCTCGATCGA GCAGAAGACG ACCAGCCGCA ACCCGCGCTC GACCGTCGCG ACGGTGACCG AGATCTACGA CTATATGCGC CTGCTCTGGG CGCGGGTTGG CGTGCCCTAT TCGCCCGCGA CGGGCCTGCC GATCTCGGCG CAGACGGTCA GCCAGATGGT CGACCGCATC CTGCAATTGA AGGAGGGCAC CCGCTTCCTG CTGCTCGCCC CGGTCGTGCG CGGCCGCAAG GGCGAGTATC GCAAGGAGCT GGCCGAATGG CAGAAGGCCG GCTTCACCCG CGTCCGCATC GACGGCGAGA TGCACGAGAT CGAGGCGGCG CCCGCGCTCG ACAAGAAGTA CAAGCACGAC ATCGAGGTGG TGGTCGACCG CCTCGTCGTC CGCGAGGGGA TGGAGACCCG CCTCGCCCAG AGCCTGGAGA CCGCGCTGCG CCTGGCCGAC GGCCTGGCCT ATGCCGACCT GGTCGACACG ACGGTGGCGG AGGCGATGGC ACCGAGCGGC GTCCGCGAGA TGCCGGCCAG CTTCCAATAT GCCGAGACGA TGGAGACGGG CGGCGCGATG AAGGGCGCCG GCGTTCCCGC CAACCGCATC ACCTTCTCGG AGAAATTCGC CTGCCCCGTC TCGGGCTTCA CCATCGCCGA GATCGAGCCG CGGCTGTTCT CGTTCAACGC GCCGCAGGGC GCCTGCCCGG CCTGCGACGG CCTCGGCGAG AAGCTCTATT TCGACCCGCA GCTCGTCGTC CCCAACGAGG CGCTGTCGAT CAAGCAGGGC GCGGTGGTGC CCTGGGCCAA GTCCAACCCG CCCAGCCCCT ATTACATGCA GGTACTGAGC TCGGTCGCGC GCGCCTACGG CTTCAAGCTG GAGACGCCGT GGAACGAGCT GCCCGAGGAG ATCCGCGATA CCGTCCTCTA CGGCACCAAG GGGCGGGTGA TCACCCTCAC CTTCATCGAC GGCCGCAAGA GCTACGACGT GCAGAAGCCG TTCGAGGGGG TGATCCACAA TCTCGAGCGC CGCATGCTGT CGACCGAGAG CGCCTGGATG CGCGAGGAGC TCGCCAAGTA TCAGAGCGCC GCGCATTGCG AGGTGTGCGG CGGCGCGCGC CTCAAGCCCG AGGCGCTGGC GGTCAAGATC GCGGGCGAGG ACATCTCGAT GTCGACCCGC CGCGCGGTCG GCCCCGCGCT CAAATTCTTC CGCGAGATGC CCGACCATCT CAACGAGCAG CAGAAGGCGA TCGCCGAGCG CATCCTCAAG GAGATCGTCG AGCGGCTGGG CTTCCTCGAC AATGTCGGGC TCGACTATCT CAACCTCGAC CGCACCTCGG GCACGCTGTC GGGCGGCGAG AGCCAGCGCA TCCGCCTCGC CAGCCAGATC GGGTCGGGAC TTTCCGGCGT CCTCTACGTG CTCGACGAGC CGTCGATCGG CCTGCACCAG CGCGACAACG ACCGGCTGCT GGTGACGCTG CGGCGGCTGC GCGACCTCGG CAACACGGTG ATCGTCGTCG AGCATGACGA GGATGCGATC CGCACCGCCG ACTATATCCT CGACATGGGG CCGGGCGCCG GCGTCCATGG CGGCGAGGTG GTGGCGTCGG GCACGCTCGA CGACATCCTC GCCAACGAGG CGAGCCTAAC CGGCGACTAT CTGTCGGGCC GCCGCATGGT CGAGGTGCCG GCGAAGCGGC GCAAGGGCAA CGGCAAGAAG GTCACCGTCC ACAATGCGCG GGCGAACAAC CTGAAGAATG TCACCGCGGC GATCCCGCTC GGCACCTTCA CCTGCATCAC CGGCGTCTCG GGATCGGGCA AGTCGAGCTT CACGATCGAC ACGCTCTACG CCTCGGCCGC GCGCCAGCTC AACGGCGCGC GCGTGGTGGC GGGCGCGCAC GACAAGATCA CCGGTCTGCA ATATTGCGAC AAGGTCATCG ACATCGACCA GTCGCCGATC GGCCGCACCC CGCGCTCGAA CCCGGCGACC TATACCGGCG CCTTCACCAA CATCCGCGAC TGGTTCGCGG GGCTCCCGGA GAGCGAGGCG CGCGGCTACA AGCCGGGCCG GTTCAGCTTC AACGTCAAGG GCGGCCGCTG CGAGGCGTGC ACCGGCGACG GCCTGCTCAA GATCGAGATG CACTTCCTGC CCGACGTCTA CGTCACCTGC GACGTCTGCC ACGGCGCGCG CTACAACCGC GAGACGCTGG AGGTGAAGTT CAAGGACAAG AGCATCGCCG ACGTGCTCGA CATGACCGTC GAGGACGCGG TCGAGTTCTT CAAGGCGGTC CCGCCGATCC GCGACAAGAT GGCGATGCTG GCCGAGGTCG GCCTCGGCTA CATCAAGGTC GGCCAGCAGG CGACGACGCT GTCGGGCGGC GAGGCGCAGC GCGTCAAGCT CGCCAAGGAA CTGTCGCGCC GGTCGACCGG CAACACCCTC TACATCCTCG ACGAGCCGAC CACCGGCCTC CACTTCGAGG ACGTCCGCAA GCTGCTCGAA GTGCTCCACG CGCTGGTCGA GCAGGGCAAC ACCGTGGTGG TGATCGAGCA CAACCTCGAG GTCATCAAGA CCGCCGACTG GGTGATGGAC CTCGGCCCCG AGGGCGGCGT GCGCGGCGGC GAGATCGTGG CGGAAGGGAC GCCGGAGCAG GTGGCGAAGG AGCCGCGCAG CTACACGGGC GCCTATCTCA AGCCGCTGCT GGAGGCAGGG GCGAGGAGTG CTCAGGCTGC CGAATAG
|
Protein sequence | MLTHISVRGA REHNLKGVDV ELPRDRLIVI TGLSGSGKSS LAFDTIYAEG QRRYVESLSA YARQFLELMQ KPDVDHIEGL SPAISIEQKT TSRNPRSTVA TVTEIYDYMR LLWARVGVPY SPATGLPISA QTVSQMVDRI LQLKEGTRFL LLAPVVRGRK GEYRKELAEW QKAGFTRVRI DGEMHEIEAA PALDKKYKHD IEVVVDRLVV REGMETRLAQ SLETALRLAD GLAYADLVDT TVAEAMAPSG VREMPASFQY AETMETGGAM KGAGVPANRI TFSEKFACPV SGFTIAEIEP RLFSFNAPQG ACPACDGLGE KLYFDPQLVV PNEALSIKQG AVVPWAKSNP PSPYYMQVLS SVARAYGFKL ETPWNELPEE IRDTVLYGTK GRVITLTFID GRKSYDVQKP FEGVIHNLER RMLSTESAWM REELAKYQSA AHCEVCGGAR LKPEALAVKI AGEDISMSTR RAVGPALKFF REMPDHLNEQ QKAIAERILK EIVERLGFLD NVGLDYLNLD RTSGTLSGGE SQRIRLASQI GSGLSGVLYV LDEPSIGLHQ RDNDRLLVTL RRLRDLGNTV IVVEHDEDAI RTADYILDMG PGAGVHGGEV VASGTLDDIL ANEASLTGDY LSGRRMVEVP AKRRKGNGKK VTVHNARANN LKNVTAAIPL GTFTCITGVS GSGKSSFTID TLYASAARQL NGARVVAGAH DKITGLQYCD KVIDIDQSPI GRTPRSNPAT YTGAFTNIRD WFAGLPESEA RGYKPGRFSF NVKGGRCEAC TGDGLLKIEM HFLPDVYVTC DVCHGARYNR ETLEVKFKDK SIADVLDMTV EDAVEFFKAV PPIRDKMAML AEVGLGYIKV GQQATTLSGG EAQRVKLAKE LSRRSTGNTL YILDEPTTGL HFEDVRKLLE VLHALVEQGN TVVVIEHNLE VIKTADWVMD LGPEGGVRGG EIVAEGTPEQ VAKEPRSYTG AYLKPLLEAG ARSAQAAE
|
| |