Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1192 |
Symbol | uvrA |
ID | 3916489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1235856 |
End bp | 1238768 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443928 |
Product | excinuclease ABC subunit A |
Protein accession | YP_496471 |
Protein GI | 87199214 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.548367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCTCA CTACCATTTC CGTCCGCGGC GCTCGTGAAC ACAATCTCAA GGGGATCGAC GTCGATCTCC CGCGCGACAG CCTGATCGTC ATCACCGGCC TTTCCGGCTC CGGCAAGTCG AGCCTGGCGT TCGACACGAT CTACGCCGAA GGGCAGCGCC GCTACGTGGA ATCGCTCTCG GCCTACGCGC GCCAGTTCCT CGAGATGATG CAGAAGCCGG ATGTCGAGCA TATCGACGGC CTCTCGCCCG CCATCTCGAT CGAGCAGAAG ACCACCAGCC GCAACCCGCG CTCCACCGTC GCCACGGTGA CCGAGGTCTA CGACTACATG CGCCTGCTCT GGGCGCGCGT CGGCATTCCC TATTCCCCCG CCACCGGCGA ACCGATCAGC GCGCAGACCG TCAGCCAGAT GGTCGACCGC GTCATGGCCC TGCCCGAAGG CACCCGCGCC TACCTGCTCG CTCCCGTCGT GCGCGGCCGC AAGGGCGAAT ACCGCCGCGA ACTCGCCGAA TGGCAGAAGG CCGGCTACAC CCGCGTCCGC ATCAACGGCG AAATGATGCC CATCGAGGAC GCCCCCGCGC TCGACAAGAA GCTCAAGCAC GACATCGAGG TTGTGGTCGA TCGCATCGCC GTCCGCGACG GCATCCAGAC GCGCCTCGCC GACAGCTTCG AACAGGCCCT CAAGCTCGCC GACGGCCTTG CCTACATCGA TCTGGCGGAT GGCGTCGTGC CTGGCCGTGA GGACGAGGCG GAGGCCGATG CCAAGGGCAA GAAGGGCTCG ATGAAGAAGA CCGGCCTCCC GGCCAACCGC ATCGTCTTCT CCGAAAAGTT CGCCTGCCCC GTCAGCGGCT TCACCATCGA GAGCATCGAG CCGCGCCTGT TCTCGTTCAA CGCCCCCCAG GGCGCCTGCC CCGCGTGCGA CGGTCTGGGC GAAAAGCTGC TGTTCGATCC CCAGCTCGTC GTCCCCAACG AGAACCTCAG CCTCAAGCAG GGCGCCGTCG TCCCCTGGGC GAAAAGCAAC CCGCCGTCGC CCTACTACAT GCAGGTCCTC GCCAGCCTCG CCGGCCACTT CGGCTTCAGC CTCGACACCC CGTGGGACGC GCTCCCGGCG GAAGTGAAGA TCGTCATCCT CTATGGCACC GCCGGCAAGG CCGTCCCGCT CACCTTCATC GACGGCAAGA AGTCCTACAC CGTCCAGAAG GCCTTCGAAG GCGTGATCGG CAACCTCAAC CGCCGCATGC TGCAAACCGA ATCCGCCTGG ATGCGCGAGG AACTCGGCAA GTTCCAGACC GCCCAGCCGT GCGAGACCTG CGAAGGCAAG CGCCTCAAGC CCGAGGCCCT GTCGGTCAAG GTCGCCGGGG CAGACATCTC CTCGATCACC CGCCTCTCCG TCGCCGCCGC GGTCGAATGG TACGGCGCGC TCGACAGCAA GCTCACCCCG CAGCAAAGCC AGATCGCCCG CGCCATCCTC AAGGAAATCA ACGAGCGTCT CGGCTTCCTC AACAACGTCG GCCTCGATTA CCTCAACCTC GATCGCACCT CGGGCACGCT CTCGGGCGGC GAGAGCCAGC GCATCCGCCT CGCCAGCCAG ATCGGCTCCG GCCTCTCGGG CGTGCTCTAC GTGCTCGACG AACCGTCCAT CGGCCTCCAC CAGCGCGACA ACGACCGCCT CCTCGAAACC CTCAAGCGGC TGCGCGATCT CGGCAACACC GTGATCGTCG TCGAACACGA CGAGGACGCA ATCCGCACCG CCGACCACGT GGTCGATCTT GGCCCCGGCG CGGGCGTCCA CGGCGGCGAG ATCGTCGCGC AGGGTACGCT CGACGATATC CTGTCCAACC CGAACAGCCT CACCGGCCAG TACCTCACCG GCGCGCGCCG CATCGAGGTC CCGGCGCACC GCCGCCCTGG CAACGGCCTG TACATCGGCG TCGAAGGGGC GCGCGCCAAC AACCTCCAGA ACGTCAGCGC GAAGATCCCT CTCGGCACCT TCACCTGCGT CACCGGCGTC TCCGGCTCGG GCAAGTCGAG CTTCACGATC GACACCCTCC ACGCCGTTGC CGCCCGCACG CTCAATGGCG CGCGCGTGAT CGCCGGCGCG CACGACCGCG TCACCGGCCT CGAACATTGC GACAAGGTGA TCGAGATCGA CCAGTCGCCC ATCGGCCGCA CCCCCCGGTC AAACCCCGCC ACCTACACCG GCGCCTTCAC CCTGATCCGC GACTGGTTCG CCGGACTGCC CGAATCCGCC GCGCGCGGCT ACAAGGCCGG TCGGTTCAGC TTCAACGTCA AGGGCGGCCG TTGCGAGAAG TGCCAGGGCG ATGGCCTGAT CAAGATCGAG ATGCACTTCC TGCCCGACGT CTACGTCACC TGCGAGGAGT GCGACGGCAA GCGCTACAAC CGCGAAACGC TCGAGGTGAA GTTCAAGGGC CACTCCATCG CCGACGTGCT CGACATGACG ATCGAGGACG CCGAGGAGTT CTTCAAGGCC GTCCCCCCGA TCCGCGACCG GATGCACATG CTCAACGAAG TGGGCCTGGG CTACGTCAAG GTCGGCCAGC AGGCGACGAC GCTAAGCGGC GGCGAGGCCC AACGCGTGAA ACTCGCCAAG GAACTCGCCC GCCGCTCCAC CGGGCAGACG CTCTACATCC TCGACGAACC CACCACCGGC CTCCACTTCG AAGACGTGCG CAAACTGCTC GAAGTCCTCC ACCGCCTCGT CGAACAGGGC AACAGCGTGG TCGTGATCGA ACACAACCTC GACGTGATCA AGACCGCCGA CTGGATCATC GACCTCGGCC CCGAAGGCGG CGTGCGCGGC GGCGAAATCG TCGCCGAAGG CACTCCCGAA CAGGTGGCCA AGAACCCGCG CAGCTTCACC GGCCAATACA TGGCCCCCCT GCTGAAACGC TGA
|
Protein sequence | MSLTTISVRG AREHNLKGID VDLPRDSLIV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS AYARQFLEMM QKPDVEHIDG LSPAISIEQK TTSRNPRSTV ATVTEVYDYM RLLWARVGIP YSPATGEPIS AQTVSQMVDR VMALPEGTRA YLLAPVVRGR KGEYRRELAE WQKAGYTRVR INGEMMPIED APALDKKLKH DIEVVVDRIA VRDGIQTRLA DSFEQALKLA DGLAYIDLAD GVVPGREDEA EADAKGKKGS MKKTGLPANR IVFSEKFACP VSGFTIESIE PRLFSFNAPQ GACPACDGLG EKLLFDPQLV VPNENLSLKQ GAVVPWAKSN PPSPYYMQVL ASLAGHFGFS LDTPWDALPA EVKIVILYGT AGKAVPLTFI DGKKSYTVQK AFEGVIGNLN RRMLQTESAW MREELGKFQT AQPCETCEGK RLKPEALSVK VAGADISSIT RLSVAAAVEW YGALDSKLTP QQSQIARAIL KEINERLGFL NNVGLDYLNL DRTSGTLSGG ESQRIRLASQ IGSGLSGVLY VLDEPSIGLH QRDNDRLLET LKRLRDLGNT VIVVEHDEDA IRTADHVVDL GPGAGVHGGE IVAQGTLDDI LSNPNSLTGQ YLTGARRIEV PAHRRPGNGL YIGVEGARAN NLQNVSAKIP LGTFTCVTGV SGSGKSSFTI DTLHAVAART LNGARVIAGA HDRVTGLEHC DKVIEIDQSP IGRTPRSNPA TYTGAFTLIR DWFAGLPESA ARGYKAGRFS FNVKGGRCEK CQGDGLIKIE MHFLPDVYVT CEECDGKRYN RETLEVKFKG HSIADVLDMT IEDAEEFFKA VPPIRDRMHM LNEVGLGYVK VGQQATTLSG GEAQRVKLAK ELARRSTGQT LYILDEPTTG LHFEDVRKLL EVLHRLVEQG NSVVVIEHNL DVIKTADWII DLGPEGGVRG GEIVAEGTPE QVAKNPRSFT GQYMAPLLKR
|
| |