Gene Saro_1192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1192 
SymboluvrA 
ID3916489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1235856 
End bp1238768 
Gene Length2913 bp 
Protein Length970 aa 
Translation table11 
GC content66% 
IMG OID640443928 
Productexcinuclease ABC subunit A 
Protein accessionYP_496471 
Protein GI87199214 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.548367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCTCA CTACCATTTC CGTCCGCGGC GCTCGTGAAC ACAATCTCAA GGGGATCGAC 
GTCGATCTCC CGCGCGACAG CCTGATCGTC ATCACCGGCC TTTCCGGCTC CGGCAAGTCG
AGCCTGGCGT TCGACACGAT CTACGCCGAA GGGCAGCGCC GCTACGTGGA ATCGCTCTCG
GCCTACGCGC GCCAGTTCCT CGAGATGATG CAGAAGCCGG ATGTCGAGCA TATCGACGGC
CTCTCGCCCG CCATCTCGAT CGAGCAGAAG ACCACCAGCC GCAACCCGCG CTCCACCGTC
GCCACGGTGA CCGAGGTCTA CGACTACATG CGCCTGCTCT GGGCGCGCGT CGGCATTCCC
TATTCCCCCG CCACCGGCGA ACCGATCAGC GCGCAGACCG TCAGCCAGAT GGTCGACCGC
GTCATGGCCC TGCCCGAAGG CACCCGCGCC TACCTGCTCG CTCCCGTCGT GCGCGGCCGC
AAGGGCGAAT ACCGCCGCGA ACTCGCCGAA TGGCAGAAGG CCGGCTACAC CCGCGTCCGC
ATCAACGGCG AAATGATGCC CATCGAGGAC GCCCCCGCGC TCGACAAGAA GCTCAAGCAC
GACATCGAGG TTGTGGTCGA TCGCATCGCC GTCCGCGACG GCATCCAGAC GCGCCTCGCC
GACAGCTTCG AACAGGCCCT CAAGCTCGCC GACGGCCTTG CCTACATCGA TCTGGCGGAT
GGCGTCGTGC CTGGCCGTGA GGACGAGGCG GAGGCCGATG CCAAGGGCAA GAAGGGCTCG
ATGAAGAAGA CCGGCCTCCC GGCCAACCGC ATCGTCTTCT CCGAAAAGTT CGCCTGCCCC
GTCAGCGGCT TCACCATCGA GAGCATCGAG CCGCGCCTGT TCTCGTTCAA CGCCCCCCAG
GGCGCCTGCC CCGCGTGCGA CGGTCTGGGC GAAAAGCTGC TGTTCGATCC CCAGCTCGTC
GTCCCCAACG AGAACCTCAG CCTCAAGCAG GGCGCCGTCG TCCCCTGGGC GAAAAGCAAC
CCGCCGTCGC CCTACTACAT GCAGGTCCTC GCCAGCCTCG CCGGCCACTT CGGCTTCAGC
CTCGACACCC CGTGGGACGC GCTCCCGGCG GAAGTGAAGA TCGTCATCCT CTATGGCACC
GCCGGCAAGG CCGTCCCGCT CACCTTCATC GACGGCAAGA AGTCCTACAC CGTCCAGAAG
GCCTTCGAAG GCGTGATCGG CAACCTCAAC CGCCGCATGC TGCAAACCGA ATCCGCCTGG
ATGCGCGAGG AACTCGGCAA GTTCCAGACC GCCCAGCCGT GCGAGACCTG CGAAGGCAAG
CGCCTCAAGC CCGAGGCCCT GTCGGTCAAG GTCGCCGGGG CAGACATCTC CTCGATCACC
CGCCTCTCCG TCGCCGCCGC GGTCGAATGG TACGGCGCGC TCGACAGCAA GCTCACCCCG
CAGCAAAGCC AGATCGCCCG CGCCATCCTC AAGGAAATCA ACGAGCGTCT CGGCTTCCTC
AACAACGTCG GCCTCGATTA CCTCAACCTC GATCGCACCT CGGGCACGCT CTCGGGCGGC
GAGAGCCAGC GCATCCGCCT CGCCAGCCAG ATCGGCTCCG GCCTCTCGGG CGTGCTCTAC
GTGCTCGACG AACCGTCCAT CGGCCTCCAC CAGCGCGACA ACGACCGCCT CCTCGAAACC
CTCAAGCGGC TGCGCGATCT CGGCAACACC GTGATCGTCG TCGAACACGA CGAGGACGCA
ATCCGCACCG CCGACCACGT GGTCGATCTT GGCCCCGGCG CGGGCGTCCA CGGCGGCGAG
ATCGTCGCGC AGGGTACGCT CGACGATATC CTGTCCAACC CGAACAGCCT CACCGGCCAG
TACCTCACCG GCGCGCGCCG CATCGAGGTC CCGGCGCACC GCCGCCCTGG CAACGGCCTG
TACATCGGCG TCGAAGGGGC GCGCGCCAAC AACCTCCAGA ACGTCAGCGC GAAGATCCCT
CTCGGCACCT TCACCTGCGT CACCGGCGTC TCCGGCTCGG GCAAGTCGAG CTTCACGATC
GACACCCTCC ACGCCGTTGC CGCCCGCACG CTCAATGGCG CGCGCGTGAT CGCCGGCGCG
CACGACCGCG TCACCGGCCT CGAACATTGC GACAAGGTGA TCGAGATCGA CCAGTCGCCC
ATCGGCCGCA CCCCCCGGTC AAACCCCGCC ACCTACACCG GCGCCTTCAC CCTGATCCGC
GACTGGTTCG CCGGACTGCC CGAATCCGCC GCGCGCGGCT ACAAGGCCGG TCGGTTCAGC
TTCAACGTCA AGGGCGGCCG TTGCGAGAAG TGCCAGGGCG ATGGCCTGAT CAAGATCGAG
ATGCACTTCC TGCCCGACGT CTACGTCACC TGCGAGGAGT GCGACGGCAA GCGCTACAAC
CGCGAAACGC TCGAGGTGAA GTTCAAGGGC CACTCCATCG CCGACGTGCT CGACATGACG
ATCGAGGACG CCGAGGAGTT CTTCAAGGCC GTCCCCCCGA TCCGCGACCG GATGCACATG
CTCAACGAAG TGGGCCTGGG CTACGTCAAG GTCGGCCAGC AGGCGACGAC GCTAAGCGGC
GGCGAGGCCC AACGCGTGAA ACTCGCCAAG GAACTCGCCC GCCGCTCCAC CGGGCAGACG
CTCTACATCC TCGACGAACC CACCACCGGC CTCCACTTCG AAGACGTGCG CAAACTGCTC
GAAGTCCTCC ACCGCCTCGT CGAACAGGGC AACAGCGTGG TCGTGATCGA ACACAACCTC
GACGTGATCA AGACCGCCGA CTGGATCATC GACCTCGGCC CCGAAGGCGG CGTGCGCGGC
GGCGAAATCG TCGCCGAAGG CACTCCCGAA CAGGTGGCCA AGAACCCGCG CAGCTTCACC
GGCCAATACA TGGCCCCCCT GCTGAAACGC TGA
 
Protein sequence
MSLTTISVRG AREHNLKGID VDLPRDSLIV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS 
AYARQFLEMM QKPDVEHIDG LSPAISIEQK TTSRNPRSTV ATVTEVYDYM RLLWARVGIP
YSPATGEPIS AQTVSQMVDR VMALPEGTRA YLLAPVVRGR KGEYRRELAE WQKAGYTRVR
INGEMMPIED APALDKKLKH DIEVVVDRIA VRDGIQTRLA DSFEQALKLA DGLAYIDLAD
GVVPGREDEA EADAKGKKGS MKKTGLPANR IVFSEKFACP VSGFTIESIE PRLFSFNAPQ
GACPACDGLG EKLLFDPQLV VPNENLSLKQ GAVVPWAKSN PPSPYYMQVL ASLAGHFGFS
LDTPWDALPA EVKIVILYGT AGKAVPLTFI DGKKSYTVQK AFEGVIGNLN RRMLQTESAW
MREELGKFQT AQPCETCEGK RLKPEALSVK VAGADISSIT RLSVAAAVEW YGALDSKLTP
QQSQIARAIL KEINERLGFL NNVGLDYLNL DRTSGTLSGG ESQRIRLASQ IGSGLSGVLY
VLDEPSIGLH QRDNDRLLET LKRLRDLGNT VIVVEHDEDA IRTADHVVDL GPGAGVHGGE
IVAQGTLDDI LSNPNSLTGQ YLTGARRIEV PAHRRPGNGL YIGVEGARAN NLQNVSAKIP
LGTFTCVTGV SGSGKSSFTI DTLHAVAART LNGARVIAGA HDRVTGLEHC DKVIEIDQSP
IGRTPRSNPA TYTGAFTLIR DWFAGLPESA ARGYKAGRFS FNVKGGRCEK CQGDGLIKIE
MHFLPDVYVT CEECDGKRYN RETLEVKFKG HSIADVLDMT IEDAEEFFKA VPPIRDRMHM
LNEVGLGYVK VGQQATTLSG GEAQRVKLAK ELARRSTGQT LYILDEPTTG LHFEDVRKLL
EVLHRLVEQG NSVVVIEHNL DVIKTADWII DLGPEGGVRG GEIVAEGTPE QVAKNPRSFT
GQYMAPLLKR