Gene Swit_3107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3107 
SymboluvrA 
ID5200350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3416276 
End bp3419242 
Gene Length2967 bp 
Protein Length988 aa 
Translation table11 
GC content68% 
IMG OID640582655 
Productexcinuclease ABC subunit A 
Protein accessionYP_001263594 
Protein GI148556012 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.378595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACTC ATATTTCGGT GCGCGGCGCG CGCGAGCACA ACCTCAAGGG CGTCGATGTC 
GAGCTGCCCC GCGACCGGCT GATCGTGATC ACCGGCCTGT CCGGATCGGG CAAGTCGAGT
CTCGCCTTCG ACACCATCTA TGCCGAGGGG CAACGCCGCT ACGTCGAGTC GCTGTCGGCC
TATGCGCGCC AGTTCCTCGA GCTGATGCAG AAGCCCGACG TCGACCATAT CGAGGGCCTC
TCCCCCGCCA TCTCGATCGA GCAGAAGACG ACCAGCCGCA ACCCGCGCTC GACCGTCGCG
ACGGTGACCG AGATCTACGA CTATATGCGC CTGCTCTGGG CGCGGGTTGG CGTGCCCTAT
TCGCCCGCGA CGGGCCTGCC GATCTCGGCG CAGACGGTCA GCCAGATGGT CGACCGCATC
CTGCAATTGA AGGAGGGCAC CCGCTTCCTG CTGCTCGCCC CGGTCGTGCG CGGCCGCAAG
GGCGAGTATC GCAAGGAGCT GGCCGAATGG CAGAAGGCCG GCTTCACCCG CGTCCGCATC
GACGGCGAGA TGCACGAGAT CGAGGCGGCG CCCGCGCTCG ACAAGAAGTA CAAGCACGAC
ATCGAGGTGG TGGTCGACCG CCTCGTCGTC CGCGAGGGGA TGGAGACCCG CCTCGCCCAG
AGCCTGGAGA CCGCGCTGCG CCTGGCCGAC GGCCTGGCCT ATGCCGACCT GGTCGACACG
ACGGTGGCGG AGGCGATGGC ACCGAGCGGC GTCCGCGAGA TGCCGGCCAG CTTCCAATAT
GCCGAGACGA TGGAGACGGG CGGCGCGATG AAGGGCGCCG GCGTTCCCGC CAACCGCATC
ACCTTCTCGG AGAAATTCGC CTGCCCCGTC TCGGGCTTCA CCATCGCCGA GATCGAGCCG
CGGCTGTTCT CGTTCAACGC GCCGCAGGGC GCCTGCCCGG CCTGCGACGG CCTCGGCGAG
AAGCTCTATT TCGACCCGCA GCTCGTCGTC CCCAACGAGG CGCTGTCGAT CAAGCAGGGC
GCGGTGGTGC CCTGGGCCAA GTCCAACCCG CCCAGCCCCT ATTACATGCA GGTACTGAGC
TCGGTCGCGC GCGCCTACGG CTTCAAGCTG GAGACGCCGT GGAACGAGCT GCCCGAGGAG
ATCCGCGATA CCGTCCTCTA CGGCACCAAG GGGCGGGTGA TCACCCTCAC CTTCATCGAC
GGCCGCAAGA GCTACGACGT GCAGAAGCCG TTCGAGGGGG TGATCCACAA TCTCGAGCGC
CGCATGCTGT CGACCGAGAG CGCCTGGATG CGCGAGGAGC TCGCCAAGTA TCAGAGCGCC
GCGCATTGCG AGGTGTGCGG CGGCGCGCGC CTCAAGCCCG AGGCGCTGGC GGTCAAGATC
GCGGGCGAGG ACATCTCGAT GTCGACCCGC CGCGCGGTCG GCCCCGCGCT CAAATTCTTC
CGCGAGATGC CCGACCATCT CAACGAGCAG CAGAAGGCGA TCGCCGAGCG CATCCTCAAG
GAGATCGTCG AGCGGCTGGG CTTCCTCGAC AATGTCGGGC TCGACTATCT CAACCTCGAC
CGCACCTCGG GCACGCTGTC GGGCGGCGAG AGCCAGCGCA TCCGCCTCGC CAGCCAGATC
GGGTCGGGAC TTTCCGGCGT CCTCTACGTG CTCGACGAGC CGTCGATCGG CCTGCACCAG
CGCGACAACG ACCGGCTGCT GGTGACGCTG CGGCGGCTGC GCGACCTCGG CAACACGGTG
ATCGTCGTCG AGCATGACGA GGATGCGATC CGCACCGCCG ACTATATCCT CGACATGGGG
CCGGGCGCCG GCGTCCATGG CGGCGAGGTG GTGGCGTCGG GCACGCTCGA CGACATCCTC
GCCAACGAGG CGAGCCTAAC CGGCGACTAT CTGTCGGGCC GCCGCATGGT CGAGGTGCCG
GCGAAGCGGC GCAAGGGCAA CGGCAAGAAG GTCACCGTCC ACAATGCGCG GGCGAACAAC
CTGAAGAATG TCACCGCGGC GATCCCGCTC GGCACCTTCA CCTGCATCAC CGGCGTCTCG
GGATCGGGCA AGTCGAGCTT CACGATCGAC ACGCTCTACG CCTCGGCCGC GCGCCAGCTC
AACGGCGCGC GCGTGGTGGC GGGCGCGCAC GACAAGATCA CCGGTCTGCA ATATTGCGAC
AAGGTCATCG ACATCGACCA GTCGCCGATC GGCCGCACCC CGCGCTCGAA CCCGGCGACC
TATACCGGCG CCTTCACCAA CATCCGCGAC TGGTTCGCGG GGCTCCCGGA GAGCGAGGCG
CGCGGCTACA AGCCGGGCCG GTTCAGCTTC AACGTCAAGG GCGGCCGCTG CGAGGCGTGC
ACCGGCGACG GCCTGCTCAA GATCGAGATG CACTTCCTGC CCGACGTCTA CGTCACCTGC
GACGTCTGCC ACGGCGCGCG CTACAACCGC GAGACGCTGG AGGTGAAGTT CAAGGACAAG
AGCATCGCCG ACGTGCTCGA CATGACCGTC GAGGACGCGG TCGAGTTCTT CAAGGCGGTC
CCGCCGATCC GCGACAAGAT GGCGATGCTG GCCGAGGTCG GCCTCGGCTA CATCAAGGTC
GGCCAGCAGG CGACGACGCT GTCGGGCGGC GAGGCGCAGC GCGTCAAGCT CGCCAAGGAA
CTGTCGCGCC GGTCGACCGG CAACACCCTC TACATCCTCG ACGAGCCGAC CACCGGCCTC
CACTTCGAGG ACGTCCGCAA GCTGCTCGAA GTGCTCCACG CGCTGGTCGA GCAGGGCAAC
ACCGTGGTGG TGATCGAGCA CAACCTCGAG GTCATCAAGA CCGCCGACTG GGTGATGGAC
CTCGGCCCCG AGGGCGGCGT GCGCGGCGGC GAGATCGTGG CGGAAGGGAC GCCGGAGCAG
GTGGCGAAGG AGCCGCGCAG CTACACGGGC GCCTATCTCA AGCCGCTGCT GGAGGCAGGG
GCGAGGAGTG CTCAGGCTGC CGAATAG
 
Protein sequence
MLTHISVRGA REHNLKGVDV ELPRDRLIVI TGLSGSGKSS LAFDTIYAEG QRRYVESLSA 
YARQFLELMQ KPDVDHIEGL SPAISIEQKT TSRNPRSTVA TVTEIYDYMR LLWARVGVPY
SPATGLPISA QTVSQMVDRI LQLKEGTRFL LLAPVVRGRK GEYRKELAEW QKAGFTRVRI
DGEMHEIEAA PALDKKYKHD IEVVVDRLVV REGMETRLAQ SLETALRLAD GLAYADLVDT
TVAEAMAPSG VREMPASFQY AETMETGGAM KGAGVPANRI TFSEKFACPV SGFTIAEIEP
RLFSFNAPQG ACPACDGLGE KLYFDPQLVV PNEALSIKQG AVVPWAKSNP PSPYYMQVLS
SVARAYGFKL ETPWNELPEE IRDTVLYGTK GRVITLTFID GRKSYDVQKP FEGVIHNLER
RMLSTESAWM REELAKYQSA AHCEVCGGAR LKPEALAVKI AGEDISMSTR RAVGPALKFF
REMPDHLNEQ QKAIAERILK EIVERLGFLD NVGLDYLNLD RTSGTLSGGE SQRIRLASQI
GSGLSGVLYV LDEPSIGLHQ RDNDRLLVTL RRLRDLGNTV IVVEHDEDAI RTADYILDMG
PGAGVHGGEV VASGTLDDIL ANEASLTGDY LSGRRMVEVP AKRRKGNGKK VTVHNARANN
LKNVTAAIPL GTFTCITGVS GSGKSSFTID TLYASAARQL NGARVVAGAH DKITGLQYCD
KVIDIDQSPI GRTPRSNPAT YTGAFTNIRD WFAGLPESEA RGYKPGRFSF NVKGGRCEAC
TGDGLLKIEM HFLPDVYVTC DVCHGARYNR ETLEVKFKDK SIADVLDMTV EDAVEFFKAV
PPIRDKMAML AEVGLGYIKV GQQATTLSGG EAQRVKLAKE LSRRSTGNTL YILDEPTTGL
HFEDVRKLLE VLHALVEQGN TVVVIEHNLE VIKTADWVMD LGPEGGVRGG EIVAEGTPEQ
VAKEPRSYTG AYLKPLLEAG ARSAQAAE