Gene Mjls_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3003 
SymboluvrA 
ID4878716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp3135299 
End bp3138220 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content67% 
IMG OID640140301 
Productexcinuclease ABC subunit A 
Protein accessionYP_001071273 
Protein GI126435582 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.855569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.034696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAG CCTGGGAGAG GACCGAGTTG GCTGACCGCC TGATCGTCAA GGGTGCGCGC 
GAGCACAACC TGCGAAGCGT CGACCTTGAC CTACCGCGTG ACGCCTTGAT CGTGTTCACC
GGCCTGTCCG GTTCGGGCAA GTCGTCGCTC GCGTTCGACA CGATCTTCGC CGAGGGGCAG
CGCCGCTACG TCGAATCGCT CTCGGCGTAC GCCCGGCAGT TCCTCGGGCA GATGGACAAA
CCCGACGTCG ACTTCATCGA GGGTCTGTCG CCCGCGGTGT CCATCGACCA GAAGTCGACC
AACCGCAACC CGCGGTCGAC GGTCGGCACC ATCACCGAGG TCTACGACTA CCTGCGCCTT
CTGTACGCGC GGGCCGGCAT GCCGCACTGC CCGGTGTGTG GTGAGCGGAT CGCACGGCAG
ACCCCTCAGC AGATCGTCGA CCAGGTCCTG GCCATGGACG AAGGGCTGCG TTTCCAGGTG
CTCGCACCGG TCGTCCGCAC CCGCAAGGGT GAGTTCGTCG ACCTGTTCGA GAAGCTCAAC
TCCCAGGGTT ACAGCCGCGT GCGCGTCGAC GGGGTGGTCT ATCCGCTGAC CGAACCGCCC
AAGCTCAAGA AGCAGGAGAA GCACGACATC GAGGTGGTGG TCGACCGGCT CACGGTGAAG
GCCACCGCCA AACAGCGGCT GACCGATTCG GTGGAGACGG CTCTGACGCT GGCCGACGGC
ATCGTGGTGC TCGAGTTCGT CGACCGTGAG GACGACCATC CGCACCGTGA GCAGCGGTTC
TCCGAGAAAC TGGCGTGTCC CAACGGCCAC CCCCTGGCCG TCGACGATCT CGAACCGCGG
TCCTTCTCGT TCAACTCGCC CTACGGCGCC TGTCCGGAGT GCGCCGGGCT CGGCGTCCGC
AAGGAGGTCG ACCCCGAACT CGTCATCCCC GACCCGGATC TGACGCTGGC CGAGGGCGCC
ATCGCCCCGT GGTCGATGGG CCAGACCGCG GAGTACTTCA CCCGCATGCT GACCGGCCTG
GGTGAGTCGA TGGGCTTCGA CGTCGACACA CCGTGGAAGA AGCTGCCCGC CAAGGTGCGT
CGCGCGATCC TCGAGGGCTG CGACGAGCAG GTGCACGTCA AGTACCGCAA TCGGTACGGC
CGCACCCGCT CGTACTACGC CGACTTCGAG GGCGTCATGG CGTTCCTGCA GCGCCGCATG
GAGCAGACCG ACTCCGAGCA GATGAAGGAG CGCTACGAGG GCTTCATGCG CGACGTGCCG
TGCCCGGAGT GCGACGGCAC CCGCCTCAAG CCCGAGATCC TCGCGGTCAC GCTGGCCGCG
GGCGACCACG GCGCGAAGTC GATCGCCGAG GTCGCGGAGC TGTCGATCGC CGACTGCGCC
GACTTCCTCA ACGCGCTCAC CCTCGGCGCG CGCGAACAGG CCATCGCCGG TCAGGTCCTC
AAGGAGATCC AGTCCCGGCT GGGCTTCCTG CTCGACGTCG GGCTGGACTA CCTGTCGCTC
TCGCGCGCTG CGGCCACGCT GTCCGGCGGC GAGGCGCAGC GGATCCGGCT CGCCACGCAG
ATCGGTTCGG GTCTGGTGGG CGTGCTCTAC GTCCTCGACG AGCCGTCTAT CGGTCTGCAC
CAGCGCGACA ACCGCCGCCT GATCGACACC CTGGTCCGGC TGCGGGAACT GGGCAACACG
CTCATCGTGG TCGAACACGA CCTCGACACC ATCGCCCACG CCGACTGGGT GGTCGACATC
GGCCCGGCCG CCGGTGAGCA CGGCGGGCGG ATCGTGCACA GCGGCACCTA CCAGGATCTG
CTGCGCAATC CGGAGTCGCT GACCGGCGCG TACCTCTCCG GTCGCGAGAG CATCGAGGTG
CCTGCGATCC GGCGTCCCGT CGACCGCAAA CGACAACTCA CCGTCATCGG CGCGCGTGAG
CACAACCTCA AGGACGTCGA CGTCGCATTC CCCCTCGGGG TGCTCACGTC GGTCACCGGG
GTGTCGGGCT CGGGTAAGTC GACGCTGGTC AACGACATCC TGGCGTCCGT GCTGGCCAAC
AAACTCAACG GCGCCCGGCA GGTGCCCGGC CGCCACACCC GCATCAACGG CCTCGACCAA
CTCGACAAGC TCGTGCGGGT GGACCAGTCG CCGATCGGGC GGACCCCACG GTCGAACCCC
GCGACCTACA CCGGTGTATT CGACAAGATC CGGTCGCTGT TCGCGGCGAC GACGGAGGCC
AAGGTCCGCG GATACCAGCC GGGACGTTTC TCGTTCAACG TCAAGGGCGG CCGCTGCGAG
GCCTGTTCGG GCGACGGCAC CATCAAGATC GAGATGAACT TCCTGCCGGA CGTGTACGTC
CCGTGCGAGG TGTGCCACGG TGCCCGCTAC AACCGCGAGA CGCTCGAGGT GCACTACAAG
GGCAAGACCA TCGCCGAGGT GCTCGACCTG TCGATCGAGG ACGCTGCGGA GTTCTTCGAA
CCGATCAGCT CGATCCACCG CTACCTCAAG ACGCTGGTCG ACGTCGGCCT GGGCTACGTG
CGGCTGGGCC AGCCCGCACC GACGCTGTCC GGGGGTGAGG CCCAGCGCGT GAAACTGGCC
GCCGAGCTGC AGAAGCGGTC GACCGGCCGC ACGGTCTACA TCCTCGACGA GCCGACCACC
GGCCTGCACT TCGAGGACAT CCGCAAGCTG CTCAAGGTCA TCAACGGTCT GGTCGACAAG
GGCAACACGG TGATCGTCAT CGAGCACAAC CTCGACGTGA TCAAGACCTC CGACTGGATC
ATCGACATGG GTCCCGAAGG CGGCGCCGGC GGCGGCACCG TGGTCGCGCA GGGCACCCCC
GAGGACGTCG CGGCCAACAC CGACAGCTAC ACCGGCCACT TCCTCGCCGA GATGCTCGAC
GTACCCGCGC CGACCCCGAA GCGGCGCAAG GTCAGCGCGT GA
 
Protein sequence
MSEAWERTEL ADRLIVKGAR EHNLRSVDLD LPRDALIVFT GLSGSGKSSL AFDTIFAEGQ 
RRYVESLSAY ARQFLGQMDK PDVDFIEGLS PAVSIDQKST NRNPRSTVGT ITEVYDYLRL
LYARAGMPHC PVCGERIARQ TPQQIVDQVL AMDEGLRFQV LAPVVRTRKG EFVDLFEKLN
SQGYSRVRVD GVVYPLTEPP KLKKQEKHDI EVVVDRLTVK ATAKQRLTDS VETALTLADG
IVVLEFVDRE DDHPHREQRF SEKLACPNGH PLAVDDLEPR SFSFNSPYGA CPECAGLGVR
KEVDPELVIP DPDLTLAEGA IAPWSMGQTA EYFTRMLTGL GESMGFDVDT PWKKLPAKVR
RAILEGCDEQ VHVKYRNRYG RTRSYYADFE GVMAFLQRRM EQTDSEQMKE RYEGFMRDVP
CPECDGTRLK PEILAVTLAA GDHGAKSIAE VAELSIADCA DFLNALTLGA REQAIAGQVL
KEIQSRLGFL LDVGLDYLSL SRAAATLSGG EAQRIRLATQ IGSGLVGVLY VLDEPSIGLH
QRDNRRLIDT LVRLRELGNT LIVVEHDLDT IAHADWVVDI GPAAGEHGGR IVHSGTYQDL
LRNPESLTGA YLSGRESIEV PAIRRPVDRK RQLTVIGARE HNLKDVDVAF PLGVLTSVTG
VSGSGKSTLV NDILASVLAN KLNGARQVPG RHTRINGLDQ LDKLVRVDQS PIGRTPRSNP
ATYTGVFDKI RSLFAATTEA KVRGYQPGRF SFNVKGGRCE ACSGDGTIKI EMNFLPDVYV
PCEVCHGARY NRETLEVHYK GKTIAEVLDL SIEDAAEFFE PISSIHRYLK TLVDVGLGYV
RLGQPAPTLS GGEAQRVKLA AELQKRSTGR TVYILDEPTT GLHFEDIRKL LKVINGLVDK
GNTVIVIEHN LDVIKTSDWI IDMGPEGGAG GGTVVAQGTP EDVAANTDSY TGHFLAEMLD
VPAPTPKRRK VSA