Gene Mmcs_2988 details


Gene Information

Locus tag: Mmcs_2988
Symbol: uvrA
ID: 4111820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3155397
End bp: 3158318
Gene length: 2922 bp
Protein length: 973 aa
Translation table: 11
GC content: 67%
IMG OID: 638032116
Product: excinuclease ABC subunit A
Protein accession: YP_640151
Protein GI: 108799954
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
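
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(3155397, 3158318)
print(length_bp, protein_length(length_bp))  # prints: 2922 973
```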


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAAG CCTGGGAGAG GACCGAGTTG GCTGACCGCC TGATCGTCAA GGGTGCGCGC 
GAGCACAACC TGCGAAGCGT CGACCTTGAC CTACCGCGTG ACGCCTTGAT CGTGTTCACC
GGCCTGTCCG GTTCGGGCAA GTCGTCGCTC GCGTTCGACA CGATCTTCGC CGAGGGGCAG
CGCCGCTACG TCGAATCGCT CTCGGCGTAC GCCCGGCAGT TCCTCGGGCA GATGGACAAA
CCCGACGTCG ACTTCATCGA GGGTCTGTCG CCCGCGGTGT CCATCGACCA GAAATCGACC
AACCGCAACC CGCGTTCGAC CGTCGGCACC ATCACCGAGG TCTACGACTA CCTGCGCCTT
CTGTACGCGC GGGCCGGCAC GCCGCACTGC CCGGTGTGTG GTGAGCGGAT CGCGCGCCAG
ACCCCTCAGC AGATCGTCGA CCAGGTCCTC GCCATGGACG AGGGGCTGCG TTTCCAGGTG
CTCGCACCGG TCGTACGCAC CCGCAAGGGT GAGTTCGTCG ACCTGTTCGA GAAGCTCAAC
TCCCAGGGTT ACAGCCGGGT GCGCGTCGAC GGGGTGGTCT ATCCGCTGAC CGAACCGCCC
AAGCTCAAGA AGCAGGAGAA GCACGACATC GAGGTGGTGG TCGACCGGCT CACGGTGAAG
GCCACCGCCA AACAGCGGCT GACCGATTCG GTGGAGACGG CTCTGACGCT GGCCGACGGC
ATCGTGGTGC TCGAGTTCGT CGACCGTGAG GACGACCATC CACACCGTGA GCAGCGATTC
TCCGAGAAAC TGGCGTGTCC CAACGGCCAC CCGCTGGCCG TCGACGATCT CGAACCGCGG
TCGTTCTCGT TCAACTCGCC CTACGGCGCC TGTCCGGAAT GCGCCGGGCT CGGCGTCCGC
AAGGAGGTCG ATCCCGAACT CGTCATCCCC GACCCGGATC TGACGCTGGC CGAGGGCGCC
ATCGCCCCGT GGTCGATGGG CCAGACCGCG GAGTACTTCA CCCGCATGCT GACCGGCCTG
GGTGAGTCGA TGGGCTTTGA CGTCGACACG CCTTGGAAGA AGCTGCCCGC CAAGGTGCGT
CGCGCGATCC TCGAGGGCTG CGACGAGCAG GTGCACGTCA AGTACCGCAA CCGGTACGGC
CGCACCCGCT CGTACTACGC CGACTTCGAG GGCGTCATGG CGTTCCTGCA GCGCCGCATG
GAGCAGACCG ACTCCGAGCA GATGAAGGAG CGCTACGAGG GCTTCATGCG CGACGTGCCG
TGTCCGGAGT GCGACGGCAC CCGCCTCAAG CCCGAGATCC TGGCGGTCAC GCTGGCCGCG
GGCGACCACG GCGCGAAGTC GATCGCCGAG GTCGCGGAGC TGTCGATCGC CGACTGCGCC
GACTTCCTCA ATGCGCTCAC CCTCGGCCCC CGCGAGCAGG CCATCGCCGG TCAGGTCCTC
AAGGAGATCC AGTCCCGACT GGGCTTCCTG CTCGACGTCG GGTTGGACTA CCTGTCGCTC
TCACGCGCGG CGGCCACGCT GTCCGGCGGC GAGGCGCAGC GGATCCGGCT CGCCACGCAG
ATCGGTTCCG GTCTGGTGGG CGTGCTCTAC GTCCTCGACG AGCCGTCGAT CGGTCTGCAT
CAGCGCGACA ACCGCCGCCT GATCGACACC CTGGTCCGGC TGCGGGAACT GGGCAACACG
CTCATCGTGG TCGAACACGA CCTCGACACC ATCGCCCACG CCGACTGGGT GGTCGACATC
GGCCCGGCCG CCGGCGAGCA CGGCGGGCGG ATCGTGCACA GCGGCACCTA CCAGGATCTG
CTCCGCAATC CGGAGTCGCT GACCGGCGCG TACCTCTCCG GTCGCGAGAG CATCGAGGTG
CCTGCGATCC GGCGCCCCGT CGACCGCAAA CGACAACTCA CCGTCATCGG CGCGCGTGAG
CACAACCTCA AGGACGTCGA CGTCGCATTC CCCCTCGGGG TGCTCACGTC GGTCACCGGG
GTGTCGGGCT CGGGTAAGTC GACGCTGGTC AACGACATCC TGGCGTCGGT GCTGGCCAAC
AAACTCAACG GCGCCCGGCA GGTACCCGGC CGCCACACCC GCATCAACGG CCTCGACCAA
CTCGACAAGC TCGTGCGGGT GGACCAGTCG CCGATCGGGC GGACCCCACG GTCGAACCCC
GCGACCTACA CCGGAGTGTT CGACAAGATC CGGTCGCTGT TCGCGGCGAC CACGGAGGCC
AAGGTCCGCG GATACCAGCC GGGACGTTTC TCGTTCAACG TCAAAGGCGG CCGCTGCGAG
GCCTGTTCGG GCGACGGCAC CATCAAGATC GAGATGAACT TCCTACCGGA CGTGTACGTC
CCGTGCGAGG TGTGCCACGG TGCCCGCTAC AACCGCGAGA CGCTCGAGGT GCACTACAAG
GGCAAGACCA TCGCCGAGGT GCTCGACCTG TCGATCGAGG ACGCTGCGGA GTTCTTCGAA
CCGATCAGCT CGATCCACCG CTACCTCAAG ACGCTGGTGG ACGTCGGCCT GGGGTACGTG
CGGCTCGGCC AGCCGGCGCC GACGCTGTCC GGCGGTGAGG CCCAGCGCGT GAAGCTGGCC
GCCGAACTGC AGAAGCGGTC GACAGGCCGT ACGGTCTACA TTCTCGACGA ACCGACCACC
GGCCTGCACT TCGAGGACAT CCGCAAGCTG CTCAAGGTCA TCAACGGTCT GGTCGACAAG
GGCAACACGG TGATCGTCAT CGAGCACAAC CTCGACGTGA TCAAGACCTC CGACTGGATC
ATCGACATGG GTCCCGAAGG CGGCGCGGGC GGCGGCACCG TGGTCGCGCA GGGCACGCCT
GAGGACGTCG CGGCCAACAC CGACAGCTAC ACCGGAGACT TCCTCGCCGA GATGCTCGAC
GTACCCGCGC CGACCCGGAA GCGGCGCAAG GTCAGCGCGT GA
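
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how one might do that and compute the GC fraction reported in the Gene Information section. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence above, used as sample input.
first_line = "ATGTCTGAAG CCTGGGAGAG GACCGAGTTG GCTGACCGCC TGATCGTCAA GGGTGCGCGC "
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Running the same two functions over all lines of the gene sequence should give a value comparable to the GC content field, up to rounding.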
 
Protein sequence
MSEAWERTEL ADRLIVKGAR EHNLRSVDLD LPRDALIVFT GLSGSGKSSL AFDTIFAEGQ 
RRYVESLSAY ARQFLGQMDK PDVDFIEGLS PAVSIDQKST NRNPRSTVGT ITEVYDYLRL
LYARAGTPHC PVCGERIARQ TPQQIVDQVL AMDEGLRFQV LAPVVRTRKG EFVDLFEKLN
SQGYSRVRVD GVVYPLTEPP KLKKQEKHDI EVVVDRLTVK ATAKQRLTDS VETALTLADG
IVVLEFVDRE DDHPHREQRF SEKLACPNGH PLAVDDLEPR SFSFNSPYGA CPECAGLGVR
KEVDPELVIP DPDLTLAEGA IAPWSMGQTA EYFTRMLTGL GESMGFDVDT PWKKLPAKVR
RAILEGCDEQ VHVKYRNRYG RTRSYYADFE GVMAFLQRRM EQTDSEQMKE RYEGFMRDVP
CPECDGTRLK PEILAVTLAA GDHGAKSIAE VAELSIADCA DFLNALTLGP REQAIAGQVL
KEIQSRLGFL LDVGLDYLSL SRAAATLSGG EAQRIRLATQ IGSGLVGVLY VLDEPSIGLH
QRDNRRLIDT LVRLRELGNT LIVVEHDLDT IAHADWVVDI GPAAGEHGGR IVHSGTYQDL
LRNPESLTGA YLSGRESIEV PAIRRPVDRK RQLTVIGARE HNLKDVDVAF PLGVLTSVTG
VSGSGKSTLV NDILASVLAN KLNGARQVPG RHTRINGLDQ LDKLVRVDQS PIGRTPRSNP
ATYTGVFDKI RSLFAATTEA KVRGYQPGRF SFNVKGGRCE ACSGDGTIKI EMNFLPDVYV
PCEVCHGARY NRETLEVHYK GKTIAEVLDL SIEDAAEFFE PISSIHRYLK TLVDVGLGYV
RLGQPAPTLS GGEAQRVKLA AELQKRSTGR TVYILDEPTT GLHFEDIRKL LKVINGLVDK
GNTVIVIEHN LDVIKTSDWI IDMGPEGGAG GGTVVAQGTP EDVAANTDSY TGDFLAEMLD
VPAPTRKRRK VSA
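
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares for internal codons; table 11 also permits alternative start codons, which this sketch does not handle, and that is harmless here because the gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGTCTGAAG CCTGGGAGAG GACCGAGTTG"))  # prints: MSEAWERTEL
print(translate("GTCAGCGCGT GA"))                     # prints: VSA
```

Both fragments match the corresponding ends of the protein sequence listed above.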