Gene Mvan_5002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5002 
Symbol 
ID4645065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5348524 
End bp5351346 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content73% 
IMG OID639808473 
Productregulatory protein, LuxR 
Protein accessionYP_955780 
Protein GI120405951 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACACTC GGCTGGACAC TCGGCATCCC GCGGGACACC GCGCCCGGCC GCCGCTGCGC 
GGACGCGGTC ACGAATGCCG GGAACTGCAG AACCTGATTG CCGCGGTGCG CACCGGCGCC
AGCCGGTCCC TGGTCCTGCG TGGCGAGGCA GGCATCGGAA AGACCGCGCT GCTCGACCAC
GTGGTGGCGC AGGCCGAGGG TTTCCGCACC ACCTACGTCG CAGGTGTGGA ATCGGACATG
GAGCTCGCCT TCGCCGGGCT GCACCAGCTG TGCGCCCCGC TGCTCACCCA CCTCGACGAG
CTGCCCGACC CGCAGCGAGA CGCCCTGACC GTCGCGTTCG GTCAGGGCGC CGGCGCCACC
CCGGACCGCT TCCTGGTCGG GTTGGCGGTG CTCAGCCTGA TGGCTGCCGC GTCGACCGGT
GGGCCGCTGC TGTGCGTCAT CGACGACGCC CAGTGGCTCG ACCAGGTGTC GGTGCAGACG
TTGGCGTTCG TGGCGCGCCG CCTGCTGGCC GAACCCATCG CGATGGTGTT CGCGGCCCGC
GACACCGGCG CCGAGGCGCT GAGCGGTCTG CCCGAGCTGG CCGTCCCGGG TCTGTCCGAC
GGCGAGGCCA GGGACCTGTT GGACTCGGTG GTGGTGGGGC GGCTGGATGA CCGGGTGCGC
GACCGCATCA TCGCCGAGAC GCGGGGCAAC CCGCTGGCAC TGCTGGACCT GCCGCACAAC
CACGTCGGCA CCGAGCTGCC CGGCAGCGCC GCCACCCCGG GCAGCCGCCC CCTCGCCCGG
CGTCTTGAGC AGAGTTACGC ACGTCGCATC AAGGCCCTGC CGCCGCAGAC CCAGCTTCTG
CTGCTGGCGG CCGCGGCCGA GCCGGTCGGC GACGCCGCGG TGTTGATCCG CGCGGCCGCG
CAACTCGGCA TCGCGGTGGA CATGCTGACA CCCGCCGAGG CCGCGGGGGT GATCGAACTC
GGTACCCGGG TGCGGTTCCG GCACCCGCTG GTCCGGTCGG CCGCCTATCA GGCCGCCGAC
CTGGCCGACC GGCGCACGGT CCACCGCGCG CTGGCCGAGG CGACCGACTC GGCCACCGAC
CCCGACCGCC GCGCCTGGCA CGCCGCCAAC GCGGCGACCG GACCCGACGA CGCGGTCGCC
GCCGAGCTCG AAGCGTCCGC AGCCCGCGCG CAGGCCCGCG GTGGCGTGGC GGCTGCGGCG
GCCTTCCTGG AGCGGGCGGC CGCACTGACC TCCGATCCCG CGCTGCGCGG CGCCCGTGCA
CTCGCCGCCG CGCAGGCCAA GCGCGACGCG GCGGCTCCTG CCGAGGCCCA GGAACTCCTC
TCCACCGCCG AGCTGGGGCG GCTCTCCGAA CTTCAACAGG CGCAGGCGGC GCGACTGCGC
GCACAGATGG AGTTCACCCG GCGCCGCGGC GGCGATACCG GCGCCCGCCC ACTCGACGAG
ACGGCATCGC AATTGCTCAC GGCAGCAGCC AGTTTCGAAC AACTCGACGG CTACGTGTCG
CGGGAGGCCT ACCTGGAGGC GCTGGCCGCC GCGATGTTCG CCGGCCGGCT CGGTGACCCC
GGTGCGGTCC CGCGGATCGC CGCTGCCGCG AGTGCAGCGC TCGACCGTCT CCCAGAGTCA
GCGCGCCCCG TCGACGCGCT GCTGCGCGGC ATCGTGGCGC GAATCACCGG CGGCGTCAGC
GCCGGTGCCG CACCGCTACG CCTCGCGATG GACCTCATGC AGGAGCAGGC GCGCAACAAC
GACCACCGGG TGGCGCGCTG GATGGTGCCC GCGTTCCCGA TCATGCAGGA AACAGCTGCG
CTGGAGCTGT GGGACGAGAC CGTTGTGCAC CACCTCGCGT CCGTCGTGGT GCGCCGCGCC
CGGGACGCGG GTGCCCTTGA GGCACTCCCC CAGGTGCTGG CCTACCGTGC CGGCGCACAC
CTACTCGCCG GTGAGCTATC CTCGGCGGCA ACACTTCTGG AGGAAGCCGC ATCGATCACC
GCGGCGACCA ACAACTACAC ACCGGTGCGC TACCAGACGC TGACCCTGGC GGCGTGGCGC
GGCGAACCCG CCGACGCGGA GGCGGTGATC GAGGCGGCCA TCGCCGATGC CAATGCCAGA
GGTGAGGGCC GCGTCCTCGG CGCCGCCGAT TATGCGTTCG CGGTGCTGTA CAACGGTCTC
GGGCGTTACC AGGAGGCGTG CAGCGCTGCC CGGCGGGCGT GCGAGTACGA GGATCTCGGC
GTCCACAGCT GGGTGTTGGC CGAACTGGTC GAGGCGGCGG ACCACTGCGG TGACACCGCG
TTGGCGGTGT CGGCGCTTGA GCGCCTGCAG GAGCGCACCG CCGACACCGG GACCGACTGG
GGCCTGGGAA CTCTGGCGGG TGCGCAGGCC CTGGTGGCCG ACGACGATCA CGCCGAAGCC
CTGTTCGAGG AATCGATCGA CCGCCTGGCA CGCACCCGGG TGGCCGTGCA CCTGGCCCGC
GCGCATCTGC GCTACGGCGA GTGGTTACGG CGGGCGCTGC GCCGCAACGA CGCCCGCGAA
CAGCTCACGC TGGCCTCCGG TATGTTCACC CGATTCGGTG CCGCCGCCTT CGCCGAACGG
ACCCGCCGTG AGCTGATCGC GACCGGCGAG AAGGCCCGCC GGCAACCGGT CACCTCCGGC
GCTCAGCTCA CCGCCCAGGA GTCGCAGATC GCGCGGCTCG CAGGCGACGG GCTGACCAAT
CAGGAGATCG GGGCCCAGCT GTTCATCAGC ACCCACACCG TCGACTGGCA CCTGCGGAAG
GTGTTCGTCA AGCTCGGCAT CACCTCACGC AGGCAGTTGC GCAGCGCGTC GTGGGCGAGT
TGA
 
Protein sequence
MDTRLDTRHP AGHRARPPLR GRGHECRELQ NLIAAVRTGA SRSLVLRGEA GIGKTALLDH 
VVAQAEGFRT TYVAGVESDM ELAFAGLHQL CAPLLTHLDE LPDPQRDALT VAFGQGAGAT
PDRFLVGLAV LSLMAAASTG GPLLCVIDDA QWLDQVSVQT LAFVARRLLA EPIAMVFAAR
DTGAEALSGL PELAVPGLSD GEARDLLDSV VVGRLDDRVR DRIIAETRGN PLALLDLPHN
HVGTELPGSA ATPGSRPLAR RLEQSYARRI KALPPQTQLL LLAAAAEPVG DAAVLIRAAA
QLGIAVDMLT PAEAAGVIEL GTRVRFRHPL VRSAAYQAAD LADRRTVHRA LAEATDSATD
PDRRAWHAAN AATGPDDAVA AELEASAARA QARGGVAAAA AFLERAAALT SDPALRGARA
LAAAQAKRDA AAPAEAQELL STAELGRLSE LQQAQAARLR AQMEFTRRRG GDTGARPLDE
TASQLLTAAA SFEQLDGYVS REAYLEALAA AMFAGRLGDP GAVPRIAAAA SAALDRLPES
ARPVDALLRG IVARITGGVS AGAAPLRLAM DLMQEQARNN DHRVARWMVP AFPIMQETAA
LELWDETVVH HLASVVVRRA RDAGALEALP QVLAYRAGAH LLAGELSSAA TLLEEAASIT
AATNNYTPVR YQTLTLAAWR GEPADAEAVI EAAIADANAR GEGRVLGAAD YAFAVLYNGL
GRYQEACSAA RRACEYEDLG VHSWVLAELV EAADHCGDTA LAVSALERLQ ERTADTGTDW
GLGTLAGAQA LVADDDHAEA LFEESIDRLA RTRVAVHLAR AHLRYGEWLR RALRRNDARE
QLTLASGMFT RFGAAAFAER TRRELIATGE KARRQPVTSG AQLTAQESQI ARLAGDGLTN
QEIGAQLFIS THTVDWHLRK VFVKLGITSR RQLRSASWAS