Gene Information |
Locus tag | Mvan_5002 |
Symbol | |
ID | 4645065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 5348524 |
End bp | 5351346 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639808473 |
Product | regulatory protein, LuxR |
Protein accession | YP_955780 |
Protein GI | 120405951 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
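The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 5348524
END_BP = 5351346
GENE_LENGTH_BP = 2823
PROTEIN_LENGTH_AA = 940


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))      # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))     # compare with Protein Length
```

Under these assumptions the computed values agree with the Gene Length (2823 bp) and Protein Length (940 aa) rows.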
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACACTC GGCTGGACAC TCGGCATCCC GCGGGACACC GCGCCCGGCC GCCGCTGCGC GGACGCGGTC ACGAATGCCG GGAACTGCAG AACCTGATTG CCGCGGTGCG CACCGGCGCC AGCCGGTCCC TGGTCCTGCG TGGCGAGGCA GGCATCGGAA AGACCGCGCT GCTCGACCAC GTGGTGGCGC AGGCCGAGGG TTTCCGCACC ACCTACGTCG CAGGTGTGGA ATCGGACATG GAGCTCGCCT TCGCCGGGCT GCACCAGCTG TGCGCCCCGC TGCTCACCCA CCTCGACGAG CTGCCCGACC CGCAGCGAGA CGCCCTGACC GTCGCGTTCG GTCAGGGCGC CGGCGCCACC CCGGACCGCT TCCTGGTCGG GTTGGCGGTG CTCAGCCTGA TGGCTGCCGC GTCGACCGGT GGGCCGCTGC TGTGCGTCAT CGACGACGCC CAGTGGCTCG ACCAGGTGTC GGTGCAGACG TTGGCGTTCG TGGCGCGCCG CCTGCTGGCC GAACCCATCG CGATGGTGTT CGCGGCCCGC GACACCGGCG CCGAGGCGCT GAGCGGTCTG CCCGAGCTGG CCGTCCCGGG TCTGTCCGAC GGCGAGGCCA GGGACCTGTT GGACTCGGTG GTGGTGGGGC GGCTGGATGA CCGGGTGCGC GACCGCATCA TCGCCGAGAC GCGGGGCAAC CCGCTGGCAC TGCTGGACCT GCCGCACAAC CACGTCGGCA CCGAGCTGCC CGGCAGCGCC GCCACCCCGG GCAGCCGCCC CCTCGCCCGG CGTCTTGAGC AGAGTTACGC ACGTCGCATC AAGGCCCTGC CGCCGCAGAC CCAGCTTCTG CTGCTGGCGG CCGCGGCCGA GCCGGTCGGC GACGCCGCGG TGTTGATCCG CGCGGCCGCG CAACTCGGCA TCGCGGTGGA CATGCTGACA CCCGCCGAGG CCGCGGGGGT GATCGAACTC GGTACCCGGG TGCGGTTCCG GCACCCGCTG GTCCGGTCGG CCGCCTATCA GGCCGCCGAC CTGGCCGACC GGCGCACGGT CCACCGCGCG CTGGCCGAGG CGACCGACTC GGCCACCGAC CCCGACCGCC GCGCCTGGCA CGCCGCCAAC GCGGCGACCG GACCCGACGA CGCGGTCGCC GCCGAGCTCG AAGCGTCCGC AGCCCGCGCG CAGGCCCGCG GTGGCGTGGC GGCTGCGGCG GCCTTCCTGG AGCGGGCGGC CGCACTGACC TCCGATCCCG CGCTGCGCGG CGCCCGTGCA CTCGCCGCCG CGCAGGCCAA GCGCGACGCG GCGGCTCCTG CCGAGGCCCA GGAACTCCTC TCCACCGCCG AGCTGGGGCG GCTCTCCGAA CTTCAACAGG CGCAGGCGGC GCGACTGCGC GCACAGATGG AGTTCACCCG GCGCCGCGGC GGCGATACCG GCGCCCGCCC ACTCGACGAG ACGGCATCGC AATTGCTCAC GGCAGCAGCC AGTTTCGAAC AACTCGACGG CTACGTGTCG CGGGAGGCCT ACCTGGAGGC GCTGGCCGCC GCGATGTTCG CCGGCCGGCT CGGTGACCCC GGTGCGGTCC CGCGGATCGC CGCTGCCGCG AGTGCAGCGC TCGACCGTCT CCCAGAGTCA GCGCGCCCCG TCGACGCGCT GCTGCGCGGC ATCGTGGCGC GAATCACCGG CGGCGTCAGC GCCGGTGCCG CACCGCTACG CCTCGCGATG GACCTCATGC AGGAGCAGGC GCGCAACAAC GACCACCGGG TGGCGCGCTG GATGGTGCCC GCGTTCCCGA TCATGCAGGA AACAGCTGCG 
CTGGAGCTGT GGGACGAGAC CGTTGTGCAC CACCTCGCGT CCGTCGTGGT GCGCCGCGCC CGGGACGCGG GTGCCCTTGA GGCACTCCCC CAGGTGCTGG CCTACCGTGC CGGCGCACAC CTACTCGCCG GTGAGCTATC CTCGGCGGCA ACACTTCTGG AGGAAGCCGC ATCGATCACC GCGGCGACCA ACAACTACAC ACCGGTGCGC TACCAGACGC TGACCCTGGC GGCGTGGCGC GGCGAACCCG CCGACGCGGA GGCGGTGATC GAGGCGGCCA TCGCCGATGC CAATGCCAGA GGTGAGGGCC GCGTCCTCGG CGCCGCCGAT TATGCGTTCG CGGTGCTGTA CAACGGTCTC GGGCGTTACC AGGAGGCGTG CAGCGCTGCC CGGCGGGCGT GCGAGTACGA GGATCTCGGC GTCCACAGCT GGGTGTTGGC CGAACTGGTC GAGGCGGCGG ACCACTGCGG TGACACCGCG TTGGCGGTGT CGGCGCTTGA GCGCCTGCAG GAGCGCACCG CCGACACCGG GACCGACTGG GGCCTGGGAA CTCTGGCGGG TGCGCAGGCC CTGGTGGCCG ACGACGATCA CGCCGAAGCC CTGTTCGAGG AATCGATCGA CCGCCTGGCA CGCACCCGGG TGGCCGTGCA CCTGGCCCGC GCGCATCTGC GCTACGGCGA GTGGTTACGG CGGGCGCTGC GCCGCAACGA CGCCCGCGAA CAGCTCACGC TGGCCTCCGG TATGTTCACC CGATTCGGTG CCGCCGCCTT CGCCGAACGG ACCCGCCGTG AGCTGATCGC GACCGGCGAG AAGGCCCGCC GGCAACCGGT CACCTCCGGC GCTCAGCTCA CCGCCCAGGA GTCGCAGATC GCGCGGCTCG CAGGCGACGG GCTGACCAAT CAGGAGATCG GGGCCCAGCT GTTCATCAGC ACCCACACCG TCGACTGGCA CCTGCGGAAG GTGTTCGTCA AGCTCGGCAT CACCTCACGC AGGCAGTTGC GCAGCGCGTC GTGGGCGAGT TGA
|
Protein sequence | MDTRLDTRHP AGHRARPPLR GRGHECRELQ NLIAAVRTGA SRSLVLRGEA GIGKTALLDH VVAQAEGFRT TYVAGVESDM ELAFAGLHQL CAPLLTHLDE LPDPQRDALT VAFGQGAGAT PDRFLVGLAV LSLMAAASTG GPLLCVIDDA QWLDQVSVQT LAFVARRLLA EPIAMVFAAR DTGAEALSGL PELAVPGLSD GEARDLLDSV VVGRLDDRVR DRIIAETRGN PLALLDLPHN HVGTELPGSA ATPGSRPLAR RLEQSYARRI KALPPQTQLL LLAAAAEPVG DAAVLIRAAA QLGIAVDMLT PAEAAGVIEL GTRVRFRHPL VRSAAYQAAD LADRRTVHRA LAEATDSATD PDRRAWHAAN AATGPDDAVA AELEASAARA QARGGVAAAA AFLERAAALT SDPALRGARA LAAAQAKRDA AAPAEAQELL STAELGRLSE LQQAQAARLR AQMEFTRRRG GDTGARPLDE TASQLLTAAA SFEQLDGYVS REAYLEALAA AMFAGRLGDP GAVPRIAAAA SAALDRLPES ARPVDALLRG IVARITGGVS AGAAPLRLAM DLMQEQARNN DHRVARWMVP AFPIMQETAA LELWDETVVH HLASVVVRRA RDAGALEALP QVLAYRAGAH LLAGELSSAA TLLEEAASIT AATNNYTPVR YQTLTLAAWR GEPADAEAVI EAAIADANAR GEGRVLGAAD YAFAVLYNGL GRYQEACSAA RRACEYEDLG VHSWVLAELV EAADHCGDTA LAVSALERLQ ERTADTGTDW GLGTLAGAQA LVADDDHAEA LFEESIDRLA RTRVAVHLAR AHLRYGEWLR RALRRNDARE QLTLASGMFT RFGAAAFAER TRRELIATGE KARRQPVTSG AQLTAQESQI ARLAGDGLTN QEIGAQLFIS THTVDWHLRK VFVKLGITSR RQLRSASWAS
|
| |
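The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as a start codon and is read as methionine at initiation. The following is a minimal sketch of that translation step plus a GC-fraction helper, using only the first 30 nucleotides of the gene sequence above. The names `translate` and `gc_fraction` are hypothetical, and `START_CODONS` is only a subset of the initiators allowed by table 11, chosen for illustration.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are the same as the
# standard code (table 11 differs mainly in its permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of table 11 start codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence listed above.
PREFIX = "GTGGACACTC GGCTGGACAC TCGGCATCCC"

if __name__ == "__main__":
    print(translate(PREFIX))
    print(round(gc_fraction(PREFIX), 3))
```

Translating this prefix gives MDTRLDTRHP, matching the first ten residues of the protein sequence above. The GC fraction of the prefix alone is not expected to equal the 73% reported for the whole gene.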