Gene Information |
Locus tag | Mlab_1070 |
Symbol | |
ID | 4794607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 1085961 |
End bp | 1087568 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640099741 |
Product | regulatory protein, ArsR |
Protein accession | YP_001030506 |
Protein GI | 124485890 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
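The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the record above (Mlab_1070).
length = gene_length_bp(1085961, 1087568)
print(length, protein_length_aa(length))  # prints "1608 535"
```

Under these assumptions the result agrees with the listed Gene Length (1608 bp) and Protein Length (535 aa).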
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.802527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTAT CCTCAAAAAC ACTTACATTT CATCTCTCGG AAGATGCGGT CGGGCAGATC AGTTCTTTAA CAGCCGAAGC ACTCAAAGAA GGAAATATTG AAGAATACGA CATTGAGCGT TTGCATCTTG CCGTCGAACA GGTCCTGTTG AAATGGCTCG CTGTTCTCGG AGAAGGAATC GAAGGAACAT ACCGGAGCGG CAAACGTCTG GGCCGTCAGT ACATTAACCT CTCGGTTATA GGACCAAGAG TAAATCCGTT CGGTGAAGAC GGAGGGGACT GTGTTTTAAA CGGGGGCAAT TCGGTCCAGA CACTGCTTGC AAATATCGGA CTTACCCCGT CGTATCATTA CGTAAACGGC GAAAATCAGA TATCGCTCAT GCCGAAGCGG AAAAAGATCA ATCCGCTGAT TTATCTTTTC TGCTCGATTT TTGCAGGAGT ATTCGTTGGA CTATTGTGCC GTGAACTACC GTATGACATC AGACACGCAG TCTCCGAGGT CGTTGTTGTC CCGCTGTTCG ACACCTTCAT TGGACTTTTA ATCGCGGCGG CCCTTCCCAT GATGTTTTTA TCCCTTATCT GGGGTATTTA CAGTATTGGC GATACGGCTA CACTCGGAAA TATCGGCAAA CGTGTTATAG GCAGATACCT TGGAAGAATC TACTTCACGT TGGTCATCTG CACACTGGTC TGTATTCCGT TTTTTACGTA TGCGGCCGGC GGGACAATAG TGAACGGCGG GGAGTTCACG GCGATTTTTT CCATGATCCT CGATATCATT CCATCCAATC TCATCTCTCC GTTCGTCGAA GGAAACTCTC TTCAGATCAT CTTTCTGGCA GTGACTTTTG GTCTTGCCAT GCTTATCCTG AACAAAAAGA TCCCGGTGAT CATACAGTTC GTTGGACAGG CAAACAACAT CATCCAGCTG ATCATGGAAT GGATCACGTC ACTCCTTCCA GTGATCATCT TCATCAGCAT TCTTCAGTTG ATGCTGACGG ATATGCTTTC AGATATGGCA GGTCTCGTGA AACTTTTTGT AATCATCTTC CTCTGTGTAG GTGCAAACCT TGTCCTGATG ATCATGAGTG TTTCGATTCG AAGAAAAATC TCCCCGATCA TTCTGGTAAA AAAACTGCTC CCCTCATTTC TGGTGGCTCT TACGACCGCT TCATCAGCGG CTACCTTCTC GACGAACATG GAGTGCTGCG AGAAAAAACT GGGCATACAG CGTAAACTCG TGAACTTCGG CGTTCCTCTT GGAACCGCCT TCTCGAGACC GGGGCATGCC GCGGTATTTT TCTGCGTCTG TTTATTTATG GCCGATACCT ACGGCGTTCC GATCACCTTT TCCTGGATAT TTGCCGCCAT ACTGACCTGC GGCCTTCTTG CTTTGGCCGT CCCTCCGGTG CCCGGAGGAG GGATCGCCTG TTACTCGATT CTATTCCTTC AGTTGGGAAT ACCTGTAGAA GCGCTTGGTA TCGCTGTGGT TTTGGAGATC GTGCTGGACT TCCTGAGTAC ATCACTGAAC ATGGTGGCCG TGCCTGTGGA TATGATCCAT GTGGCAGGCA AACTGGATCT GGTTGATGAA AAAGTGATGA GAGGCTGA
|
Protein sequence | MAVSSKTLTF HLSEDAVGQI SSLTAEALKE GNIEEYDIER LHLAVEQVLL KWLAVLGEGI EGTYRSGKRL GRQYINLSVI GPRVNPFGED GGDCVLNGGN SVQTLLANIG LTPSYHYVNG ENQISLMPKR KKINPLIYLF CSIFAGVFVG LLCRELPYDI RHAVSEVVVV PLFDTFIGLL IAAALPMMFL SLIWGIYSIG DTATLGNIGK RVIGRYLGRI YFTLVICTLV CIPFFTYAAG GTIVNGGEFT AIFSMILDII PSNLISPFVE GNSLQIIFLA VTFGLAMLIL NKKIPVIIQF VGQANNIIQL IMEWITSLLP VIIFISILQL MLTDMLSDMA GLVKLFVIIF LCVGANLVLM IMSVSIRRKI SPIILVKKLL PSFLVALTTA SSAATFSTNM ECCEKKLGIQ RKLVNFGVPL GTAFSRPGHA AVFFCVCLFM ADTYGVPITF SWIFAAILTC GLLALAVPPV PGGGIACYSI LFLQLGIPVE ALGIAVVLEI VLDFLSTSLN MVAVPVDMIH VAGKLDLVDE KVMRG
|
| |