Gene Mlab_1070 details

Gene Information

Locus tag: Mlab_1070
Symbol:
ID: 4794607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 1085961
End bp: 1087568
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 48%
IMG OID: 640099741
Product: regulatory protein, ArsR
Protein accession: YP_001030506
Protein GI: 124485890
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
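
The start, end, gene length and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the reported 1608 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 1085961
end_bp = 1087568


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1608 535
```

Both results agree with the Gene Length and Protein Length fields listed in the record.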


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.802527
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTAT CCTCAAAAAC ACTTACATTT CATCTCTCGG AAGATGCGGT CGGGCAGATC 
AGTTCTTTAA CAGCCGAAGC ACTCAAAGAA GGAAATATTG AAGAATACGA CATTGAGCGT
TTGCATCTTG CCGTCGAACA GGTCCTGTTG AAATGGCTCG CTGTTCTCGG AGAAGGAATC
GAAGGAACAT ACCGGAGCGG CAAACGTCTG GGCCGTCAGT ACATTAACCT CTCGGTTATA
GGACCAAGAG TAAATCCGTT CGGTGAAGAC GGAGGGGACT GTGTTTTAAA CGGGGGCAAT
TCGGTCCAGA CACTGCTTGC AAATATCGGA CTTACCCCGT CGTATCATTA CGTAAACGGC
GAAAATCAGA TATCGCTCAT GCCGAAGCGG AAAAAGATCA ATCCGCTGAT TTATCTTTTC
TGCTCGATTT TTGCAGGAGT ATTCGTTGGA CTATTGTGCC GTGAACTACC GTATGACATC
AGACACGCAG TCTCCGAGGT CGTTGTTGTC CCGCTGTTCG ACACCTTCAT TGGACTTTTA
ATCGCGGCGG CCCTTCCCAT GATGTTTTTA TCCCTTATCT GGGGTATTTA CAGTATTGGC
GATACGGCTA CACTCGGAAA TATCGGCAAA CGTGTTATAG GCAGATACCT TGGAAGAATC
TACTTCACGT TGGTCATCTG CACACTGGTC TGTATTCCGT TTTTTACGTA TGCGGCCGGC
GGGACAATAG TGAACGGCGG GGAGTTCACG GCGATTTTTT CCATGATCCT CGATATCATT
CCATCCAATC TCATCTCTCC GTTCGTCGAA GGAAACTCTC TTCAGATCAT CTTTCTGGCA
GTGACTTTTG GTCTTGCCAT GCTTATCCTG AACAAAAAGA TCCCGGTGAT CATACAGTTC
GTTGGACAGG CAAACAACAT CATCCAGCTG ATCATGGAAT GGATCACGTC ACTCCTTCCA
GTGATCATCT TCATCAGCAT TCTTCAGTTG ATGCTGACGG ATATGCTTTC AGATATGGCA
GGTCTCGTGA AACTTTTTGT AATCATCTTC CTCTGTGTAG GTGCAAACCT TGTCCTGATG
ATCATGAGTG TTTCGATTCG AAGAAAAATC TCCCCGATCA TTCTGGTAAA AAAACTGCTC
CCCTCATTTC TGGTGGCTCT TACGACCGCT TCATCAGCGG CTACCTTCTC GACGAACATG
GAGTGCTGCG AGAAAAAACT GGGCATACAG CGTAAACTCG TGAACTTCGG CGTTCCTCTT
GGAACCGCCT TCTCGAGACC GGGGCATGCC GCGGTATTTT TCTGCGTCTG TTTATTTATG
GCCGATACCT ACGGCGTTCC GATCACCTTT TCCTGGATAT TTGCCGCCAT ACTGACCTGC
GGCCTTCTTG CTTTGGCCGT CCCTCCGGTG CCCGGAGGAG GGATCGCCTG TTACTCGATT
CTATTCCTTC AGTTGGGAAT ACCTGTAGAA GCGCTTGGTA TCGCTGTGGT TTTGGAGATC
GTGCTGGACT TCCTGAGTAC ATCACTGAAC ATGGTGGCCG TGCCTGTGGA TATGATCCAT
GTGGCAGGCA AACTGGATCT GGTTGATGAA AAAGTGATGA GAGGCTGA
 
Protein sequence
MAVSSKTLTF HLSEDAVGQI SSLTAEALKE GNIEEYDIER LHLAVEQVLL KWLAVLGEGI 
EGTYRSGKRL GRQYINLSVI GPRVNPFGED GGDCVLNGGN SVQTLLANIG LTPSYHYVNG
ENQISLMPKR KKINPLIYLF CSIFAGVFVG LLCRELPYDI RHAVSEVVVV PLFDTFIGLL
IAAALPMMFL SLIWGIYSIG DTATLGNIGK RVIGRYLGRI YFTLVICTLV CIPFFTYAAG
GTIVNGGEFT AIFSMILDII PSNLISPFVE GNSLQIIFLA VTFGLAMLIL NKKIPVIIQF
VGQANNIIQL IMEWITSLLP VIIFISILQL MLTDMLSDMA GLVKLFVIIF LCVGANLVLM
IMSVSIRRKI SPIILVKKLL PSFLVALTTA SSAATFSTNM ECCEKKLGIQ RKLVNFGVPL
GTAFSRPGHA AVFFCVCLFM ADTYGVPITF SWIFAAILTC GLLALAVPPV PGGGIACYSI
LFLQLGIPVE ALGIAVVLEI VLDFLSTSLN MVAVPVDMIH VAGKLDLVDE KVMRG
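
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard NCBI base ordering (translation table 11 assigns amino acids to codons the same way as the standard table and differs only in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean` and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each codon position.
# Amino-acid assignments are shared by the standard table and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate in frame 1; unknown codons become 'X'."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# First 30 and last 18 nucleotides of the gene sequence listed above.
first_block = "ATGGCAGTAT CCTCAAAAAC ACTTACATTT"
last_block = "AAAGTGATGA GAGGCTGA"

print(translate(first_block))  # MAVSSKTLTF
print(translate(last_block))   # KVMRG
```

The two outputs match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would allow a complete comparison.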