Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_1678 |
Symbol | |
ID | 4794406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 1711599 |
End bp | 1712666 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640100368 |
Product | hypothetical protein |
Protein accession | YP_001031106 |
Protein GI | 124486490 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000000385803 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATGTTT CCGCAGGCAG ACTGGCGATC GGATTTGCAA TTCTGGCGGC TCTTTTGTAT GGTATCAGTG CGCCGGTCTC AAAGATCCTG CTGGAAACGA CGTCCCCCGA ACTGATGGCG GCACTTTTGT ATCTGGGGGC GGGTATCGGG ATGTTTGCGG TGAATCTGGC GACCCAGCGG CGCAGCCGGG AGCGAAAGGA GGCTCCGCTT TCGAAAAAAG ATCTGCCGTT TGTGATCGGC ATGATCGTAC TGGATATCGC GGCGCCGATC CTTTTGATGC TGGGTCTTTC GCTCACGACT GCGGCAAACG CGTCTCTTTT GAATAATTTC GAGATCGTGA CGACGGCTCT GGTGGCTCTT CTGCTGTTTC GCGAAGCGAT CGACCGGCGT TTGTGGATCG CTATCCTTTT GATCGTGGCA GGAAGTGTTC TTCTGACCGT CGATGATCTG AGCAGTTTTT CGTTCTCGGC AGGGTCGATT TTAGTGATCC TCGCGTGTGT CTGCTGGGGT GTGGAGAACA ACTGTACGCG GATGCTTTCC CTAAAAGATC CGATGCAGAT AGTCGTTGTG AAAGGGTTCG GCGCAGGGTC GGGGGCTTTG CTGATCGCGT TTCTGGCGTC CGGGATGCAG ACGGATCTCG TAAGCGTGAT CGCGGCGCTG GTCCTGGGAT TTTTCGCGTA CGGGCTGTCG ATCTATCTGT ATGTCCGGGC ACAGCGGGAT CTTGGGGCGG CGAGGACGAG TGCGTTTTAC GCGGTGGCGC CGTTTATCGG AGCGGCGATT TCGTTCGCGG TGTTTCAGAC GCCGCTCACC CCTCTGTTCG CTTTGGCGGC CGGTCTTATG GTTCTTGGAG CGTATTTTGC GGCGAAGGGA GGGCATATGC ATAAGCATAT CCACGAGTCG GTGATGCATG ACCATAGGCA TGGGCACGAG GACGGTCATC ACACGCATGT CCATGAGCCG CCTGTATTCG AAGAACACAC GCACGAGCAT ACGCATGAAC GTCTCGAGCA TGACCATCCC CACGGGGCGG ATATGCATCA CGTTCACGTG CATGAAGAGA AGAAGTGA
|
Protein sequence | MHVSAGRLAI GFAILAALLY GISAPVSKIL LETTSPELMA ALLYLGAGIG MFAVNLATQR RSRERKEAPL SKKDLPFVIG MIVLDIAAPI LLMLGLSLTT AANASLLNNF EIVTTALVAL LLFREAIDRR LWIAILLIVA GSVLLTVDDL SSFSFSAGSI LVILACVCWG VENNCTRMLS LKDPMQIVVV KGFGAGSGAL LIAFLASGMQ TDLVSVIAAL VLGFFAYGLS IYLYVRAQRD LGAARTSAFY AVAPFIGAAI SFAVFQTPLT PLFALAAGLM VLGAYFAAKG GHMHKHIHES VMHDHRHGHE DGHHTHVHEP PVFEEHTHEH THERLEHDHP HGADMHHVHV HEEKK
|
| |