Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_0284 |
Symbol | |
ID | 4795366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 266815 |
End bp | 267990 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640098931 |
Product | hypothetical protein |
Protein accession | YP_001029727 |
Protein GI | 124485111 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAAG ATACTATACA AAAAAAGTTC ATCATCTACT TCTGCATAAT CGGCTGCTTC GCGATATTCT CGACAACGAT TTCAAAAAAT CCCGTTCTCC CCTTATATGC AGCGTCTCTT GGAGCAAATG ATGCCTACCT CGGCCTTATC GCCGCCGTAT CCCCTCTCGC CGGCATTCTC TTCAGTTTTC CCGTAGGCCT GATCTCCGAC AGGCTTGGCC GAAGAAAACT CCTTATCGTC TCAGGATGCA TATTTCTTAT AGCTCCCCTG CTTTATCTTC TGATAACGGA CCCGGTATTT CTTATCCCCA TCAGATTCTT TCACGGCTTG GCGACGGCGA TCCTTGGACC GGTGGTAGGA GCGGCGATCG CAGAAAAATT CGGCAACAGA AAAGGCGTAA TGATGGGGAC ATACAGTTCT GCCACACTCG TCGGCAGGAC GGCAGCTCCG CTGATTGGCG GAACAATTAT CACATTATTT GCCTTGGCAC CGGGATTCAC CGCCTATCAT ATGGTGTATC TCGCAGCGTT TTGTGCTGCA GTACCGGTGT TTATTCTGAT CCTTTTCTTC CGTGATACGC ATGGGAGCGT GCAAAAAGTA ACCGTTGCCG ATTTTACAAA CAGTCTGAAG ACATTTCTCT CGAACAAAGG GCTTCGTGCT GCATCCACAG CAGAGATGAT AACCTACTTC TGTTTTGGCA CGTTTGAAAC ATTCTTGCCT GTCTATCTGC TCCTGATCGG CGTTCCTGCC TGGCAGACCG GCGTCATTTT TGCCGTGCAG GTGGTGGTCA TCGCACTCAC CAAGCCGTTC TTCGGCAGAC GTGCAGACAC GGGAAATCCA CAAAAACAGA TCGCAGCGGG TATGCTCATA ACAGGTATCT CGCTTGGAAT CATGGGTTTA ACAATAAACT TCTGGATCCT CCTCGTTCTA AGCAGTATTT TTGGGATCGG GATGTCGCTC TCTACAGTAG CGACCAACGT GTATGCTGCT AACACTGCAG AAAAGAACGA ACTCGGCGCA TCTCTCGGGG CCCTTTCCTC GATCATGGAC ATCGGTCATA CATCGGGCCC TCTTGTCAGC GGGATCGTAA TAACGCTTGC AGGATACCAG ATTGGTTTTG GTCTTTGCCT TGCCCTGTCG GTACTGACGT CAGTATTTGT GCTGATAACC AGATAA
|
Protein sequence | MSEDTIQKKF IIYFCIIGCF AIFSTTISKN PVLPLYAASL GANDAYLGLI AAVSPLAGIL FSFPVGLISD RLGRRKLLIV SGCIFLIAPL LYLLITDPVF LIPIRFFHGL ATAILGPVVG AAIAEKFGNR KGVMMGTYSS ATLVGRTAAP LIGGTIITLF ALAPGFTAYH MVYLAAFCAA VPVFILILFF RDTHGSVQKV TVADFTNSLK TFLSNKGLRA ASTAEMITYF CFGTFETFLP VYLLLIGVPA WQTGVIFAVQ VVVIALTKPF FGRRADTGNP QKQIAAGMLI TGISLGIMGL TINFWILLVL SSIFGIGMSL STVATNVYAA NTAEKNELGA SLGALSSIMD IGHTSGPLVS GIVITLAGYQ IGFGLCLALS VLTSVFVLIT R
|
| |