Gene Information |
Locus tag | Mlab_0724 |
Symbol | |
ID | 4794484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 698353 |
End bp | 699573 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640099384 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001030163 |
Protein GI | 124485547 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
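The coordinates and lengths in the table above are consistent with one another, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS whose length includes the terminal stop codon; neither convention is stated in the record, and the helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues in the product, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Mlab_0724.
length = gene_length(698353, 699573)
print(length, protein_length(length))  # 1221 406
```

Under these assumptions the result matches the reported gene length of 1221 bp and protein length of 406 aa.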
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0248517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.732521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGATT ATCAATACTC ATCTTCAGGC GACTCCATGC GTCATACCTA CAATACAAAA ATAACCGGGG CGATCAGCGG TCATTTTCTG ATCGATCTGT ATACGCCGAT CCTGCCGGTG ATCCTCCCTC TTCTCATTGT CAATCTGGGG CTCTCGTATT TTCTTGCAGG GCTTGTCGTT ACCGTATTCA ATGTGGTCTC GTCCGTGACC CAGCCGTTTA TCGGGCTTTA CGGAGACCGG ACTGGTAAAT GGGTGAGCGT GCCGTTCTGT GTTCTGATCG GCAGCGTAGG AATATCACTC TCGGTTCTGG CGAACAATTA CCTGATCGTT CTGTTCCTTG TCGGAGGAGC GGCCGTCGGT CATGCCCTCT TCCATCCTTC CGCGATGGCC CTCGTCCATA AACTGAGTCC GCCTGCCAAA AAGGGATTGT ACAACTCGAT TTTTACCACA AGCGGCAGCA TCAGTTATTC GCTCGGCCCG ATGATCGCCG GTTTTTTCAT CACGTTCGGC GGTCTGCCGT CGGTCGCCTG GATGATGGTT CCGGGGATCG TCGGGGCTGC CTGGATCTAT CGAAACGACC GCCGCACCGG AACGGTTGAG CCGATCAAAA AAGAGCCGGT CGAAAAGAAA CAGGTCCACG GGAAGTACTG GTGGATCCCG GCCGGTCTGG TTGTGACGGT GTGTGCTCTG CGGGCCTGGG CGTATGTTGG GGTCATTACG TATCTGCCGA CCCTGCTCCT TTTATGGAAT CAGGGGATCG ACACGTTCAC GGTCTCGGTC ATCATCACGG CCATGCTCTT TACCGGCGTG TTCGGGCAGG TGGCCGGCGG CTATCTTTCC GACCGTTTTG GACGGAAAAA AGTCCTGGTG CTTGGATTCC TCTGTGCGAT CCCGTGTTTC TGTCTGATAT TCCTCACGAC CGGATGGACG ATGTATGCCG GGATCATGCT GTATGCATTC TTTGCGAGCT TCTGTTATGT GGCTTCGGTG ACGATGACTC AGGATCTTCT GCCGGGCAGC GTCGGGTTTG CGTCTGGCCT CACGCTCGGT CTTTCGATGG GTATCGGCGG AGTTGGAGCG GCCCTGATCG GGTGGGTAGC GGATGTCATG GGTTCCCTGC CGGACGCGAT GTTTCTGCTG ATTATTCCAA CGATCCTCTG TCCGATTCTC GCGCTGTTTA TCAAATACTC GGATAAGCCC GCTGCGGTCG CCGAAGAGTG A
|
Protein sequence | MRDYQYSSSG DSMRHTYNTK ITGAISGHFL IDLYTPILPV ILPLLIVNLG LSYFLAGLVV TVFNVVSSVT QPFIGLYGDR TGKWVSVPFC VLIGSVGISL SVLANNYLIV LFLVGGAAVG HALFHPSAMA LVHKLSPPAK KGLYNSIFTT SGSISYSLGP MIAGFFITFG GLPSVAWMMV PGIVGAAWIY RNDRRTGTVE PIKKEPVEKK QVHGKYWWIP AGLVVTVCAL RAWAYVGVIT YLPTLLLLWN QGIDTFTVSV IITAMLFTGV FGQVAGGYLS DRFGRKKVLV LGFLCAIPCF CLIFLTTGWT MYAGIMLYAF FASFCYVASV TMTQDLLPGS VGFASGLTLG LSMGIGGVGA ALIGWVADVM GSLPDAMFLL IIPTILCPIL ALFIKYSDKP AAVAEE
|
| |
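The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the space-separated 10-base blocks are purely cosmetic, and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ in their permitted start codons). The function names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between the 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    # Ignore any incomplete trailing codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two blocks and the final bases of the gene sequence in this record.
print(translate("ATGCGTGATT ATCAATACTC"))  # MRDYQY
print(translate("GCC GAA GAG TGA"))        # AEE
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after applying `clean` to both) is one way to check the record; `gc_content` applied to the full gene sequence can likewise be compared with the reported 55%.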