Gene Mlab_0724 details

Gene Information

Locus tag: Mlab_0724
Symbol:
ID: 4794484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 698353
End bp: 699573
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 55%
IMG OID: 640099384
Product: XRE family transcriptional regulator
Protein accession: YP_001030163
Protein GI: 124485547
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
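
The reported lengths agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that check follows; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(698353, 699573)
print(length)                  # 1221
print(protein_length(length))  # 406
```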


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0248517
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.732521
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGATT ATCAATACTC ATCTTCAGGC GACTCCATGC GTCATACCTA CAATACAAAA 
ATAACCGGGG CGATCAGCGG TCATTTTCTG ATCGATCTGT ATACGCCGAT CCTGCCGGTG
ATCCTCCCTC TTCTCATTGT CAATCTGGGG CTCTCGTATT TTCTTGCAGG GCTTGTCGTT
ACCGTATTCA ATGTGGTCTC GTCCGTGACC CAGCCGTTTA TCGGGCTTTA CGGAGACCGG
ACTGGTAAAT GGGTGAGCGT GCCGTTCTGT GTTCTGATCG GCAGCGTAGG AATATCACTC
TCGGTTCTGG CGAACAATTA CCTGATCGTT CTGTTCCTTG TCGGAGGAGC GGCCGTCGGT
CATGCCCTCT TCCATCCTTC CGCGATGGCC CTCGTCCATA AACTGAGTCC GCCTGCCAAA
AAGGGATTGT ACAACTCGAT TTTTACCACA AGCGGCAGCA TCAGTTATTC GCTCGGCCCG
ATGATCGCCG GTTTTTTCAT CACGTTCGGC GGTCTGCCGT CGGTCGCCTG GATGATGGTT
CCGGGGATCG TCGGGGCTGC CTGGATCTAT CGAAACGACC GCCGCACCGG AACGGTTGAG
CCGATCAAAA AAGAGCCGGT CGAAAAGAAA CAGGTCCACG GGAAGTACTG GTGGATCCCG
GCCGGTCTGG TTGTGACGGT GTGTGCTCTG CGGGCCTGGG CGTATGTTGG GGTCATTACG
TATCTGCCGA CCCTGCTCCT TTTATGGAAT CAGGGGATCG ACACGTTCAC GGTCTCGGTC
ATCATCACGG CCATGCTCTT TACCGGCGTG TTCGGGCAGG TGGCCGGCGG CTATCTTTCC
GACCGTTTTG GACGGAAAAA AGTCCTGGTG CTTGGATTCC TCTGTGCGAT CCCGTGTTTC
TGTCTGATAT TCCTCACGAC CGGATGGACG ATGTATGCCG GGATCATGCT GTATGCATTC
TTTGCGAGCT TCTGTTATGT GGCTTCGGTG ACGATGACTC AGGATCTTCT GCCGGGCAGC
GTCGGGTTTG CGTCTGGCCT CACGCTCGGT CTTTCGATGG GTATCGGCGG AGTTGGAGCG
GCCCTGATCG GGTGGGTAGC GGATGTCATG GGTTCCCTGC CGGACGCGAT GTTTCTGCTG
ATTATTCCAA CGATCCTCTG TCCGATTCTC GCGCTGTTTA TCAAATACTC GGATAAGCCC
GCTGCGGTCG CCGAAGAGTG A
 
Protein sequence
MRDYQYSSSG DSMRHTYNTK ITGAISGHFL IDLYTPILPV ILPLLIVNLG LSYFLAGLVV 
TVFNVVSSVT QPFIGLYGDR TGKWVSVPFC VLIGSVGISL SVLANNYLIV LFLVGGAAVG
HALFHPSAMA LVHKLSPPAK KGLYNSIFTT SGSISYSLGP MIAGFFITFG GLPSVAWMMV
PGIVGAAWIY RNDRRTGTVE PIKKEPVEKK QVHGKYWWIP AGLVVTVCAL RAWAYVGVIT
YLPTLLLLWN QGIDTFTVSV IITAMLFTGV FGQVAGGYLS DRFGRKKVLV LGFLCAIPCF
CLIFLTTGWT MYAGIMLYAF FASFCYVASV TMTQDLLPGS VGFASGLTLG LSMGIGGVGA
ALIGWVADVM GSLPDAMFLL IIPTILCPIL ALFIKYSDKP AAVAEE
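
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that layout and translate the nucleotide sequence for comparison with the protein sequence. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean` and `translate` are illustrative.

```python
# Amino-acid assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, and its final seven codons.
print(translate(clean("ATGCGTGATT ATCAATACTC ATCTTCAGGC")))  # MRDYQYSSSG
print(translate(clean("GCTGCGGTCG CCGAAGAGTG A")))           # AAVAEE
```

Both outputs match the first ten and last six residues of the protein sequence listed above. Passing the full cleaned gene sequence to `translate` allows the same comparison over the whole protein.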