Gene Mlab_0284 details


Gene Information

Locus tag: Mlab_0284
Symbol:
ID: 4795366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 266815
End bp: 267990
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 49%
IMG OID: 640098931
Product: hypothetical protein
Protein accession: YP_001029727
Protein GI: 124485111
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
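
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 266815
end_bp = 267990
gene_length_bp = 1176
protein_length_aa = 391

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
coding_codons = span_bp // 3 - 1

print(span_bp == gene_length_bp)           # span matches the reported gene length
print(span_bp % 3 == 0)                    # span is a whole number of codons
print(coding_codons == protein_length_aa)  # codon count matches the protein length
```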


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGAAG ATACTATACA AAAAAAGTTC ATCATCTACT TCTGCATAAT CGGCTGCTTC 
GCGATATTCT CGACAACGAT TTCAAAAAAT CCCGTTCTCC CCTTATATGC AGCGTCTCTT
GGAGCAAATG ATGCCTACCT CGGCCTTATC GCCGCCGTAT CCCCTCTCGC CGGCATTCTC
TTCAGTTTTC CCGTAGGCCT GATCTCCGAC AGGCTTGGCC GAAGAAAACT CCTTATCGTC
TCAGGATGCA TATTTCTTAT AGCTCCCCTG CTTTATCTTC TGATAACGGA CCCGGTATTT
CTTATCCCCA TCAGATTCTT TCACGGCTTG GCGACGGCGA TCCTTGGACC GGTGGTAGGA
GCGGCGATCG CAGAAAAATT CGGCAACAGA AAAGGCGTAA TGATGGGGAC ATACAGTTCT
GCCACACTCG TCGGCAGGAC GGCAGCTCCG CTGATTGGCG GAACAATTAT CACATTATTT
GCCTTGGCAC CGGGATTCAC CGCCTATCAT ATGGTGTATC TCGCAGCGTT TTGTGCTGCA
GTACCGGTGT TTATTCTGAT CCTTTTCTTC CGTGATACGC ATGGGAGCGT GCAAAAAGTA
ACCGTTGCCG ATTTTACAAA CAGTCTGAAG ACATTTCTCT CGAACAAAGG GCTTCGTGCT
GCATCCACAG CAGAGATGAT AACCTACTTC TGTTTTGGCA CGTTTGAAAC ATTCTTGCCT
GTCTATCTGC TCCTGATCGG CGTTCCTGCC TGGCAGACCG GCGTCATTTT TGCCGTGCAG
GTGGTGGTCA TCGCACTCAC CAAGCCGTTC TTCGGCAGAC GTGCAGACAC GGGAAATCCA
CAAAAACAGA TCGCAGCGGG TATGCTCATA ACAGGTATCT CGCTTGGAAT CATGGGTTTA
ACAATAAACT TCTGGATCCT CCTCGTTCTA AGCAGTATTT TTGGGATCGG GATGTCGCTC
TCTACAGTAG CGACCAACGT GTATGCTGCT AACACTGCAG AAAAGAACGA ACTCGGCGCA
TCTCTCGGGG CCCTTTCCTC GATCATGGAC ATCGGTCATA CATCGGGCCC TCTTGTCAGC
GGGATCGTAA TAACGCTTGC AGGATACCAG ATTGGTTTTG GTCTTTGCCT TGCCCTGTCG
GTACTGACGT CAGTATTTGT GCTGATAACC AGATAA
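
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it assumes the sequence is pasted as plain text with the 10-base blocks separated by whitespace.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first 10-base block of the gene sequence only.
first_block = "ATGTCAGAAG"
print(gc_percent(first_block))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.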
 
Protein sequence
MSEDTIQKKF IIYFCIIGCF AIFSTTISKN PVLPLYAASL GANDAYLGLI AAVSPLAGIL 
FSFPVGLISD RLGRRKLLIV SGCIFLIAPL LYLLITDPVF LIPIRFFHGL ATAILGPVVG
AAIAEKFGNR KGVMMGTYSS ATLVGRTAAP LIGGTIITLF ALAPGFTAYH MVYLAAFCAA
VPVFILILFF RDTHGSVQKV TVADFTNSLK TFLSNKGLRA ASTAEMITYF CFGTFETFLP
VYLLLIGVPA WQTGVIFAVQ VVVIALTKPF FGRRADTGNP QKQIAAGMLI TGISLGIMGL
TINFWILLVL SSIFGIGMSL STVATNVYAA NTAEKNELGA SLGALSSIMD IGHTSGPLVS
GIVITLAGYQ IGFGLCLALS VLTSVFVLIT R
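
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons. The sketch below is an assumption-laden example rather than the pipeline that produced this record: `translate` is a hypothetical helper, it uses the codon assignments of NCBI translation table 11 (which are the same as the standard code), and it assumes the gene sequence is given on the coding strand in frame from its first base.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGTCAGAAG ATACTATACA AAAAAAGTTC"))  # MSEDTIQKKF
```

The printed residues match the first block of the protein sequence; the same function can be applied to the full gene sequence for a complete comparison.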