Gene Mlab_0297 details


Gene Information

Locus tag: Mlab_0297
Symbol:
ID: 4795316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 279648
End bp: 280769
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 53%
IMG OID: 640098945
Product: hypothetical protein
Protein accession: YP_001029740
Protein GI: 124485124
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
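
The coordinate and length fields above agree with each other, which a short sketch can check. It assumes 1-based inclusive coordinates and a CDS length that includes one stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(279648, 280769)
print(length)                     # 1122
print(protein_length_aa(length))  # 373
```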


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATTAA TGGGGGGAGC TGCTGTGGCT CCGGCTTTGC CGGAAATTAG TAGTGCCTTT 
CCCGAGGCCT CCGAGGCAAC AATAGCCCTC ATTATCTCAC TTCCAGCTCT TGCGATCGCC
TGTACCGGAT TTATTATCGG CGCAGTTGTC GACAGACTCG GGAGAATCCC CGTCCTTGCA
ATATCACTTG TCATATTCAC GTTAGCCGGA GTTTCCGGAT TCTTCCTGAC GACGCTTCCG
GCGATCCTTG TCGGACGTTT CATCTTAGGT ATCGGTATCG CCGGTATAAT CAGTTCGACG
ACATCCCTGA TCACCGATTA CTACAACGGC CCCTGCCGGG TACGTGTTCT CGGCTATCAG
GCGGCTGCTA TGGGAATCGG CGTCCTGATT CTGGATACGA GCGGGGGACT GCTTGCCGGA
ATCTCGTGGA GAATGTCATT TCTTATCTAC GGACTTGGTC TTTTCATTCT GATCGGTACC
CTGATCACCA TGAAGGAACC GGTCAAACAA CAGACGGAGG TCAACCGGAA TGCGCCGAAA
ACCAAAGTGC CGGTTACCTC GATCGCCCTG ATCTATGGAA CACTGTTTAT CGGAATGATA
TTGTTCTTCC TGATGCCGAC GAAATTCCCG TATCTTGTCT CTGAGATCTC CGGCGATTCG
GCGATCCTGT CCGGCATTCT CCTTGGCGTG ATGGGGTGTT TCTCGGCTCT CATTGGGGTC
TTCTACTGGA GGATCGCCGG CAAAGTCCAC CGGGTGATGA CGCTTGCGCT CTCCTTTATC
CTGCTTGGCC TTGGGTACTG TCTGTTTGGC ATTTCCGTCT CTCTTGAGAC GCTTATCACC
GCGGTGATGA TTGTTGGAAT AGGAAACGGT CTGTTGATGC CGACGGTCCT TGGCTGGCTC
GGCCTTATCA CGCCGCCCGC TGTCATGGGA AAAGTGATGG GAGGATATGG GATGTCTCTA
AACCTTGGTC AGTTCGTCTC TTCCTTTGCC GCCGTACCGA TCCTGCTTCT CGCGGCAAGT
TACGGACACA TGTTCCTGAT ATTCGGTCTC GTTTCGCTGT GTATTGGCGT CGTGTACGTT
GTCGGATATC TGCACGTCAG ACGTAATCCC GATGCAGCCT GA
 
Protein sequence
MILMGGAAVA PALPEISSAF PEASEATIAL IISLPALAIA CTGFIIGAVV DRLGRIPVLA 
ISLVIFTLAG VSGFFLTTLP AILVGRFILG IGIAGIISST TSLITDYYNG PCRVRVLGYQ
AAAMGIGVLI LDTSGGLLAG ISWRMSFLIY GLGLFILIGT LITMKEPVKQ QTEVNRNAPK
TKVPVTSIAL IYGTLFIGMI LFFLMPTKFP YLVSEISGDS AILSGILLGV MGCFSALIGV
FYWRIAGKVH RVMTLALSFI LLGLGYCLFG ISVSLETLIT AVMIVGIGNG LLMPTVLGWL
GLITPPAVMG KVMGGYGMSL NLGQFVSSFA AVPILLLAAS YGHMFLIFGL VSLCIGVVYV
VGYLHVRRNP DAA
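
The sequences above are printed in blocks of ten characters, so the spacing has to be stripped before any analysis. A minimal sketch, with hypothetical helper names, that removes the block formatting and computes the GC fraction of a nucleotide sequence:

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short example.
first_line = "ATGATATTAA TGGGGGGAGC TGCTGTGGCT CCGGCTTTGC CGGAAATTAG TAGTGCCTTT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field (1122 bp), and `gc_fraction` can be compared against the reported GC content.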