Gene Mboo_1566 details


Gene Information

Locus tag: Mboo_1566
Symbol:
ID: 5410091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1634040
End bp: 1635491
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 57%
IMG OID: 640868800
Product: major facilitator transporter
Protein accession: YP_001404726
Protein GI: 154151108
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
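
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the record states neither convention explicitly.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1634040
end_bp = 1635491

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1452 bp) and Protein Length (483 aa) fields.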


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCAC GGCCCGGGGC AGCCGCCCCT GAACGGTACT GGCGGCTCAA CCTCGCCCTC 
ATCTCCCTTG GCGCCCTTAT TGGCGCCTTT GCGGCGAGCT GCATCACCGT CCCGCTCCAG
CAGGTGGCAG GCGATCTCCA CACAAGCACC GGCCTTGCCT CCGCTTCTGT TATTGTATAC
CTTCTGGTCC TCTCGGGACT GTTCCTCTTT TTTGGGAAAC TTGGCGACGT ATGGGGATAT
CGCAGGGTGT TTCTTTTCGG CACTGCTGCA TTTGCCCTGG GATCGCTTCT CTGCGGTCTC
TCGGATACGG TAAACCAGCT TATCCTTTTC CGCATCATCG AGGCCGTAGG TGCAACGATG
GTAGCGGCCG TTGTCACCCC GTATGTTACC TGCCATGTTC CCGAAGCCTG GCACGGGCGG
GGATTTGCCT ATCTTTCGGG AGCGGCGGTA CTGGGGGTCA TCCTCGGGCC AACGCTCGGT
GTGGCAATCG CCGGTACGCT TTCCTGGCGC TGGATTTTTT TTGTCCTGGT GCCGGTCGGG
ATAGCGATCA TTGCAACCGG CTGGTTCACT CTCCCGGCGG GCGAAGGGAC TGCAAAGGGA
AGGAAGTTCG ATCTTCTCGG GGCCTGCCTT TTTTCCTTTG CCACCCTTGC ACTTGTCCTT
GCCATCAGTT CGGTGAATGT TTTTGGAATT GAAAGTCTCA TCACGGCGGG CCTTGTCATT
TTCACGTTTT CTTTCTGGAC GCTTGCGATC GTGTACGAAA GTACGACCGA AGACCCGGCC
TTTGAGCTCT CCCTGTTTCG GAACCGGCAT TTCACCAGTG CGAATATCTC GTTTTTCCTC
ATGAAGCTGG TCATCAACGG CCCGGTCTTC CTTTTCCCGT TCTACATGAC CCTTGTTTTG
GGCTATTCCT ATGCACTCAC CAGCCTCGTT GTCATCGTGC CCGCTGCCGT GATGCTCCTT
TCCACCCCGG TGGTGGGCAT CTTAACCGAC CGTTTCAGTG CACGGACACT CTGTCTCATC
GGTGCATCAG GATCGGCGGC GGTGTACCTG TTCTTTGCCG GGTTTACTCA GGATATCACG
CTCATTATGG CAGTCCTCGC ACTCGTCATC CTTGGACTTA CCCGGGGGGT ATTCATTGTC
CCCAATACAA AACTGATCAT GGACCACAGC CCTGTAGATA TGAAAGGCGC TGCATCCGGG
GTCATGAAGG CGCTCGGGAA TACCGGCATC ATCCTTGGGA TCGTGATCTT CCAGATCGCA
TTCTCCGAGA CGCTGATTGC CGCGGAAGCC GTAGGGCAGG TACACGATCC ATTCTCTGTC
CCGATGCCGG CGCTCGCAAG TGGGTTCCAG GCGGCGTTCT TCCTTGCCGC AGGACTCAGT
CTTGTTGCGG TGTTCTTTGC CTGGCATGCA CGGGACATTG CACCCGCGGA TACCGATCAG
GAAAGATCCT GA
 
Protein sequence
MDARPGAAAP ERYWRLNLAL ISLGALIGAF AASCITVPLQ QVAGDLHTST GLASASVIVY 
LLVLSGLFLF FGKLGDVWGY RRVFLFGTAA FALGSLLCGL SDTVNQLILF RIIEAVGATM
VAAVVTPYVT CHVPEAWHGR GFAYLSGAAV LGVILGPTLG VAIAGTLSWR WIFFVLVPVG
IAIIATGWFT LPAGEGTAKG RKFDLLGACL FSFATLALVL AISSVNVFGI ESLITAGLVI
FTFSFWTLAI VYESTTEDPA FELSLFRNRH FTSANISFFL MKLVINGPVF LFPFYMTLVL
GYSYALTSLV VIVPAAVMLL STPVVGILTD RFSARTLCLI GASGSAAVYL FFAGFTQDIT
LIMAVLALVI LGLTRGVFIV PNTKLIMDHS PVDMKGAASG VMKALGNTGI ILGIVIFQIA
FSETLIAAEA VGQVHDPFSV PMPALASGFQ AAFFLAAGLS LVAVFFAWHA RDIAPADTDQ
ERS
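
The gene and protein sequences are printed in space-separated blocks of ten characters. The sketch below is illustrative only and not part of the original record. It shows one way to strip that layout, compute a GC percentage, and translate codons. The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons). Only the first 60 bases of the gene sequence are used here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, built in TCAG order.
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGGATGCAC GGCCCGGGGC AGCCGCCCCT GAACGGTACT GGCGGCTCAA CCTCGCCCTC"
dna = clean_sequence(first_line)
peptide = translate(dna)
print(peptide)
```

The translated fragment can be compared by eye against the first two blocks of the protein sequence. Passing the complete gene sequence through the same helpers would allow the full protein and the reported GC content to be checked in the same way.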