Gene Mboo_0388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0388 
Symbol 
ID5410619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp376396 
End bp377796 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content55% 
IMG OID640867602 
Productmajor facilitator transporter 
Protein accessionYP_001403551 
Protein GI154149933 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00816882 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAT CCAACTTGCC TCAACCCGCC GAAAGTAAAA TAGCGTTCGA CTGGCGGTTT 
GTAACACCGC TGTATATCGG CTCCGCCCTC AATCCTGTCA ACACTTCGTT CATTGCCACT
GCCCTGGTGC CAATTGCAGC GGCCATCAAC GTTCCGGTCG GACAGACTGC TGTTCTTGTC
GCGGCACTCT ATATCGCATG TATCGTTGCC CAGCCGGCAG CCGGAAAATT ATCGGAGGCA
TTTGGGCCAC GGAGGGTGTT CCTTGCAGGT ATTCTCGCAG TACTTGCCGG AGGAGTGCTG
GGTGGATTAG GCCATGACCT CGCAACGCTG ATCGTATCAC GGGTCCTGAT TGGTGTGGGC
ACCTCGACCG GATATCCTTC GGCAATGCTT TTGATCCGAC AGCGGGCCGA ATCGGCCGGG
CTGACCGGGC CCCCGGGAGG AGTGCTTGGC GGCCTTGTGA TTGCCGGAAT GGCGACTGCG
GTTATAGGTC TGCCCATTGG CGGATTCCTC GTCGCCGCCT GGGGCTGGCA GAGCGTGTTT
TTTATTAACG TCCCGCTGGC TCTCGTGGCG CTCATTATGG CTGCATCTTG GATCCCCCGG
GATCCGCCAT GCAGGAGCAT AAAGACGCTC CGTGACCTGG CAACCCGCAT TGATCTGGCC
GGCATCACGG TCTTTAGTGG CGCGATGATT GCCCTTCTGG TCTTTCTCAT GTCACTGCCG
GATCCGGATT GGGTTGTTTT AGGCGTAGTT ATTCTGCTCG GTCTGGCCTT TGTCTGGTGG
GAAGGACAGG TGAGCCAGCC TTTTATTGAC CTCCGTCTGT TAGGAACGAA CCGGCCATTG
ATACTCACCT ATGTGCGCTT TGCCCTTGCG ATGCTGTGCG TCTACACCGT AATGTATGGT
GTCACGCAAT GGCTTGAGAT CGACAAAAAT ATTTCGTCCG CTGATGCAGG ATTCATCATT
TTGCCCATGA GTCTCATATC CATTGTGCTA GCGTGGCTGG TCTCGCGGCT GAACCTCGTG
CGCACTCCCC TTATTGTGTC TGCCGTTGCC TGCCTGGCAG GTTCTGCGGG CGTATTTTTA
TTCACCACGG CGACGCCGAT ACTCTGGATC GTTATAATCA CGGCGATCTT CGGGATTACC
ATGGGGATGT GTGCCAGTGC GAACCAGACA ACATTTTACA CCCAGGTAAC CGCAGATCAG
ATCGGTACCG CTTCAGGCCT GTTCCGTACT TTTGGGTATT TTGGCTCGGT TGCATCGTCG
GCCCTTATCG CGATATTCTT TAATCCCGAT GTCAGCGATC AGAGCCTGCA TTCAATTGCT
GCAGTACTGG TGATCCTCAG CGTTGTGGGA CTGCTTATTG TCATTGCCGA CAGGAAAATC
ATGGCTCTGG CAAAAGTATA G
 
Protein sequence
MNTSNLPQPA ESKIAFDWRF VTPLYIGSAL NPVNTSFIAT ALVPIAAAIN VPVGQTAVLV 
AALYIACIVA QPAAGKLSEA FGPRRVFLAG ILAVLAGGVL GGLGHDLATL IVSRVLIGVG
TSTGYPSAML LIRQRAESAG LTGPPGGVLG GLVIAGMATA VIGLPIGGFL VAAWGWQSVF
FINVPLALVA LIMAASWIPR DPPCRSIKTL RDLATRIDLA GITVFSGAMI ALLVFLMSLP
DPDWVVLGVV ILLGLAFVWW EGQVSQPFID LRLLGTNRPL ILTYVRFALA MLCVYTVMYG
VTQWLEIDKN ISSADAGFII LPMSLISIVL AWLVSRLNLV RTPLIVSAVA CLAGSAGVFL
FTTATPILWI VIITAIFGIT MGMCASANQT TFYTQVTADQ IGTASGLFRT FGYFGSVASS
ALIAIFFNPD VSDQSLHSIA AVLVILSVVG LLIVIADRKI MALAKV