Gene Mboo_1597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1597 
Symbol 
ID5410911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1667809 
End bp1669965 
Gene Length2157 bp 
Protein Length718 aa 
Translation table11 
GC content61% 
IMG OID640868831 
Productglycosyl transferase family protein 
Protein accessionYP_001404757 
Protein GI154151139 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0514449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.160515 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCGGCA GCGACTATCT CCGGAAGTAC AAAACAGAAT TCCTGCTTGC CGGGATCCTC 
TCCCTTGCCC TGTTCCTGAA CCTCTGGGGC ATCTGGAACC AGGGGATTTC GAATTCGTTT
TATGCTGCTG CGGTAAAAAG TGCGCTCGTC AATCCCCCGG TTGCCCTTTT TAACTCCTTT
GATCCGGCGG GATTTGTCAC GATTGACAAG CCCCCGGTGG GCATCTGGGT CCAGGCGGCA
TTTGCTTTTG TCCTTGGTTT CTCGGGCTGG GTCCTTGTCC TCCCGCAGGC CCTTGCCGGC
GTGGGATCCG TTCTCCTCGT TTATCTTATC GTCTCCCGCC CGTTTGGGAA ACCGGCCGGT
CTTGTTGCTG CCCTTGCCTT AGCGGTAACA CCCATCCTTG TTGCGGTCTC ACGGAACGGC
ACCATGGACT CCCAGCTCAT CTTTGTCCTT CTTCTCGCGC TCTGGGCGGT CTTAAAAGCT
GCACGGGAGC GGTCCCTGCC CTGGCTCCTT GTTTCGGTTG TCCTTATCGG GATCGGTTTT
AACATCAAGA TGATTCAGGC ATTCGTGGTT GTGCCTGCGA TCCTTGCGGT CTATTTCCTT
GGCGCTACGG ATCTTTCGTT AAAAAAGAAG GGACTCCATC TTGGGCTTGC CGTCCTTGTC
CTTCTTGCCG TCTCACTTTC ATGGGCGGTC GCGGTTGATG CGATCCCCGC AGACCAGCGG
CCCTACATCG GCGGCAGCGG CGACAACACG GTCCTTGGCC TTATTGTCAA TTACAATGGC
CTCGAACGCC TCGGACTCGA AGGCCGCGGC ATGGGCGGAG CATTCAATGG TACCCGGGAC
CCGGGGGCGG GAGCCGCGGG CGAATACCGT TCGGGCAGTA CACGGCTGAC CGGTGCGGGG
GAACGGACGG GTGCAGGAGC GGCGTCCGCG GGGACAACAG CCCAGGCTGC ATTCACCCTG
GACCCGGCAC GCGGAGGCAC GGCTGCCGGC GGCATGGCAA ACGGTGATGG TACGCCGGGC
ATCACAAGGT TCTTTGGCGA AGGGCTTGCC GGCCAGGTCT CCTGGCTTAT TCCCCTTGCA
CTGATTGGGC TCCTTGCATG GATACGAAAA CCGGCGGCCC TCTCGATAAG GTCGCTTGAA
GATGCCGGCC TGACCAGCGA GCGGGGAGTT CTCCTTGCCG CACTGCTCCT CTGGTTTGTG
CCCGGCCTGT TCTTCTTCAG CTTCTCGACC GCATTTGCGC ATACCTACTA TATCGCGACC
ATCACGCCGC CGCTTGCCGG CCTTGTCGGT ATCGGGGCTG CGGGCATGTA CCAGCAGTAC
ATGGCAGGGG GCAGGAAAGG CTGGATCCTT GTCGGCGGCA TTCTTGCCAC CGGACTCTGC
CAGGCTCTCT TTCTTTCCTA TGCCATCCAC TGGGACGGCC TGCTTATCCC GGTCATTCTT
GTTGGGACAC TTGCCTGTGC AGGTCTCCTT GCATACTTCC TTGCCCGGGA CCACCCGGTG
CCGCACAATC ACAAAAAAAT CCTTGCGGTT GTTGCACTCG GCCTTCTCTT CATTGCCCCC
CTTGCGTGGT CCTCCACGCC CATCCTGTAT GGGGACCGGG ATGCGGTGGC CGGCCCGCCC
ACGACCGTAT CCGGCGCGGG ATATGCACAG GCAGGCTTTG CCCTCGCCGG CGCGGACCGG
ATCCCTGCCA TGGACGGGAA CGCCAATCGT GGGGCGGGAG GTTTTGCGCT GACCGGTGCG
AGAGCCGCAA CGGCCCGTAA TGCCACGGCA TATACCGGAC AGGGCAGTTC AACCGACACG
GCGCTCATCA ATTACCTGCT TACCCATACC ACAAATGAGA CATGGATCCT TGCAACGCCG
AGCAGCCAGT CCGCATCCCC CATCATTGTG GCGACAGGAA AACCGGTCAT GGCAATCGGG
GGATTCTCGG GCAGCGACCG GATCCTCACA GCGCAGTCAT TTGCAGCGCT GGTCAGTGAG
GGCAAGGTCC GGTACTTCCT TGGCGGAGGT ACCGCAATGG GAGGAGCGGG TGGAGGGAAC
AGTGCGGTTG CCACATGGGT GGAGGAGCAC TGCCGGGCAA TCGTTCTCCC TGCAGGAAAC
GGGACTGCGG GAGCCGGATC CCTGTACGAT TGTGCAGGCG CCACATCCGG CTCATGA
 
Protein sequence
MIGSDYLRKY KTEFLLAGIL SLALFLNLWG IWNQGISNSF YAAAVKSALV NPPVALFNSF 
DPAGFVTIDK PPVGIWVQAA FAFVLGFSGW VLVLPQALAG VGSVLLVYLI VSRPFGKPAG
LVAALALAVT PILVAVSRNG TMDSQLIFVL LLALWAVLKA ARERSLPWLL VSVVLIGIGF
NIKMIQAFVV VPAILAVYFL GATDLSLKKK GLHLGLAVLV LLAVSLSWAV AVDAIPADQR
PYIGGSGDNT VLGLIVNYNG LERLGLEGRG MGGAFNGTRD PGAGAAGEYR SGSTRLTGAG
ERTGAGAASA GTTAQAAFTL DPARGGTAAG GMANGDGTPG ITRFFGEGLA GQVSWLIPLA
LIGLLAWIRK PAALSIRSLE DAGLTSERGV LLAALLLWFV PGLFFFSFST AFAHTYYIAT
ITPPLAGLVG IGAAGMYQQY MAGGRKGWIL VGGILATGLC QALFLSYAIH WDGLLIPVIL
VGTLACAGLL AYFLARDHPV PHNHKKILAV VALGLLFIAP LAWSSTPILY GDRDAVAGPP
TTVSGAGYAQ AGFALAGADR IPAMDGNANR GAGGFALTGA RAATARNATA YTGQGSSTDT
ALINYLLTHT TNETWILATP SSQSASPIIV ATGKPVMAIG GFSGSDRILT AQSFAALVSE
GKVRYFLGGG TAMGGAGGGN SAVATWVEEH CRAIVLPAGN GTAGAGSLYD CAGATSGS