Gene Mboo_2377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2377 
Symbol 
ID5410670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2444579 
End bp2446237 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content56% 
IMG OID640869633 
Productextracellular solute-binding protein 
Protein accessionYP_001405534 
Protein GI154151916 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000449683 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGTA CAGTACTCTC CGAAACAAAG ACATACCGTG CCGGTATTAT TTATGCCCTC 
TGTATGTCGG GCATCCTGGT TTTCTGTCTT GTTGCCGGAT GCACGAACAG CGTTTCAACG
TCCCCAGATA CCGGAAACCT GACCGGCGTC CCTGCCAATG ATGTCATCAT CCCGGTCGAT
TCCCCATCAT CGCTCATGTA CACCAGCAAC ATGCAGAAAG GCGGTGTCCC GGGAAGCTCG
CTTATTTACG AAGGGCTTGT GATTAAGGAC CGGAACGGCA TCTTCGATCC CGCGCTCGCA
CAGAGATGGA GTGTCTCACC GGATGCAAAG ACCTGGACAT TCGACCTTGT ACAGAACGCT
ACATGGAGCG ACGGCGTTCC TTTCACCTGC AACGAGGTCA AGTTCACCAA CGATTACATG
AAAGCCAACA ACCTGACCAT GGGTTACGTT CTCTCTGACG TACAGTCCGT GGAATGCCCT
GACAATTACA CGGCAGTCTT CAACCTCAAG ACCCCGTACT CCGCATTCCT CGACCAGATC
TCAAGAACCC CGGGGATCAC CATCTCGCCT GCGCATATCT GGCAGAACAT CTCCGATCCC
CAGCATTACA AGGACAACCA GATGATCGGG ACCGGGCCGT TTGTCTTTGC CCAGGCAGCT
CCCGGGTATT ACCAGTTTTC CACCAATGAA AATTACCACG GGCGGGTTCC CACTATCCCC
GGTGTGGTTC TCAAGGTGAT CACAAACGCC GACAGCCAGG TTCTCGCGCT CAAAAACGGC
GAGATCGATG TGGTCTCCGG CCTCACTCCC GCCGTTGCCC AGAGCCTGTC CGGCAATGCT
AACATCTCCA TCTACTCGAT CAACGACACC GGAGCCTGTG AAGTTGCATT CAACATGGCC
CAGTACCCGG CAAACATCTC CGCGTTCCGG CACGCGATGA GCCACGCGAT CGACCGGGAT
ACCATCAGTT CCCTCTTTGG CACCGGCCGG CCCACGGAGA CAACCTTCCT GATCCCGGAT
CTCGCCGGGG ATTACGTTAA CCCGGCCGAT GTCGGGATGT ACAACTATAA CCTGACCGAG
GCGCAGGAAC TCCTCGCACA GGCCGGTTTT GTCAGGAACG CAAACGGGGT CCTCATTGGA
CCCGATGGTA ACCCCGTCAC CATCACCATC CCCCTGGGCA CCAAAGGCGC CGATGTGAAC
GATAAGATCA TTGCGGTCCT CAAGAACGAC TGGGCACAAC TCGGGATCAG CGTGAGCACC
CTCAATTACC AGGACGCCAC CCAGTACCGC AACGCGGTCA ATGCCAACCC GGTCTTTATT
GACTCCTTCC CGGTCCTCCT CCACGATGAC CCGGATGCAC TGGGCAATTT TGCGGTCACT
CCCCTGCAGG AGACCAACTA CTACAACTAC AATGACCCTG AGTACAACCG CCTCGTTGCC
CGGGTAAAGA ATACCACGGA CCCGGTTGAG GTAAAGGAGA TGACATACCA GCTTCAGGAT
CTTCTGGCCC AGGATATCCC CACGGTACCC GTTGCTACCA CGGATACCCT GGTGGCATAC
CGGTCGGACC GGTTTGTCGG CTGGGACATC GGGCCCGGAT ACCACAGCAC CATGGACCCA
AGAGTCCTCG AAAACCTCAC ACCGGTACAG CAGACATAA
 
Protein sequence
MKSTVLSETK TYRAGIIYAL CMSGILVFCL VAGCTNSVST SPDTGNLTGV PANDVIIPVD 
SPSSLMYTSN MQKGGVPGSS LIYEGLVIKD RNGIFDPALA QRWSVSPDAK TWTFDLVQNA
TWSDGVPFTC NEVKFTNDYM KANNLTMGYV LSDVQSVECP DNYTAVFNLK TPYSAFLDQI
SRTPGITISP AHIWQNISDP QHYKDNQMIG TGPFVFAQAA PGYYQFSTNE NYHGRVPTIP
GVVLKVITNA DSQVLALKNG EIDVVSGLTP AVAQSLSGNA NISIYSINDT GACEVAFNMA
QYPANISAFR HAMSHAIDRD TISSLFGTGR PTETTFLIPD LAGDYVNPAD VGMYNYNLTE
AQELLAQAGF VRNANGVLIG PDGNPVTITI PLGTKGADVN DKIIAVLKND WAQLGISVST
LNYQDATQYR NAVNANPVFI DSFPVLLHDD PDALGNFAVT PLQETNYYNY NDPEYNRLVA
RVKNTTDPVE VKEMTYQLQD LLAQDIPTVP VATTDTLVAY RSDRFVGWDI GPGYHSTMDP
RVLENLTPVQ QT