Gene Information |
Locus tag | Mboo_2377 |
Symbol | |
ID | 5410670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 2444579 |
End bp | 2446237 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640869633 |
Product | extracellular solute-binding protein |
Protein accession | YP_001405534 |
Protein GI | 154151916 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
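The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2444579
end_bp = 2446237

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; a non-zero remainder would indicate a partial codon.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1659 bp) and Protein Length (552 aa).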
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000449683 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGTA CAGTACTCTC CGAAACAAAG ACATACCGTG CCGGTATTAT TTATGCCCTC TGTATGTCGG GCATCCTGGT TTTCTGTCTT GTTGCCGGAT GCACGAACAG CGTTTCAACG TCCCCAGATA CCGGAAACCT GACCGGCGTC CCTGCCAATG ATGTCATCAT CCCGGTCGAT TCCCCATCAT CGCTCATGTA CACCAGCAAC ATGCAGAAAG GCGGTGTCCC GGGAAGCTCG CTTATTTACG AAGGGCTTGT GATTAAGGAC CGGAACGGCA TCTTCGATCC CGCGCTCGCA CAGAGATGGA GTGTCTCACC GGATGCAAAG ACCTGGACAT TCGACCTTGT ACAGAACGCT ACATGGAGCG ACGGCGTTCC TTTCACCTGC AACGAGGTCA AGTTCACCAA CGATTACATG AAAGCCAACA ACCTGACCAT GGGTTACGTT CTCTCTGACG TACAGTCCGT GGAATGCCCT GACAATTACA CGGCAGTCTT CAACCTCAAG ACCCCGTACT CCGCATTCCT CGACCAGATC TCAAGAACCC CGGGGATCAC CATCTCGCCT GCGCATATCT GGCAGAACAT CTCCGATCCC CAGCATTACA AGGACAACCA GATGATCGGG ACCGGGCCGT TTGTCTTTGC CCAGGCAGCT CCCGGGTATT ACCAGTTTTC CACCAATGAA AATTACCACG GGCGGGTTCC CACTATCCCC GGTGTGGTTC TCAAGGTGAT CACAAACGCC GACAGCCAGG TTCTCGCGCT CAAAAACGGC GAGATCGATG TGGTCTCCGG CCTCACTCCC GCCGTTGCCC AGAGCCTGTC CGGCAATGCT AACATCTCCA TCTACTCGAT CAACGACACC GGAGCCTGTG AAGTTGCATT CAACATGGCC CAGTACCCGG CAAACATCTC CGCGTTCCGG CACGCGATGA GCCACGCGAT CGACCGGGAT ACCATCAGTT CCCTCTTTGG CACCGGCCGG CCCACGGAGA CAACCTTCCT GATCCCGGAT CTCGCCGGGG ATTACGTTAA CCCGGCCGAT GTCGGGATGT ACAACTATAA CCTGACCGAG GCGCAGGAAC TCCTCGCACA GGCCGGTTTT GTCAGGAACG CAAACGGGGT CCTCATTGGA CCCGATGGTA ACCCCGTCAC CATCACCATC CCCCTGGGCA CCAAAGGCGC CGATGTGAAC GATAAGATCA TTGCGGTCCT CAAGAACGAC TGGGCACAAC TCGGGATCAG CGTGAGCACC CTCAATTACC AGGACGCCAC CCAGTACCGC AACGCGGTCA ATGCCAACCC GGTCTTTATT GACTCCTTCC CGGTCCTCCT CCACGATGAC CCGGATGCAC TGGGCAATTT TGCGGTCACT CCCCTGCAGG AGACCAACTA CTACAACTAC AATGACCCTG AGTACAACCG CCTCGTTGCC CGGGTAAAGA ATACCACGGA CCCGGTTGAG GTAAAGGAGA TGACATACCA GCTTCAGGAT CTTCTGGCCC AGGATATCCC CACGGTACCC GTTGCTACCA CGGATACCCT GGTGGCATAC CGGTCGGACC GGTTTGTCGG CTGGGACATC GGGCCCGGAT ACCACAGCAC CATGGACCCA AGAGTCCTCG AAAACCTCAC ACCGGTACAG CAGACATAA
|
Protein sequence | MKSTVLSETK TYRAGIIYAL CMSGILVFCL VAGCTNSVST SPDTGNLTGV PANDVIIPVD SPSSLMYTSN MQKGGVPGSS LIYEGLVIKD RNGIFDPALA QRWSVSPDAK TWTFDLVQNA TWSDGVPFTC NEVKFTNDYM KANNLTMGYV LSDVQSVECP DNYTAVFNLK TPYSAFLDQI SRTPGITISP AHIWQNISDP QHYKDNQMIG TGPFVFAQAA PGYYQFSTNE NYHGRVPTIP GVVLKVITNA DSQVLALKNG EIDVVSGLTP AVAQSLSGNA NISIYSINDT GACEVAFNMA QYPANISAFR HAMSHAIDRD TISSLFGTGR PTETTFLIPD LAGDYVNPAD VGMYNYNLTE AQELLAQAGF VRNANGVLIG PDGNPVTITI PLGTKGADVN DKIIAVLKND WAQLGISVST LNYQDATQYR NAVNANPVFI DSFPVLLHDD PDALGNFAVT PLQETNYYNY NDPEYNRLVA RVKNTTDPVE VKEMTYQLQD LLAQDIPTVP VATTDTLVAY RSDRFVGWDI GPGYHSTMDP RVLENLTPVQ QT
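The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons), and it translates only the first 30 bases and the last 9 bases of the gene sequence rather than the full record. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and last three codons of the gene sequence above.
first_bases = "ATGAAAAGTA CAGTACTCTC CGAAACAAAG"
last_bases = "CAGACATAA"

print(translate(first_bases))
print(translate(last_bases))
```

The two fragments translate to the first ten and last two residues of the protein sequence field (MKSTVLSETK and QT).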