Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_2129 |
Symbol | |
ID | 3998212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 2232800 |
End bp | 2236714 |
Gene Length | 3915 bp |
Protein Length | 1304 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637959865 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_566752 |
Protein GI | 91774060 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01567] S-layer-related duplication domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00431402 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC ATTTTAAATT TATACTAATA AGCTTGCTCA TTCTTGGAAT GAGTTGTGGG GTGGCGATGG CTGCTGCTCC TGTATTATCC GCCGGGATTG TAGCTCCGGG AAGTGGTGAT GTCAGCACGC CTTTCGTTTT TACTGTAACT TTTACAGATG CGGATAACGA TACAGCAACT GCTGTTGATG TAAACATCGA TGGTGTGCTC TACAATATGA CTGAAACAAA CCTTGCAGAT ATAAATTCCA CTGATGGGAA AGGTTATACT CATACTGCAT CTGGTTTCAG TGTTGCAACA CACAATTTTA ATTTTTCAGC TTCTGATGGA ACAACTCCTG TAGGTCCATC AGGTGCAGGA ACTTTCTCGG TTTTAAATTC AGCTCCTACT ATAGCAGGTT CTGTAAATCC TAGTAGTGGA AGTGGTACTG ACACTTTCGT TTTCACTGCA ACCTTTACAG ATGCTGATAA TGATCCTGCA AGTTTTGCCA ATGTCTCTAT TGATGGCGCG AACTATACTA TGACAGCTAC AGATGCTGGT GATATAACAA CAACCGATGG GAAAACTTAC TCTTACAGTA ATTCAGGCTT CAGCACGGCT ACTCATAGTT ACCAGTTCTT TGCGTCTGAT GGGACTGTTG CAGTCGGTCC TTCAAGTGCA GGAACATTTG TTGTTAACAA TAATCCCACA TTGACAGTAG ATTCAGTTTC ACCAACAAGT GGAACTGTTG CAACCATTTT TACTTTCAAT GTCACATATA CAGATGGTGA CAACGAATTT CCTTCTCCTA TTAATGTCTT TATCAATGGT ACGAGCTATG CAATGAATCA GGTTGATGCA AGTGATACTA ATGTTACAGA TGGTAAGCTT TATTCTTATA CTAAAGGAGG CTTCCCTGTT GCAACTCATA ATTACCAATT CAAAGCTTCA GATGGAAATG TTTCCGCAAG TGATACGACT CTGACTACAT TTTCAGTTGT TTCCGATATA CCAACATTAA CTTCTGGATC AGTAACCTCT AATCCTGATT CAACTGTTGG TTCTACTTTC ATGTTCAATG TGACATATAC TGACAATGTT GCAACTGATC CAAGTTACAT GCAGGTAAAT ATCACAGGAA CTAATTATGG GATGAGCAAA CTTGATCCAA GTGATGTTGA TGCTACGGAT GGTATCAAGT ACACATATAC AACAACATCA CTGGCGGTAG GGGACCACAA TTATTATTTC AAGGCATCAG ATAGTTCAAA TCCAATAAAC ACTACGACTT CCTCGGTAAC TGTCAATACT GCAACATATT TCTCAGGTGA CCGTATCTGG GATGAAAATG CTGGTCAGTC CACCAAGTAC ATTTGGGATG CATTGAGTTA TTCCGGCTTT TTCTATGACC TTGAATCAGG TCTTGGTTCC GAAAAAATGA CAATGGATGA CATTGACAGG AACATCGGCA AGGGCGACCT TGAATATGAG ACCACACCTA TCGAAACTGA CTTTGAGTAT GGAGCATGGG GATCTTACGA GGTTATTGGT TTCATGGCAG AGAAGTACTT CGCAGGATAT ACTGCCAATA CTTCTTCAGA TGTTTCAGAT AAAGTTATAA GCCTGATGTC CAAAGGTCAC CTGACAAAGG TGCTCATCGA TAGCGATGAT AAGGAGCGTG TCTATAGTGG TTCAGGTCTT GTGCTTGAAG AAGGCTATGT ACTTAATGTT GTGGAAGTCG ATAAGAACGG TGACAAGATA TTTGTTTCAC TCTCCAAGGA CGGGGATGAA GTGGATGAAA TCGTAGTTTC TTCCGGTGAT ACCTATGTCT ATGAAAAGGA TCTCGGTGAC GTTGATGATG TGCCTATTAT AGCAGTGAAC TTCGATGAGA TATTTAGTGG AACTGAGACA AATGCTGTTT TCATTGAAGG AATATTCCAG ATCTCTGATA AATACGAGGA CATTAACACA GGTGATGAAT ACGGGGCAAT GAAAGTCAAA TCCATAAGTT CTACTATGAT CAGGATGGAG AACGAAGACA CTATCAGTCT CGACAAGGGA GATATTGCTG ATATTATGGG CAAGCTCAAA TTCGTAGTTG CAGATCACAA TGATCTCAGA TTTGCACCGT TCGTTGACAT GTCCGAACCG GGAACATATG AACTCAGGGG TACTATTGCT GAAGACAAAG GTCTGAAATG GACTCCTCTT AACTTTGAAG GGTTCTATTA TAATATTGAC GAAGGCATTG GCACGGAATC ACTGAATCTG ACCTATACAG GCAGAACAAT TAATGATAAT GATCTTGTTT ACACTACCAG TCCTTCAAGT GTAAGCTTTG AGTATTCCGG ATGGGATAAT TATACTGTTA TAGGTTTCAT GGCAGAGAAA TACTTTGTTG GCTTCCCCAA CGATCCATTT AATGATGGTG AGGTTTCTGC TCTGAGCCTT ATGTCCCAGG GTCAGCTTTC AAAAGTGCTG ATCGATGACG ATGACAAGAA ATCAATATTT GGCGGTTCAT CACTCATTCT TGAAGAAGGA TACTCACTTG ATGTTGTTGA AGTGAACAAG GATGGTGACA AAGTATTCGT TGAGCTTAGC AAGGATGGGG ATGAAGTGGA TGAGCAGGTA CTATCCTCAG GTAAGACCTA TACCTATAAG AAAGACCTTG GGGAAGTTGA TGATGTACCT ATTATTTCAG TAAATTTCAA AGAAATATTC AGCGGAACTG AGACTAATGC TGTTTTTGTT GAAGGTATCT TCCAGATATC TGATAAATAT GAGGAACTGA ACACTGGTGA TGACTATGGT AAAATGGAAA TAAAAACCAT TAGTTCCATA AAGATCGAAA TGAAGAACAA GGATTCCATC TCACTCTCAA AAGGGGATGA GGTAGAGATC ATGGGTGATA TCAAGTTCAA GGTAGCTGAT TCCAGTTCTA ATGTGCGCTA CTATCCATTC GTTGAGATAA GCACAGCACC TGCTGATTCA CTTGATGTAG ATGTTGATCC TGAGGTTGTT AGTGAAGGCG ATAAGATAAC AGTTACTGTG ACATCACGTG GTTCTCTTAT CAATGGTGTG ACAGTGAAGG CAGGAAGTAT CGTACTAGGT ACAACTGACA ATGAAGGAGA GGTTGACTAC ACGTTCCATG CAGATGGGAC CTATACTATA ACTGCTGAAA AGGACGATTA CGTTACTGGT GAAGCATCTC TTGAAGTCAT TTCTCCGGAT GACGAGTCGA GGAAGATGAG TATTGAAATT TCACCTGAAG TGATCTATGA AGGCAATCTG GTAACTTTCA CAGTTGTCAA ATCCATTGGA GGAGATGCCC TTGAAGATGT GGATGTGACC ATTGATGGAA AATCCATTGG TGAAACTGAC AGTGACGGCG TCGTGACCTA TGTACTGAAA GATATCGGTA TGCACAAGAT AACTGCCGAG AAGGAAGGTT TCCTTGAAGC AGAAGACAAC ATTGAGGTCA AGGAGCTTGA AGCAAAGTTC GAGTTCAGTA ACCTTGTGGT CACTCCACTT GAGGTCAAGT CCGGAAAGGA CGTGAATGTC ATATTGGATG CTGTCAACAA CGGTAAAGCA GCAGGTTCTT ACACGGTAGA GCTTGTCGTT AATGACAACA CTACCGCTAC ACAGGAGATC TCGCTTGGTG TTGGTGAATC CACTCAGGTC GAATTCGAGT ACACTGCCGG TGAACCGGGT ACTTATCTTG TGAAAGTAGA TAGTATGACC GCAACAGTTG AAGTGGTCAA TGGTGCAGGC GCAGTTACCT ACTTGCTTGG TGGTGTAGCA ATTGCAGTTC TTGGTGGAGC AGTCTATCTG TTTACTGCAG GTGGCTGGAC CGTATCGACA GCTGGTGCAA AAGCAGGTGA AGCAGCAGCA ACACTTTCTG AGAAGTTATC CAGCTTGCTT TCAAGGGGCA AGTAA
|
Protein sequence | MNKHFKFILI SLLILGMSCG VAMAAAPVLS AGIVAPGSGD VSTPFVFTVT FTDADNDTAT AVDVNIDGVL YNMTETNLAD INSTDGKGYT HTASGFSVAT HNFNFSASDG TTPVGPSGAG TFSVLNSAPT IAGSVNPSSG SGTDTFVFTA TFTDADNDPA SFANVSIDGA NYTMTATDAG DITTTDGKTY SYSNSGFSTA THSYQFFASD GTVAVGPSSA GTFVVNNNPT LTVDSVSPTS GTVATIFTFN VTYTDGDNEF PSPINVFING TSYAMNQVDA SDTNVTDGKL YSYTKGGFPV ATHNYQFKAS DGNVSASDTT LTTFSVVSDI PTLTSGSVTS NPDSTVGSTF MFNVTYTDNV ATDPSYMQVN ITGTNYGMSK LDPSDVDATD GIKYTYTTTS LAVGDHNYYF KASDSSNPIN TTTSSVTVNT ATYFSGDRIW DENAGQSTKY IWDALSYSGF FYDLESGLGS EKMTMDDIDR NIGKGDLEYE TTPIETDFEY GAWGSYEVIG FMAEKYFAGY TANTSSDVSD KVISLMSKGH LTKVLIDSDD KERVYSGSGL VLEEGYVLNV VEVDKNGDKI FVSLSKDGDE VDEIVVSSGD TYVYEKDLGD VDDVPIIAVN FDEIFSGTET NAVFIEGIFQ ISDKYEDINT GDEYGAMKVK SISSTMIRME NEDTISLDKG DIADIMGKLK FVVADHNDLR FAPFVDMSEP GTYELRGTIA EDKGLKWTPL NFEGFYYNID EGIGTESLNL TYTGRTINDN DLVYTTSPSS VSFEYSGWDN YTVIGFMAEK YFVGFPNDPF NDGEVSALSL MSQGQLSKVL IDDDDKKSIF GGSSLILEEG YSLDVVEVNK DGDKVFVELS KDGDEVDEQV LSSGKTYTYK KDLGEVDDVP IISVNFKEIF SGTETNAVFV EGIFQISDKY EELNTGDDYG KMEIKTISSI KIEMKNKDSI SLSKGDEVEI MGDIKFKVAD SSSNVRYYPF VEISTAPADS LDVDVDPEVV SEGDKITVTV TSRGSLINGV TVKAGSIVLG TTDNEGEVDY TFHADGTYTI TAEKDDYVTG EASLEVISPD DESRKMSIEI SPEVIYEGNL VTFTVVKSIG GDALEDVDVT IDGKSIGETD SDGVVTYVLK DIGMHKITAE KEGFLEAEDN IEVKELEAKF EFSNLVVTPL EVKSGKDVNV ILDAVNNGKA AGSYTVELVV NDNTTATQEI SLGVGESTQV EFEYTAGEPG TYLVKVDSMT ATVEVVNGAG AVTYLLGGVA IAVLGGAVYL FTAGGWTVST AGAKAGEAAA TLSEKLSSLL SRGK
|
| |