Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_2047 |
Symbol | |
ID | 3997429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | - |
Start bp | 2155210 |
End bp | 2156430 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637959785 |
Product | hypothetical protein |
Protein accession | YP_566673 |
Protein GI | 91773981 |
COG category | [S] Function unknown |
COG ID | [COG1602] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000156756 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCAAAA GTCTATGCAT CCAATGTAAA GGAAAAGGGC TTTGCGGAAG ACCCCTGTGT CCAATTCTTG AGAAGTTCAG GTCTGCCGAA AAGACCACAA CTTCGATATC TTCAGATGGT TCTATTTTTG GTGCTTCCCC GCCAGCAGTC TTTGTGGGAA GATACGGTTA CCCACAGGTC AAGGCCGGAC CAATGATCCC GCCACAGGTG GATGCTAAGG ATGCAATGGC ACTGGAAGAC CCTAAGCATT GGCTTTCGAT GGATATCCAG GACATCATAT CTGCCAGATG CCAGCTAGTC CGGGCAAATA CGACCATAGA TGTGAAAAAT GCGAACAGAC CGGATAAGCT TCTGGAAAAA TCACAGGAAC TCGCACTGTC AAAATCACCC ATCGATACAG AAGCATGGTT CACCAAACCA TTGCAACAAG ACCTGAAGTT TGACAGTGTA CTAACTCCCA TGGGCCCTTC CGGGACCATG AAGGACTTTG ATATTGCAGA GAATCCCAAG GTCCCGAAAA AAGTAGATCA TCTTGCATAC GACACAGATG CTCTTGCAAA GGATGCTGTG TGTGAACTTT TTAAAGGGGA TATCCCCACT GAACATATTA CAAGATTGCT TTCCATAGGC TTGTTGGGAC AGGAACGGAA ACTTGTACCT ACCCGCTGGG CAATTACCGC CACAGATGAC ATGGTCGGGA AGGACATCAC CGACCGTGTG ATAGACCTAC CCCTTATCAG TGAGATATCA GTGTTCAGCG GAGGATGTTT CGGGAACTAT TTTGAGATAC TCATGACCCC CCGCAGATAT TCTTACGAGC TGCTCGAGAT ATGGATGAAA AGCTCGGTCT GGTCAGGAGA TTCTTCATGG ATAGGGCAGG ACATGGAGGA CATTAACGGT AAAAAGGGAT ATTCGAACCT TGCCGGAGGA TATTATGCCG CACGTATAGC CGCTCTGGAA CATCTTGAGA AGATACAAAG ACAGGCATCA GTATTTATGA TACGGGAAAT AACGCCTGAA TACTGGGCAC CGCTCGGGGT ATGGGTGGTC CGCGAGGCTG CCAGAAATGC ATTGTCTTCC ATACCACGAA CGTTCGAGAC CATTGAAGAA GCACTGGATG ACATGGCAAC GCGAGTGAGA ACGCCTTCGA AGCAATGGAA GGCAAAGGCA AAGATGCTTT CAGACATTCG ATTCCAGAGA ACACTGGACT CTTTTTTCTA A
|
Protein sequence | MSKSLCIQCK GKGLCGRPLC PILEKFRSAE KTTTSISSDG SIFGASPPAV FVGRYGYPQV KAGPMIPPQV DAKDAMALED PKHWLSMDIQ DIISARCQLV RANTTIDVKN ANRPDKLLEK SQELALSKSP IDTEAWFTKP LQQDLKFDSV LTPMGPSGTM KDFDIAENPK VPKKVDHLAY DTDALAKDAV CELFKGDIPT EHITRLLSIG LLGQERKLVP TRWAITATDD MVGKDITDRV IDLPLISEIS VFSGGCFGNY FEILMTPRRY SYELLEIWMK SSVWSGDSSW IGQDMEDING KKGYSNLAGG YYAARIAALE HLEKIQRQAS VFMIREITPE YWAPLGVWVV REAARNALSS IPRTFETIEE ALDDMATRVR TPSKQWKAKA KMLSDIRFQR TLDSFF
|
| |