Gene Mbur_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_2047 
Symbol 
ID3997429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp2155210 
End bp2156430 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content48% 
IMG OID637959785 
Producthypothetical protein 
Protein accessionYP_566673 
Protein GI91773981 
COG category[S] Function unknown 
COG ID[COG1602] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000156756 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGCAAAA GTCTATGCAT CCAATGTAAA GGAAAAGGGC TTTGCGGAAG ACCCCTGTGT 
CCAATTCTTG AGAAGTTCAG GTCTGCCGAA AAGACCACAA CTTCGATATC TTCAGATGGT
TCTATTTTTG GTGCTTCCCC GCCAGCAGTC TTTGTGGGAA GATACGGTTA CCCACAGGTC
AAGGCCGGAC CAATGATCCC GCCACAGGTG GATGCTAAGG ATGCAATGGC ACTGGAAGAC
CCTAAGCATT GGCTTTCGAT GGATATCCAG GACATCATAT CTGCCAGATG CCAGCTAGTC
CGGGCAAATA CGACCATAGA TGTGAAAAAT GCGAACAGAC CGGATAAGCT TCTGGAAAAA
TCACAGGAAC TCGCACTGTC AAAATCACCC ATCGATACAG AAGCATGGTT CACCAAACCA
TTGCAACAAG ACCTGAAGTT TGACAGTGTA CTAACTCCCA TGGGCCCTTC CGGGACCATG
AAGGACTTTG ATATTGCAGA GAATCCCAAG GTCCCGAAAA AAGTAGATCA TCTTGCATAC
GACACAGATG CTCTTGCAAA GGATGCTGTG TGTGAACTTT TTAAAGGGGA TATCCCCACT
GAACATATTA CAAGATTGCT TTCCATAGGC TTGTTGGGAC AGGAACGGAA ACTTGTACCT
ACCCGCTGGG CAATTACCGC CACAGATGAC ATGGTCGGGA AGGACATCAC CGACCGTGTG
ATAGACCTAC CCCTTATCAG TGAGATATCA GTGTTCAGCG GAGGATGTTT CGGGAACTAT
TTTGAGATAC TCATGACCCC CCGCAGATAT TCTTACGAGC TGCTCGAGAT ATGGATGAAA
AGCTCGGTCT GGTCAGGAGA TTCTTCATGG ATAGGGCAGG ACATGGAGGA CATTAACGGT
AAAAAGGGAT ATTCGAACCT TGCCGGAGGA TATTATGCCG CACGTATAGC CGCTCTGGAA
CATCTTGAGA AGATACAAAG ACAGGCATCA GTATTTATGA TACGGGAAAT AACGCCTGAA
TACTGGGCAC CGCTCGGGGT ATGGGTGGTC CGCGAGGCTG CCAGAAATGC ATTGTCTTCC
ATACCACGAA CGTTCGAGAC CATTGAAGAA GCACTGGATG ACATGGCAAC GCGAGTGAGA
ACGCCTTCGA AGCAATGGAA GGCAAAGGCA AAGATGCTTT CAGACATTCG ATTCCAGAGA
ACACTGGACT CTTTTTTCTA A
 
Protein sequence
MSKSLCIQCK GKGLCGRPLC PILEKFRSAE KTTTSISSDG SIFGASPPAV FVGRYGYPQV 
KAGPMIPPQV DAKDAMALED PKHWLSMDIQ DIISARCQLV RANTTIDVKN ANRPDKLLEK
SQELALSKSP IDTEAWFTKP LQQDLKFDSV LTPMGPSGTM KDFDIAENPK VPKKVDHLAY
DTDALAKDAV CELFKGDIPT EHITRLLSIG LLGQERKLVP TRWAITATDD MVGKDITDRV
IDLPLISEIS VFSGGCFGNY FEILMTPRRY SYELLEIWMK SSVWSGDSSW IGQDMEDING
KKGYSNLAGG YYAARIAALE HLEKIQRQAS VFMIREITPE YWAPLGVWVV REAARNALSS
IPRTFETIEE ALDDMATRVR TPSKQWKAKA KMLSDIRFQR TLDSFF