Gene Mbur_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_1037 
Symbol 
ID3998777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp1119253 
End bp1120365 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content47% 
IMG OID637958813 
Producthypothetical protein 
Protein accessionYP_565722 
Protein GI91773030 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR03282] putative methanogenesis marker 13 metalloprotein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00782668 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAGA ATATTACGAT CATACACCCA CGACCAAGTT CCATCGTGGC CGCATTGTAC 
ACTTTAAGGG ACCTGAATGT CGATGTTGCA GTACTGCACG GACCACCTGG ATGTTCTTTC
AAGCATGCGA GATTGTTGGA AGAGGACGGC ATACATGTAG TTACAACTGC CCTTGATGAA
ACCGGTTTCG TTTTTGGAGG GCATGATGCA CTCGTGAATG TGCTCCATAA AGTGAACGAG
ATGTTCAAAC CGAAGCTAAT CGGTGTTGTG GGCACCTGTG CCAGCATGAT AATCGGAGAG
GAAATGCATG AACCGGTCAT GGAAGCAGAC CTTGATGTGC CGGTGATAGA AGTGGAAGTG
CACGCAGGTT ACAGGAACAA TACAAAAGGT GTGATCATTG CACTTGAATC CGCACTCGAT
GTAGGTGTTA TTGACAAGAC AGAGTTCGAA AGACAGCGTG CCCTGCTCGA AGAAGCGACC
AATGTCGAAC TAAAACATGG TGCTGCAAGC CGGGAATATC TTGCGCCTTC ACGCGGCGAT
GTGAAATATA AGGTTGCACA GAGGATAATC GAGCTGCTCA AGGAAGGTAA GCGCGGACTT
GTCATCATGA ACGCCAAAAA AGAGACAGGA TATATGTTCG CAGACATCAC AGTTGCGATC
AACGAAGTAG CTGAGCAGCT TGGCAAAGCA GACAATATTA TCAATATGGC AAATATCGAT
GAGAAGCTGG GACTTCCAAG AGTTCGCCAC CATGCAGAAT GCATCGCGAA CGACCTGAAG
GAAAGGGACG TTGTCATCCA CGAGAACATT GGCGGACTTG ACGAGTATCC TATTGCAGGG
AATGCTGTTG ACCAGCTGAT AAAGGACAAG TACATAGACT TCGATTTTGC CGTGATAAGC
GGGGTCCCGC ATGCAATACC AATGGACCAC ATCTCCAATA TGGAACTGAT ATCCGTTACC
AACGGACCAA GACAGGTATT ACCCCTTAAG GAAATGGGAC ACGAACATGT GATCGTCGAA
ATAGACCTGC ATCCAAAGAC ACTCGGTGTC AACCACATCG TAGAATCCGA GTTCGGTGCA
ACACTGAGAG AAGTCGCAAA AGAATCATTA TAA
 
Protein sequence
MDKNITIIHP RPSSIVAALY TLRDLNVDVA VLHGPPGCSF KHARLLEEDG IHVVTTALDE 
TGFVFGGHDA LVNVLHKVNE MFKPKLIGVV GTCASMIIGE EMHEPVMEAD LDVPVIEVEV
HAGYRNNTKG VIIALESALD VGVIDKTEFE RQRALLEEAT NVELKHGAAS REYLAPSRGD
VKYKVAQRII ELLKEGKRGL VIMNAKKETG YMFADITVAI NEVAEQLGKA DNIINMANID
EKLGLPRVRH HAECIANDLK ERDVVIHENI GGLDEYPIAG NAVDQLIKDK YIDFDFAVIS
GVPHAIPMDH ISNMELISVT NGPRQVLPLK EMGHEHVIVE IDLHPKTLGV NHIVESEFGA
TLREVAKESL