Gene Mbur_0333 details


Gene Information

Locus tag: Mbur_0333
Symbol:
ID: 3997622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcoides burtonii DSM 6242
Kingdom: Archaea
Replicon accession: NC_007955
Strand:
Start bp: 313510
End bp: 314436
Gene Length: 927 bp
Protein Length: 308 aa
Translation table: 11
GC content: 35%
IMG OID: 637958158
Product: cell surface glycoprotein (s-layer protein)
Protein accession: YP_565079
Protein GI: 91772387
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3420] Nitrous oxidase accessory protein
TIGRFAM ID: [TIGR03024] PEF-C-terminal archaeal protein sorting domain
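
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(313510, 314436)
print(length)                     # 927
print(protein_length_aa(length))  # 308
```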


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0000130039
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATTA ATTATAAAAC CAATGGAATT ATAAAAATAA TTCTATTGCC ATTATTCATG 
GCTATTCTAA TGACACAGGC GAGTGCATCT ATCCTTGAGG TGGGGGAAGG GCAAGAGTAT
TCACATATCC AGGATGCTGT GAACAATGCA AATGAGGGTG ACAAGGTCAT TGTTCACAGT
GGTATTTATG AAGAAAATGT TATTCTGGAC AAACAAATAA TATTGCAGGG AGTGGGTAGT
CCTATCATAG ATGGAATGGG AGTGGGTAAT TCCCTTAGCC TTTATGCAGA AAGCTTAGTT
GTAGATGGTT TTGTTCTTTG TAATGGGAGA AGTGGTTCAT ATGTTGTATC AGACAACAAT
ATTCTAACGA ACAATACTTT TAAAGGCAAT CAATATGGGG TATATCTGTT TGGATCAAAA
GGAAATGTTA TTGAACAAAA CGTTATCGAG CACAACCAAA GATATGGGGT GTATTTGCTT
TTTAAGAGTG ATAATAATAT CATAACCGAT AATATGATCA ACAATAATGG TGGCGGGATA
CGAATTATTT CCTCTGATGA TAATAAGTTG TATCTGAACA GCATCATTGA GAATGTCGTG
ATCTCCAATG GAAATAATCA ATGGGATGAC GGTGTGGATA AAGGAAATCA TTATAGTTTC
TTCGATGAAG AAAGTGAAGG TTTCATTGAT AAAGATCATG ATGATGTATC AGATGTTCCT
TACAAGATAC CTGTAAAAAA TGAAGTTGAC AACTATCCTC TTGCAAGCAT AGGATCAACA
CGACCAACAA TAGTTTTAGT AAAAGAGAAC CCTATAGAGC CTTCAAAGGA AACTTCTGAA
GAGATCCCTG AATTTCCAAC AGTAGCATTT CCCATATTGC TTTTAATGGG AATATTTGTT
GTGTTCAATA AGAAAACGAA TTCATGA
 
Protein sequence
MSINYKTNGI IKIILLPLFM AILMTQASAS ILEVGEGQEY SHIQDAVNNA NEGDKVIVHS 
GIYEENVILD KQIILQGVGS PIIDGMGVGN SLSLYAESLV VDGFVLCNGR SGSYVVSDNN
ILTNNTFKGN QYGVYLFGSK GNVIEQNVIE HNQRYGVYLL FKSDNNIITD NMINNNGGGI
RIISSDDNKL YLNSIIENVV ISNGNNQWDD GVDKGNHYSF FDEESEGFID KDHDDVSDVP
YKIPVKNEVD NYPLASIGST RPTIVLVKEN PIEPSKETSE EIPEFPTVAF PILLLMGIFV
VFNKKTNS
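
The sequences above are printed in space-separated blocks of 10, so the whitespace has to be removed before computing anything from them. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be derived from the gene sequence. The helper names are hypothetical, and the record does not say how its own value was computed or rounded.

```python
def clean(seq_text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGAGTATTA ATTATAAAAC CAATGGAATT ATAAAAATAA TTCTATTGCC ATTATTCATG"
print(len(clean(first_line)))  # 60 bases once the spaces are removed
```

To check the whole gene, paste all lines of the gene sequence into one string and pass it to `gc_percent`; `len(clean(...))` on the same string should match the Gene Length field.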