Gene Mbur_1040 details


Gene Information

Locus tag: Mbur_1040
Symbol:
ID: 3998780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcoides burtonii DSM 6242
Kingdom: Archaea
Replicon accession: NC_007955
Strand:
Start bp: 1122657
End bp: 1123712
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 46%
IMG OID: 637958816
Product: tRNA splicing endonuclease
Protein accession: YP_565725
Protein GI: 91773033
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1676] tRNA splicing endonuclease
TIGRFAM ID: [TIGR00324] tRNA intron endonuclease
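
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1056 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1122657, 1123712)
print(length)                     # 1056
print(protein_length_aa(length))  # 351
```

Both values match the Gene Length and Protein Length fields of this record.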


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000280827
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGCAG AGATAGTAAA GGACCGTGTA CTTGTAGAAA AAAAAGCTAT TAATGAGTTT 
TACAATAATG GGTACTACGG CAGGCCTAAA TCGAGCGGGC TTGAGCTCAC ACTTATCGAA
GCTGTATATC TGGCATTCCG CGGGAAGATA GAGGTGGAAC ATGAAGGAAA GGTCCTGGAG
TTTTCCGATC TTTTCAAGGA AGCTTCCATC TTGCAGCCCT CCTTCGAGCT TAAATATATC
GTTTACAAGG ACCTGAGAGA ACGAGGGTTC TACGTACAAC CCGGTGTGAC CGATTTCCGC
GTATACCCAC GTGGCAGCCA TCCCGGAAAG GGAGCGGCAA AGCAGTTCAT CTATGTAAGG
TCAGAAAGAG CACCAATGCC ACTGAGGGAC CTCTTGCGTT CCCTTGCAGC AGCCGAAAAC
GTTAGAAAGC AGATGGTACT CGCCATTGTA GATGAAGAGA GTGACATTAC TTTCTACGAT
GTGAAAAGGC CACGCTTAAA AGGCGAGATG AAGGAACCCC TTTACCCAGA CATCAATGCA
GATGCCACTT TCCTTGAGGA CAGGGTTGTC GTATGGGATG AGGAAGCTTC GAAAACCCTT
TTTGAGAATG GCTTTTACGG GAAACCATTG GATAGCCAGA GATTGCAGCT TTCACTTGTT
GAGTCCCGAT ATCTCCTTGA GAAGGGTGTC CTCAATATCA ACAACAGACA GGATGAATCC
ATGGATGTGG ATGCTTTTTC AAAGATGGCT TCGGAGATTG AACCCGAGTT CAATCTGAAG
AGCAGTGTTT ACACAGATCT TCGAGATAAA GGGGTCGTAC CAAAGACAGG TTTCAAGTTC
GGTAGTCATT TCCGTGTTTA TTCACAGGTG GAATCACCAA CAAAGATACC GCATTCCGAA
TATCTCATAC ATTCGATACC AATGGACCAT GAATTTACAC TCCCTGTCAT GTCAAGGGCC
ATAAGGCTTG CCAACAGTGT AAGAAAGAGG ATGCTTTATG CGATCCTCAC AGATGATGGT
GTCGATTACA TCGATATTGG CAGATTAAAG ATGTGA
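
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any analysis. As a minimal sketch, the helpers below strip the whitespace and compute the G+C fraction. The names `clean_sequence` and `gc_fraction` are hypothetical, and the exact method the source database used to arrive at its GC content figure is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence: three blocks of 10 bases plus one of 6.
last_line = "GTCGATTACA TCGATATTGG CAGATTAAAG ATGTGA"
print(len(clean_sequence(last_line)))  # 36
```

Passing the full 18-line sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared against the GC content field.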
 
Protein sequence
MRAEIVKDRV LVEKKAINEF YNNGYYGRPK SSGLELTLIE AVYLAFRGKI EVEHEGKVLE 
FSDLFKEASI LQPSFELKYI VYKDLRERGF YVQPGVTDFR VYPRGSHPGK GAAKQFIYVR
SERAPMPLRD LLRSLAAAEN VRKQMVLAIV DEESDITFYD VKRPRLKGEM KEPLYPDINA
DATFLEDRVV VWDEEASKTL FENGFYGKPL DSQRLQLSLV ESRYLLEKGV LNINNRQDES
MDVDAFSKMA SEIEPEFNLK SSVYTDLRDK GVVPKTGFKF GSHFRVYSQV ESPTKIPHSE
YLIHSIPMDH EFTLPVMSRA IRLANSVRKR MLYAILTDDG VDYIDIGRLK M
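
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G). Translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The function name `translate` is hypothetical, and the sketch ignores alternative start codons and ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGAGAGCAG AGATAGTAAA GGACCGTGTA"))          # MRAEIVKDRV
print(translate("GTCGATTACA TCGATATTGG CAGATTAAAG ATGTGA"))   # VDYIDIGRLKM
```

These fragments match the first block and the final 11 residues of the protein sequence. The same function can be applied to the full gene sequence to compare against all 351 residues.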