Gene Mboo_0254 details


Gene Information

Locus tag: Mboo_0254
Symbol:
ID: 5410971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 237066
End bp: 238265
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 58%
IMG OID: 640867470
Product: nucleotidyl transferase
Protein accession: YP_001403419
Protein GI: 154149801
COG category: [J] Translation, ribosomal structure and biogenesis
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
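
The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a minimal Python sketch. It assumes that the start and end coordinates are inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # both end points are counted
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp copied from the record above.
print(cds_lengths(237066, 238265))  # (1200, 399)
```

The result agrees with the Gene Length (1200 bp) and Protein Length (399 aa) fields listed in the record.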


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATGCG TTGTGCTGGC AGCGGGGGAG GGAAAACGCA TGCGTCCCCT TACTGCCCGG 
CGACCGAAAG TGATGCTCCC GGTGGCAAAC CGCCCGATGA TGGAGCATCT TGTACTTGCG
GCCCGGGATG CCGGCATCAC CGAATTTGTC TTTGTGGTCG GGTATGGGGA GCGTGAGGTC
CGGAACCATT TCGGAAACGG GGAACGCTTC GGGATCCAGG TGGCATATGC ACCGCAGCGG
CAACAGTCGG GAACCGCAGA TGCCCTCCGC TCGGCACAGG ACCTTGTCAC AGGCCCGTTC
CTTGCAATGA ACGGGGACAT GATCCTCTCC TCTGCCGACA TAGCCCGGAT GATCGATGCA
CCGGCGCCTG CCATGGGAAC GAGCACCACC GACCATCCCG GGGACTTCGG AGTCGTACTC
GTGGAAGACG GCCGGGTCCT CTCATTAGAG GAGAAATCGA AACACCCGAA ATCCAACATC
ATCAATGCCG GGGCGTATTC CTTTACCCCG GAGATCTTTG AGCTGCTCGC CGGGATCAGG
CTCTCGGAGC GGGGCGAACT CGAGCTCACT GATGCTCTTG GCATCCTTAT CGCACGGCAT
GATCTGGGAG CAGTCCCCCT CTCAACATGG AGGGATATAG GATATCCCTG GGACCTGCTC
GATGCAAATG CTGCCCTCCT TTTGGGGCTC AATTCTCAGA ACGAGGGCAT CGTTGAGGAG
GGTGTCCATC TCTTGGGCCC TGTGGCAGTC GGTGAAGGCA CTGTGATAAA ATCGGGCACA
TACATCGAGG GGCCCTGCAT CATAGGAAAG AACTGCCGGA TCGGGCCGCA TGCCTATATC
AGGGGAGCCA CGAGCATCGG CGACGAAAGC CACATCGGGC ACTGCACCGA GATCAAGAAC
ACGGTTGTCA TGGCAAGGAC CAAGATTCCC CACTTCAACT ATATCGGTGA TTCGGTGATC
GGCAGCGGGT GTAATTTCGG TGCAGGGACC AAGATTGCAA ATCTCAGGCA CGATCATGGC
CCGGTAAAGG CCGGTGGGAA GGATACCCGG CACACCAAAT TTGGCGCGGT TGTCGGGGAC
AACGTGCACT TTGGGATCAA CTGTTCGGTC AATGTCGGAT CGGTGATCGG CAGCAATGCA
CAGTTCGCGC CCAACTCGGT TATCGAAGGG AGTTTTGGCG AGGACGCGGC GATCCGGTAG
 
Protein sequence
MQCVVLAAGE GKRMRPLTAR RPKVMLPVAN RPMMEHLVLA ARDAGITEFV FVVGYGEREV 
RNHFGNGERF GIQVAYAPQR QQSGTADALR SAQDLVTGPF LAMNGDMILS SADIARMIDA
PAPAMGTSTT DHPGDFGVVL VEDGRVLSLE EKSKHPKSNI INAGAYSFTP EIFELLAGIR
LSERGELELT DALGILIARH DLGAVPLSTW RDIGYPWDLL DANAALLLGL NSQNEGIVEE
GVHLLGPVAV GEGTVIKSGT YIEGPCIIGK NCRIGPHAYI RGATSIGDES HIGHCTEIKN
TVVMARTKIP HFNYIGDSVI GSGCNFGAGT KIANLRHDHG PVKAGGKDTR HTKFGAVVGD
NVHFGINCSV NVGSVIGSNA QFAPNSVIEG SFGEDAAIR
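
The gene and protein sequences can be cross-checked against each other by translating the nucleotide sequence. The following is a minimal Python sketch, not the database's own method. It assumes the sequences are pasted as printed, in whitespace-separated blocks of 10. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code, so the sketch uses the standard amino-acid string; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene starts with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon-table order: codons enumerated as TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGCAATGCG TTGTGCTGGC AGCGGGGGAG GGAAAACGCA TGCGTCCCCT TACTGCCCGG"
print(translate(first_row))  # MQCVVLAAGEGKRMRPLTAR
```

The 20 residues printed match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.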