Gene Mboo_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2003 
Symbol 
ID5410427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2071432 
End bp2072784 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content53% 
IMG OID640869245 
Productnitrogenase 
Protein accessionYP_001405160 
Protein GI154151542 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.577427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.745673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAG CAGTACAAGC AAAAGTATCC TCCGCTGTGG ATCTCCCGGC ATCTGATGCC 
CTGGAAGCTC CGCGGTTCTC CTGCGCCCTT GCGGGAGCGT ACTCAACCGC GGTTGGTCTG
TATGGGGCGG TTCCGATCCT TCATTCAGGA GCGGGTTGCG GAATTGGCCA GCTTTTCGGT
CAGTTCTATG GCGGGGGGCA GAATGCCGGA GGCCCGGTAG GAGGGACTTC CACACCCTGC
TCCTCCCTTA TTGAGGAGCA TGTCATCTTC GGTGGGGAGA AGAAACTCCG GAACCTGATA
GAATCCAGTG TTGAACTCGT AAACGGGCAG TTTTACGCGG TAATCTCGGG ATGTATCCCT
TCACTCATTG GAGATGACAC TGAATCCGTG GTAAAGGAAT TCCGTGATAA GGTGCCAATC
ATCTCTATAA AGGCTGCCGG ATTCTCAGGA AATTCCTACC AGGGATACGA GCTGTTTTTT
GAATCGCTCA TTGACCAGTT CCTCACACCC CAGACAAAAG TGAAAGGGCA GGTCAATATC
TTTGGCATAG TGCCGTACCA GCACCTGTTC TGGAAAGGAG ATCTGGACGT TATCAGGGAA
CTCCTTGAAT CAATCGGCCT GAAACCCAAT ATCATCTTTA CCCGGTTCGA TTCCGTTGAA
GCCCTCAAAA GCATTCCCGC GGCGGAATAC AATATTGTGC TCTCGCCCTG GGTAGGGCAC
GAAACCGCAG AGAAACTCAA GGAGAAGTTC AAAACCCCGT TCATTGCATT CCCCGGGGTC
CCCATCGGTC CCAAGCAGAG CTCGGAATTC CTCCGTGTTG TTGGTAAGAA ACTAAGGGTA
CCGGCAAAGA CCGTCGAGAA GGTTATTGAA GAGCGGGAGA AGCGGGCATA CCGGTTCACC
GAGTACCTGG GGGATATCCT CACTCTTGTC CGGCCCCATG CGTATTTCTC GGTCGTTGCA
GACAGCAGTA CCGCAATCAG TATCACGAAA TTCCTGGCAA ACGAACTCGG GTATCTCCCG
GAGATCGTGC AGATCGCAGA CGATCCCCAT GAGGATAAGC GCGATCTCAT CCGCAGGGAA
CTGCTCGAGA CCCTCGAAAC ACCGGTGAAA CCGGAGATCG TCTTTGAGGT GGATTCCTAC
CGTATCAGGC AGAATCTGAA AAACCGGAAT TCACTCTTCC TGTTTGCGAG TTCCCTGGAA
TCTCCGATCT CCCTTGCCGA ATTTGGGGCA ATCCCCCTGA CCGTGGCATT CCCCTCATTT
GACCGGCTCA TCCTGGAGCA CTCCTATGCC GGGTATAAAG GAGGGCTCTT CCTTGCAGAG
GACTTCATGA GCAAATTTGC CGGGCCCCTG TGA
 
Protein sequence
MTKAVQAKVS SAVDLPASDA LEAPRFSCAL AGAYSTAVGL YGAVPILHSG AGCGIGQLFG 
QFYGGGQNAG GPVGGTSTPC SSLIEEHVIF GGEKKLRNLI ESSVELVNGQ FYAVISGCIP
SLIGDDTESV VKEFRDKVPI ISIKAAGFSG NSYQGYELFF ESLIDQFLTP QTKVKGQVNI
FGIVPYQHLF WKGDLDVIRE LLESIGLKPN IIFTRFDSVE ALKSIPAAEY NIVLSPWVGH
ETAEKLKEKF KTPFIAFPGV PIGPKQSSEF LRVVGKKLRV PAKTVEKVIE EREKRAYRFT
EYLGDILTLV RPHAYFSVVA DSSTAISITK FLANELGYLP EIVQIADDPH EDKRDLIRRE
LLETLETPVK PEIVFEVDSY RIRQNLKNRN SLFLFASSLE SPISLAEFGA IPLTVAFPSF
DRLILEHSYA GYKGGLFLAE DFMSKFAGPL