Gene Mboo_1145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1145 
Symbol 
ID5411592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1146426 
End bp1148066 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content54% 
IMG OID640868371 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_001404306 
Protein GI154150688 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.990957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGAA TGGATACAAC ACTGGAAGAG CTCTACACGC AGTATCCGGA TACGGTGAAA 
AAAAACCGGA AGAAACATAT TGTCATCAAA TCGGAGGCAG ATGTTGCCTG CCCCCAGATT
GAGGCAAACA CGCGTACGAT CCCGGGTATC ATATCCCAGC GGGGCTGCGC CTATGCCGGC
TGCAAAGGCG TGGTAGTCGC ACCGATCAAG GATATGATCA CGATCACCCA CGGCCCGGTG
GGCTGCGGTT ATTACAGCTG GGGAACGCGA CGGAACAAGG CACGGGCAGA CAGCACAACG
CCAAAAGACA AGATCTATTC GCAGCTCTGC TTTACCACCG ACATGCAGGA GCCGGATATT
GTCTTTGGCG GGGACAAGAA ACTCGCCAGG ATGATAGACG AGATTGTTGC TGCATTCCAT
CCCCGGGGCA TCAACATCTG TTCGACCTGC CCTATCGGGC TTATCGGCGA TGATATCGGT
GCTGTTGCAA AAGCGGCGAC AGAACGCCAC GGGATCCAGG TGCTTGCGTA CGCCTGCGAA
GGTTACAAAG GAGTCAGTCA ATCGGCAGGC CACCACATTG CCAACAACAC CATCATGCAG
AACGTGATCG GGAAAGGAAA TGAGAGAAAA CCCGGTAAAC ACGTCATAAA CGTCCTTGGC
GAGTACAACA TCGGCGGCGA TGGCTGGGAG CAGGAGCGCA TCTTAAAAGA CTGCGGCTAC
ACGGTCAACT GCGTCATGAC CGGCGATGCA AGTTACGAGG ACATCAAAAA CCTGCACCTT
GCAGACTTAA ACCTCGTCCA GTGCCACCGC TCGATCAACT ACATCGCCGA GATGATGGAG
ACCAAGTACG GGACGCCGTG GCTCAAGGTC AACTTTGTCG GTGTCGAAGC GACAAACGAA
TCCCTGCGCC AGATGGCTAA ATGTTTCAGT GACCCCGAAC TGGAAAAACA GACCGAGGAG
GTCATTGCAC GGGAGACCGC CCGGATCGAA CCGGCGCTCG AACAGTTCCG GAAGATCTGC
CGGGGCAAAA AAGCGTTTGC CTTTGTCGGC GGGTCCCGCA GCCACCACTA CCAGCATCTG
TTAAAGGATC TCGGGATGGA GGTCGTTGTT GCCGGCTATG AATTTGCCCA CCGCGACGAT
TACGAAGGCC GGCAGGTCAT CCCGACCATC AAAAGCGATG CGGATTCCAA AAACATTCCC
GAACTTCACC TCAAAGCCGA TGAGAAGCTG TACCGGGAAG GAAACGAGTA CCTGAATCTC
TCAAAAGAGC AGTTTGAGGC CTTAAAAAAA GAGGTTCCGC TCAACTACTA CGAAGGGATG
TACCCTGATA TGAAAAACGG GGACATCATG ATCGACGACT GCAACCACTA CGAGCTTGAG
GAGTTAATTA AAAAACTCAG GCCCGATCTG ATCTTCACCG GCGTCCGCGA CAAGTACATT
GCAGAGAAAA TGGGAATTCC GGCAAAACAG ATGCACTCGT ACGATTATGC CGGGCCCTAT
GCGGGATACA ACGGCGCGAT CAACTTCGCA AACGATGTTG CCCATACGCT GACAACGCCG
GCCTGGAAGA TGATTGTGCC CCCCTGGGAA CGAATCGAAG AACCCGATAC AGGAAAACAG
GAAGGTGCGA ACGATGCTTG A
 
Protein sequence
MDGMDTTLEE LYTQYPDTVK KNRKKHIVIK SEADVACPQI EANTRTIPGI ISQRGCAYAG 
CKGVVVAPIK DMITITHGPV GCGYYSWGTR RNKARADSTT PKDKIYSQLC FTTDMQEPDI
VFGGDKKLAR MIDEIVAAFH PRGINICSTC PIGLIGDDIG AVAKAATERH GIQVLAYACE
GYKGVSQSAG HHIANNTIMQ NVIGKGNERK PGKHVINVLG EYNIGGDGWE QERILKDCGY
TVNCVMTGDA SYEDIKNLHL ADLNLVQCHR SINYIAEMME TKYGTPWLKV NFVGVEATNE
SLRQMAKCFS DPELEKQTEE VIARETARIE PALEQFRKIC RGKKAFAFVG GSRSHHYQHL
LKDLGMEVVV AGYEFAHRDD YEGRQVIPTI KSDADSKNIP ELHLKADEKL YREGNEYLNL
SKEQFEALKK EVPLNYYEGM YPDMKNGDIM IDDCNHYELE ELIKKLRPDL IFTGVRDKYI
AEKMGIPAKQ MHSYDYAGPY AGYNGAINFA NDVAHTLTTP AWKMIVPPWE RIEEPDTGKQ
EGANDA