Gene Mbar_A1553 details


Gene Information

Locus tag: Mbar_A1553
Symbol:
ID: 3625970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 1913615
End bp: 1915207
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 43%
IMG OID: 637700434
Product: nitrogenase, subunit D
Protein accession: YP_305080
Protein GI: 73669065
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01284] nitrogenase alpha chain
            [TIGR01861] nitrogenase iron-iron protein, alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
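
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS that ends with one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is the stop codon


length = gene_length_bp(1913615, 1915207)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```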


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATATC ACACGTTTAA GTGCAGCGAA TGTATTCCTG AAAGGGCTAT GCATGCTGTT 
ATAAAAGGTC CAGGTGAAGA TTTGACGTCC TGCCTTCCCC TTGGATATCT CAACACGATC
CCTGGGACGA TTTCAGAACG TGGATGCGCT TTCTGCGGTG CAAAACATGT TATAGGTGCA
CCTATGAAAG ATGTCATCCA TCTGTGCCAT GGCCCAGTTG GATGTACCTA TGATACCTGG
CATACTAAGC GTTATATTAG CGATAACGAC AACTTCCAGC TTAAATATGC CTGGACGACA
GATATGAAGG AAAAGAACGT CGTATTCGGC GCTGAAAAAC TGCTCAAACA AAATATTATT
GATTGTTTCA AGGCATTTCC GCATATCAAA AGAATGAGTA TCTACCAAAC TTGCGCTTCA
GCACTTATTG GAGACGATAT AAATGCAGTT GCGAAAAAAG TGATGGAAGA AATGCCAGAT
GTTGATATCT TTGTCTGCAA TGCTCCTGGT TTTGGGGGAC CTAGCCAGTC GGGAGGACAT
CACAAAATCA ATATTGCTTG GGTCGATCAA AAAGTAGGAA CATTTGAACC TGAAATCAAA
AGCAAATACG TCATCAATTA TGTTGGTGAT TATAATATCC AGGGAGATGC GGAAATTATA
GTGGATTATT TCCAGAGAAT GGGGATTCAG GTTCTTTCCA CCTTTACTGG GAACGGATCC
TATGACGACC TTAGGGGTAT GCATCTGGCC CATCTCAATG TACTAGAATG TGCGCGTTCT
GCAGAATACA TCTGTAACGA ACTAAGAAAA AGATACGGAA CTCCACGTCT TGATATCGAT
GGATATGGTT TTGAACCGCT CTCAGCATCA CTAATGAAAG TGGCTATGTT TTTCGGAATT
GAAAAAGAAG CCCAGGATAT TATAGACGAA GAAATTGCTA GATGGAAACC GGAACTTGAC
TGGTATGCTA AACGTCTGAA AGGAAAAAGA ATTTGTCTCT GGCCTGGCGG CTCCAAACTC
TGGCATTGGG CACATGTAAT TGAAGAAGAA ATGGGAGTTA AAGTTGTCTC AGTGTATTCA
AAATTCGGTC ATCAGGGAGA CTTCGAAAAA GGCGTTGCTC GGTGCAGTGA AGGAGCACTT
GCTATTGATG ATCCTAATGA ACTTGAAGGG ATTGAAGCTA TGGAGATATT AAAACCCGAT
TGTGTCCTTA CAGGTGTCCG TCCGGGAGAG GTTTCCAAAA AGATGAGGAT CCAATATCTC
AATATTCATG GATATCACAA CGGTCCATAT AAAGGGTTTG AAGGATGGGT CAGGCTTGCA
AGGGATCTCT ACAATGCCAT CTATTCACCG ATTCATCAGC TTTCTGGTTT GAATATCAGT
AAGGATGAGA TCCCCACTGA TAAAGGATTC GTGACTAGGA AGATGATTTC TGATGTGAAT
ATCATTGAGG ATGGGAAAAC TCCAATCGAG GAAAGGCCAT ACACCGGTGA ATGTGATATT
GTTACAAGAC TACGCGGAAA AAAATATCCC AAGCTTGAAC CACAGCAGCC GCTTGGCATG
GTAATGGAAG GAGGTGAGGC CATTAATGGA TGA
 
Protein sequence
MPYHTFKCSE CIPERAMHAV IKGPGEDLTS CLPLGYLNTI PGTISERGCA FCGAKHVIGA 
PMKDVIHLCH GPVGCTYDTW HTKRYISDND NFQLKYAWTT DMKEKNVVFG AEKLLKQNII
DCFKAFPHIK RMSIYQTCAS ALIGDDINAV AKKVMEEMPD VDIFVCNAPG FGGPSQSGGH
HKINIAWVDQ KVGTFEPEIK SKYVINYVGD YNIQGDAEII VDYFQRMGIQ VLSTFTGNGS
YDDLRGMHLA HLNVLECARS AEYICNELRK RYGTPRLDID GYGFEPLSAS LMKVAMFFGI
EKEAQDIIDE EIARWKPELD WYAKRLKGKR ICLWPGGSKL WHWAHVIEEE MGVKVVSVYS
KFGHQGDFEK GVARCSEGAL AIDDPNELEG IEAMEILKPD CVLTGVRPGE VSKKMRIQYL
NIHGYHNGPY KGFEGWVRLA RDLYNAIYSP IHQLSGLNIS KDEIPTDKGF VTRKMISDVN
IIEDGKTPIE ERPYTGECDI VTRLRGKKYP KLEPQQPLGM VMEGGEAING
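
The sequences above are printed in space-separated groups of ten, so they need to be joined before any analysis. The following is a minimal sketch of how one might strip the grouping, compute a GC fraction, and translate the gene into protein. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard code (the two tables differ in their permitted start codons), and the helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_groups(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = (
    "ATGCCATATC ACACGTTTAA GTGCAGCGAA TGTATTCCTG AAAGGGCTAT GCATGCTGTT"
)
print(translate(strip_groups(first_line)))  # MPYHTFKCSECIPERAMHAV
```

Passing the full gene sequence through `strip_groups` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same string can be compared with the GC content field.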