Gene Mbar_A2278 details


Gene Information

Locus tag: Mbar_A2278
Symbol:
ID: 3625130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 2887882
End bp: 2889279
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 41%
IMG OID: 637701151
Product: nitrogenase, subunit alpha
Protein accession: YP_305783
Protein GI: 73669768
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01284] nitrogenase alpha chain
            [TIGR01860] nitrogenase vanadium-iron protein, alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
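
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2887882
end_bp = 2889279
gene_length_bp = 1398
protein_length_aa = 465

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons = span // 3

assert span == gene_length_bp
assert span % 3 == 0
# Assumption: the final codon is a stop codon and is not counted in the protein length.
assert codons - 1 == protein_length_aa

print(span, codons)  # 1398 466
```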


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000740744
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.119504
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTAA AACTATTCTG TTGCGATGAA TGCATACCTG AGCGCCAAAA CCATGTTTAC 
ATAAAAGAAG AAGGAGAAGA CACAACTCAA TATCTCCCAC TCTCAAATAT AGAAACAATT
CCAGGATCAT TATCTGAGAG AGGGTGCAGC TATTGTGGAG CAAAACTCGT TATTGGCGGA
GTCATCAAAG ACTGTATTCA GATGATACAT GGACCGGTAG GATGTGCTTA TGATACCTGG
CACACGAAAA GGTATCCCAG CGATAATGAC AATTTTCAAT TAAAATATGT TTGGTCGTCG
GACACAAAAG AAAAACATAT TGTTTTCGGA GCTGAGAAGC AGCTCAAAAA AGCGATCAAG
GAAGCTTTCA AAGAATTTCC AGAAATCAAG CGAATGTTTG TCTACACGAC CTGTACAACC
GCATTGATAG GAGACGATCC TAAAGCAGTA TGTCGTGAGG TTGAGGAAGA GCTTGGAGAT
GTAGATATAT TCGTTGTCGA ATGTCCAGGA TTCGCTGGAG TCAGTCAATC AAAAGGACAT
CATGAGCTGA ACATCGGCTG GATGAGAGAT AAGATTGGAA CGCTTGAACC TGAAATTAAA
AGCGAATACA CAATTAATGT CATTGGTGAC TACAATATTC AGGGAGATAC TTACGTATTA
CAAAAATATT TTGATAAAAT GGGCATACAG GTCATTGCAC ACTTTACAGG AAATGTAACC
TATGATCAAC TACGCTGTAT GCATAGGGCA AAGCTGAATG TGGTCAACTG CGCGCGTTCT
GCAGGATATA TAGCCAACGA ACTTAAGAGA GTATATGATA TTCCAAGAAT GGATGTTGAT
ACCTGGGGTT TTGAATATGT CAAGGTAGCA CTGAGAAAAA TTGGAGCTTT CTTTGGATTG
GAAGACAAAG CTGAAGAAGT AATTGCAGAA GAGGTTGCAA AATACGAAGG AAAACTTAAC
TGGTATAAGG AACGGCTCAA AGGAAAAAAG GTCTGTATCT GGACTGGTGG GCCAAGACTA
TGGCACTGGA CAAAGGCTCT TGAAGACGAT TTAGGTATGG AAGTTGTTGC AATGTCTTCT
AAATTTGGTC ATCAGGAAGA CTTTGAGAAG GTTATTGCCA GGGGAAGAGT CGGGACGATT
TATATTGATG ACGGAAATGA ACTGGAGTTT TTCGAAGTAC TCGATAATAT TCACGCCGAT
ATTATTTTTA CCGGGCCCAG AGTTGGAGAC TTAGTCAAAA AACTGCACAT TCCATACATT
AACGGACATG CATATCACAA CGGTCCATAC ATGGGCTTTG AAGGCGCAGT AAACATGGCG
AGAGATATGT ATAACGGAAT TTATTCTCCG ATGTGGAGTT TAGCTGGAAA AGATCCGAGA
GTGGTGCAGG AATTATGA
 
Protein sequence
MPLKLFCCDE CIPERQNHVY IKEEGEDTTQ YLPLSNIETI PGSLSERGCS YCGAKLVIGG 
VIKDCIQMIH GPVGCAYDTW HTKRYPSDND NFQLKYVWSS DTKEKHIVFG AEKQLKKAIK
EAFKEFPEIK RMFVYTTCTT ALIGDDPKAV CREVEEELGD VDIFVVECPG FAGVSQSKGH
HELNIGWMRD KIGTLEPEIK SEYTINVIGD YNIQGDTYVL QKYFDKMGIQ VIAHFTGNVT
YDQLRCMHRA KLNVVNCARS AGYIANELKR VYDIPRMDVD TWGFEYVKVA LRKIGAFFGL
EDKAEEVIAE EVAKYEGKLN WYKERLKGKK VCIWTGGPRL WHWTKALEDD LGMEVVAMSS
KFGHQEDFEK VIARGRVGTI YIDDGNELEF FEVLDNIHAD IIFTGPRVGD LVKKLHIPYI
NGHAYHNGPY MGFEGAVNMA RDMYNGIYSP MWSLAGKDPR VVQEL
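
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first and last lines of the gene sequence shown above. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may serve as start codons), and that the space-separated blocks of ten are purely cosmetic. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and `gc_percent` shows one common way a GC figure could be computed, not necessarily the method used for this record.

```python
# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 shares the standard code's codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
# The 23 lines in between hold 60 bases each, so the last line starts in frame.
first_line = "ATGCCATTAA AACTATTCTG TTGCGATGAA TGCATACCTG AGCGCCAAAA CCATGTTTAC"
last_line = "GTGGTGCAGG AATTATGA"

print(translate(first_line))  # MPLKLFCCDECIPERQNHVY
print(translate(last_line))   # VVQEL
```

The two outputs match the first twenty and last five residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.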