Gene Anae109_3029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3029 
Symbol 
ID5374497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp3528191 
End bp3529642 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content66% 
IMG OID640844554 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_001380210 
Protein GI153005885 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCCCC GTACGAACGT GAAGAAGGTC GACGGCATCA CGAAGGAGTC GACCCAGGCG 
ATGATCGACC GGACGCTCGA GGCGTACCCG GAGAAGGGCA AGAAGAAGCG CGCGCCGCAC
CTCGCCCCCA ACGATCAGGC GTCCGCCAGC GCCTGCGTGA AGTCGAACCG CAAGACGGTC
CCCGGGGTCA TGAGCGCCCG CGGCTGCGCC TACGCCGGCG CCAAGGGCGT GGTGTGGGGG
CCGATCCGGG ACATGGTGCA CGTCTCCCAC GGCCCGGTCG GCTGCGGCTG GTACTCGTGG
GGCACGCGCC GGAACCTCAT GTCCGGCAAG AACGGCGTCT CGAGCTTCGC GATGCAGTTC
ACCTCGGACT TCCAGGAGAA GGACATCGTC TACGGCGGCG ACAAGAAGCT CGCCGTGCTC
CTGCGGGAGG CGAAGGAGCT CTTCCCGCTC GCCAAGGGCA TCTCCGTCCT GTCGGAGTGC
CCCGTGGGCC TCATCGGCGA CGACATCAAC GCGGTGGCGA AGCAGATGTC GAAGGAGCTG
GACCTCCCGA TCATCCCCTG CAACTGCGAG GGCTTCCGCG GCGTCTCCCA GTCGCTCGGG
CACCACATCT CGAACGACAC CATCCGCGAC TACATCATCG AGACGCGCGA GTTCGCGGAG
CCCGAGACGC CGTACGACAT CGCCCTCATC GGCGAGTACA ACATCGGCGG CGACGCCTGG
TCGACGAAGC CGCTGCTGGA GGAGTGCGGC TTCAACGTGA AGGCGGTGTG GACCGGCGAC
GGCCAGATGG AGCACATCGC CGCGACGCAC GAGGTGAAGC TCAACGTCAT CCACTGCTAC
CGCTCCATGA ACTACATGTG CAAGGTCATG GAGGAGAAGT ACGGCGTGCC GTGGATCGAG
CTGAACTTCT TCGGGCCCAC GAAGATCAAG GAGAGCCTGC GCAAGCTCGC CGAGCGGTTC
GACGACCGGA TCAAGGCGAA CGTGGAGAAG GTCATCGCCA GGTACGACCC CATGATGCAG
CGGGTGATCG AGGAGGTCCG GCCGCGCCTG GAGGGTAAGA AGGTGATGCT CTACGTGGGC
GGCCTCCGCC CGCGCCACAC CGTGGGCGCG TACGAGGACC TCGGCATGAC CGTGGTCGGC
TCCGGCTACG AGTTCGCGCA CTCGGACGAC TACGACCGGA CCTCGCCCGA GATGCCCGAC
GCGACCGTCA TCTACGACGA CGCCTCGGAG TACGAGCTCG AGCGGTTCGT CCACGACCTG
AAGCCGGACC TCGTCGCCTC GGGGATCAAG GAGAAGTACC TCTTCCAGAA GATGGGGCTG
CCGTTCCGGC AGATGCACAG CTGGGACTAC TCGGGCCCGT ACCACGGGTA CAAGGGCTTC
CCCACCTTCG CCCGCGACAT CGACATGGCG ATCAACAGCC CGACGTGGGG GCTCGTGAAG
TCGCCGTTCT AG
 
Protein sequence
MSPRTNVKKV DGITKESTQA MIDRTLEAYP EKGKKKRAPH LAPNDQASAS ACVKSNRKTV 
PGVMSARGCA YAGAKGVVWG PIRDMVHVSH GPVGCGWYSW GTRRNLMSGK NGVSSFAMQF
TSDFQEKDIV YGGDKKLAVL LREAKELFPL AKGISVLSEC PVGLIGDDIN AVAKQMSKEL
DLPIIPCNCE GFRGVSQSLG HHISNDTIRD YIIETREFAE PETPYDIALI GEYNIGGDAW
STKPLLEECG FNVKAVWTGD GQMEHIAATH EVKLNVIHCY RSMNYMCKVM EEKYGVPWIE
LNFFGPTKIK ESLRKLAERF DDRIKANVEK VIARYDPMMQ RVIEEVRPRL EGKKVMLYVG
GLRPRHTVGA YEDLGMTVVG SGYEFAHSDD YDRTSPEMPD ATVIYDDASE YELERFVHDL
KPDLVASGIK EKYLFQKMGL PFRQMHSWDY SGPYHGYKGF PTFARDIDMA INSPTWGLVK
SPF