Gene Gdia_1569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1569 
Symbol 
ID6974979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1746237 
End bp1747736 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content59% 
IMG OID643391100 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_002275963 
Protein GI209543734 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.825856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGG ATGAAGACAA GACCAACGAT AGCGCGTTTC ACGCTCGCCT GATCGCGGAA 
GTGCTGGAAG CATACCCGGA CAAGGCCCGC AAGCGCCGGC AGAAGCATCT GAACGTCGCC
GGCCAAGCCG AAGCCGAGGC GCAGGACGCC GGCGAAGAAG GCGTGATGCT GAGCGAATGC
GACGTCAAGT CGAACGTGAA GTCCGTGCCG GGCGTCATGA CGATCCGCGG CTGCGCCTAT
GCCGGATCGA AGGGCGTGGT CTGGGGGCCG GTCAAGGACA TGGTCCATAT CAGCCATGGC
CCGGTCGGCT GCGGACAGTA TTCGTGGTCG CAGCGCCGCA ACTACTATAT AGGCAATACC
GGCGTCGACT CGTTCGTCAC CATGCAGTTC ACGTCCGATT TCCAGGAAAA GGACATCGTG
TTCGGCGGTG ACAAGAAGCT GGAAAAGATC ATCGACGAAA TCGATGAGCT CTTTCCGCTG
GCCAAGGGAA TTTCGGTGCA GTCCGAATGC CCGATCGGCC TGATCGGCGA CGACATCGAG
GCGGTGTCGC GCAAGAAGAA GAAGGAAATC GGCAAGACCA TCGTGCCGGT CCGGTGCGAG
GGCTTCCGCG GCGTTTCGCA GTCGCTGGGC CATCATATCG CCAATGATGC CATCCGGGAC
TGGGTGTTCG ACGGCGAGGA CAAGCACGCG GCGTTCGAGA CGACGCCTTA TGACGTCAAC
GTCATCGGTG ACTACAACAT CGGCGGCGAT GCCTGGTCGT CACGCATCCT GCTGGAAGAA
ATGGGCCTGC GCGTCGTGGG CAACTGGTCC GGCGATGCGA CGCTGGCCGA AATCGAGCGC
GCGCCCAAGG CGAAGCTGAA CCTGATTCAC TGCTATCGCT CGATGAACTA TATCTGCCGG
CATATGGAAG AAAAGTACAA TATTCCCTGG ACGGAATATA ACTTCTTCGG CCCGTCGCAG
ATCGCGGCAT CGCTGCGCAA GATCGCGGCC CTGTTCGACG AGAAGATCCA GGAAGGCGCC
GAGCGCGTCA TCGCGAAATA CCAGCCCCTG GTCGATGCGG TGATCGAGAA GTTCCGTCCG
CGCCTGGCGG GCAAGAAGGT CATGCTGTAT GTCGGCGGCC TGCGTCCGCG CCACGTCGTC
AATGCCTATA ACGACCTGGG CATGGAAATC GTCGGCACTG GCTACGAATT CGGCCACAAC
GACGACTATC AGCGCACTGG ACACTACGTC AGGGAAGGCA CGCTGATCTA CGACGACGTC
ACCGGGTATG AACTGGAGAA GTTCATCGAA GGCATCCGTC CGGACCTGGT CGGATCGGGC
ATCAAGGAAA AATATCCCGT GCAGAAGATG GGCATCCCGT TCCGTCAGAT GCATTCATGG
GATTATTCGG GCCCGTATCA TGGCTATGAT GGTTTCGCCA TCTTCGCCCG TGACATGGAT
CTTGCCATCA ATAATCCGGT CTGGAGCATG TTCAAGGCTC CGTGGAAAAA CGCCGCCTGA
 
Protein sequence
MSLDEDKTND SAFHARLIAE VLEAYPDKAR KRRQKHLNVA GQAEAEAQDA GEEGVMLSEC 
DVKSNVKSVP GVMTIRGCAY AGSKGVVWGP VKDMVHISHG PVGCGQYSWS QRRNYYIGNT
GVDSFVTMQF TSDFQEKDIV FGGDKKLEKI IDEIDELFPL AKGISVQSEC PIGLIGDDIE
AVSRKKKKEI GKTIVPVRCE GFRGVSQSLG HHIANDAIRD WVFDGEDKHA AFETTPYDVN
VIGDYNIGGD AWSSRILLEE MGLRVVGNWS GDATLAEIER APKAKLNLIH CYRSMNYICR
HMEEKYNIPW TEYNFFGPSQ IAASLRKIAA LFDEKIQEGA ERVIAKYQPL VDAVIEKFRP
RLAGKKVMLY VGGLRPRHVV NAYNDLGMEI VGTGYEFGHN DDYQRTGHYV REGTLIYDDV
TGYELEKFIE GIRPDLVGSG IKEKYPVQKM GIPFRQMHSW DYSGPYHGYD GFAIFARDMD
LAINNPVWSM FKAPWKNAA