Gene Gdia_1566 details


Gene Information

Locus tag: Gdia_1566
Symbol:
ID: 6974976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1741718
End bp: 1743115
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 67%
IMG OID: 643391097
Product: nitrogenase molybdenum-cofactor biosynthesis protein NifN
Protein accession: YP_002275960
Protein GI: 209543731
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
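
The coordinates and lengths above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
start_bp = 1741718
end_bp = 1743115
reported_gene_length = 1398      # bp
reported_protein_length = 465    # aa


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length, protein_length)  # 1398 465
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # True True
```

Under these assumptions the reported gene length (1398 bp) and protein length (465 aa) agree with the replicon coordinates.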


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCACCA TCGTGAAGCC GCGCAAGGCG GCGTCCGTCA ATCCGCTGAA ATCCTCGACG 
CCGCTGGGCG CGGCGCTGGC CTATCTGGGT ATCGACGGCG CGGTGCCGCT GTTCCATGGC
TCGCAGGGCT GCACGTCCTT CGCGCTGGTC CTGACCGTGC GCCATTACAA GGAAGCGATC
CCGCTGCAGA CCACGGCGAT GGACGAGGTC GCGACCATCC TGGGGGCCGC GGGCAATTTG
GAAGAGGCAC TGCTGAATTT GCAGCGGCGG ATGAAGCCGC GCTTCATCGG CATCGCCTCG
ACCGCACTGG TCGAGACCAG GGGAGAGGAT TATGCCGGCG ACCTGAAGCT GATCCTCCAG
CGGCAGCCCG AACTGGCGGA TACACGGATC GTCTTCGCCT CGACCCCGGA TTATGCCGGC
GCGCTGGAGG ACGGCTGGGC GGCCGCCGTC AGCGCGATCA TCGAATCGGT CGTGGCCCCG
TGGTCGCCGA CGGTGACGTC GTTCCAGCAG GTCAACGTCC TGCCGGGCGT GCACCAGACC
CCGGCCGACA TCGAGGCGCT GCGGGACCTG ATCGAAAGCT TCGGCCTGTA TCCCGTGATC
CTGCCGGACC TGTCCGGATC GCTGGACGGT CATGTGGCCG AGAACTGGTG CCCGACCACG
CAGGGCGGCG CGCGGATGGA AGAAGTGGCG CAGATGGCGC GCGCGGTGCA CACCATCGCC
ATCGGCGAGC ATATGCGCGC GCCGGCCGAC CTGCTGGGCA GCGTAACCGG CGTGCCGGTC
ACGCTGTTTC CCACCCTGAC GGGACTTGCG GCCAACGACC GGCTGATGGC GCTGCTGTCG
CGCCTGTCGG GCCGGGCGGT GCCGGGGCGC TATCGCCGCC AGCGCAGCCA GTTGCTGGAC
GCGATGCTGG ACGGTCATTT CCATTTCGGC GGCAAGCGCA TCGCGATCGC CGCCGATCCC
GATCTTCTGT ATGGCCTGTC GGCCTTCTTT GCCGGAATGG GCGCACGGAT CGTCGCCGCG
GTCGCCTCGA CGTCCAATGC GCCGAACCTG GACTCCATTC CCGCGGACAG CGTCATCGTG
GGCGACCTGA CGGATCTTGA AGACGCGGTC CACGCGGCGG GGGGCGCCGA TCTGCTGGTC
ACCCACAGTC ACGGCCGTCA GTCCGCCGAC CGCCTGGGCA TTCCGCTGAT GCGCGTGGGC
TTTCCCATTT TCGATCGTCT GGGCACCGCG CACGCGCAGA CCATCGGATA TCGCGGCACG
CGCGACCTGA TCTTCCGCGT CGCCAACCTG TTCCTGGGCC AGATGCATGA GCACACGCCG
GACGATTTCG GCCACGTGCC GTCCGCCCAT ACGATCGAGG AGATAGTGCA TGACAGCGCG
TCGCTTGCAG CTCATTGA
 
Protein sequence
MATIVKPRKA ASVNPLKSST PLGAALAYLG IDGAVPLFHG SQGCTSFALV LTVRHYKEAI 
PLQTTAMDEV ATILGAAGNL EEALLNLQRR MKPRFIGIAS TALVETRGED YAGDLKLILQ
RQPELADTRI VFASTPDYAG ALEDGWAAAV SAIIESVVAP WSPTVTSFQQ VNVLPGVHQT
PADIEALRDL IESFGLYPVI LPDLSGSLDG HVAENWCPTT QGGARMEEVA QMARAVHTIA
IGEHMRAPAD LLGSVTGVPV TLFPTLTGLA ANDRLMALLS RLSGRAVPGR YRRQRSQLLD
AMLDGHFHFG GKRIAIAADP DLLYGLSAFF AGMGARIVAA VASTSNAPNL DSIPADSVIV
GDLTDLEDAV HAAGGADLLV THSHGRQSAD RLGIPLMRVG FPIFDRLGTA HAQTIGYRGT
RDLIFRVANL FLGQMHEHTP DDFGHVPSAH TIEEIVHDSA SLAAH
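
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes NCBI translation table 11, which shares the standard codon assignments and allows alternative initiation codons such as GTG; an initiation codon at the first position is read as M. Only the first and last stretches of the gene sequence are used here, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, not taken from any particular library.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (also used by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq, is_cds_start=True):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30_nt = "GTGGCCACCA TCGTGAAGCC GCGCAAGGCG"
last_18_nt = "TCGCTTGCAG CTCATTGA"

print(translate(first_30_nt))                     # MATIVKPRKA
print(translate(last_18_nt, is_cds_start=False))  # SLAAH
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.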