Gene Gdia_1567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1567 
Symbol 
ID6974977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1743127 
End bp1744566 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content65% 
IMG OID643391098 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifE 
Protein accessionYP_002275961 
Protein GI209543732 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG CTTTGAAGGC GAAGGTTACC GAACTGTTCA ACGAGCCAGG GTGTGAAAAG 
AACCTGGCGA AGGGCGAAAA GGAGCGCAGG AAGGGGTGTT CCAAGCCGCT GACGCCCGGT
GCGGCGGCGG GGGGGTGCGC GTTTGACGGC GCGAAGATCG CGCTGCAGCC CATTACGGAT
GCGGTCCATC TGGTCCATGG TCCGCTGGCC TGCGAAGGCA ATGGCTGGGA CAACCGGCAT
GCCGCCAGTT CCGGACCGAA ACTGTATCGC CTGGGCGCCA CGACGGATCT GTCGCAGATG
GATATCGTCA TGGGGCTGGG CGAAAAGCGG CTGTACAAGG CCATCAAGGA CGTCATCGCC
CGCTATGCGC CACCCGCGGT CTTCGTCTAT TCCACATGCG TTCCGGCGCT GACCGGCGAT
GATGTCGTCG CGGTCTGCGC CCATGCCAGC CAGAAGCTGG CGACGCCGTG CATCCCGGTC
AACGCCCCGG GCTTTGTCGG CGGGAAGAAC CTGGGCAACA AGCTGGCGGG CGAGGCGCTG
CTGGATTATG TGATCGGGAC GATGGAGCCC GAAATCAGCA CGCCCACGGA TATCAATATT
CTTGGCGAAT ATAATCTTTC GGGCGAATTG TGGCAGATCC TGCCGCTGTT TCGGGCCCTG
GGGATTCGGG TCCATGCCTG CGTGACGGGC GATGCCCGCT ATCGCGAGGT TGCGTCGGCG
CACCGGTCGC GCGTCAACAT GATGGTGTGC TCGACCGCGC TGATCAACGT CGCGCGCAAG
ATGGAAGAAC GGTGGGGCAT TCCGTACTTT GAAGGATCTT TCTACGGAAT CGAGGACACC
TCCGCCGCCC TGCGGGCCAT CGCCGGGATG CTGGTGGCCC GTGGCGCGCC GGCCGACCTG
CCGGCCCGGG CCGAGGCCCT GATCGCGGAG GAGGAAGCCC GCGCCTGGGA GGCCATCGCC
CCCTATCGCG CGCGGCTGGA GGGCAAGCGG GTGCTGCTCT ATACCGGCGG TGTCAAATCC
TGGTCGATCG TTTCCGCCCT GCAGGAGATC GGGATGGTCG TCGTGGGCAC GTCGGTCCGG
AAATCCACCG ATAACGACAA GCAGAAGATC AAGGACCTGA TGGGCGGCGA CGCCCACATG
GTGGACGCCA TCCCGCCGCG GGAGATGTAC GCCCAGTTGC GGCGCGGCGA CGCGGACATC
CTGCTGTCCG GCGGACGGAC GCAGTTCGTG GCGCTGAAGG CCCGGGTGCC CTGGCTGGAT
ATCAACCAGG AACGGCACCA GGCCTATGCC GGATATGACG GCATGGTGGC GCTGGTGCGT
GAACTGGATC GGTCCCTGTC CAACCCGGTC TGGGCGGATG TCCGCCGCCC GGCCCCGTGG
GAGGAGGACG ATGCCGATGC GTTCCTGGAT GACGCGCCGT TCCTGACCCC CTTGTCCTGA
 
Protein sequence
MSDALKAKVT ELFNEPGCEK NLAKGEKERR KGCSKPLTPG AAAGGCAFDG AKIALQPITD 
AVHLVHGPLA CEGNGWDNRH AASSGPKLYR LGATTDLSQM DIVMGLGEKR LYKAIKDVIA
RYAPPAVFVY STCVPALTGD DVVAVCAHAS QKLATPCIPV NAPGFVGGKN LGNKLAGEAL
LDYVIGTMEP EISTPTDINI LGEYNLSGEL WQILPLFRAL GIRVHACVTG DARYREVASA
HRSRVNMMVC STALINVARK MEERWGIPYF EGSFYGIEDT SAALRAIAGM LVARGAPADL
PARAEALIAE EEARAWEAIA PYRARLEGKR VLLYTGGVKS WSIVSALQEI GMVVVGTSVR
KSTDNDKQKI KDLMGGDAHM VDAIPPREMY AQLRRGDADI LLSGGRTQFV ALKARVPWLD
INQERHQAYA GYDGMVALVR ELDRSLSNPV WADVRRPAPW EEDDADAFLD DAPFLTPLS