Gene Gmet_0669 details


Gene Information

Locus tag: Gmet_0669
Symbol:
ID: 3739215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 733828
End bp: 736587
Gene Length: 2760 bp
Protein Length: 919 aa
Translation table: 11
GC content: 65%
IMG OID: 637777947
Product: bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN
Protein accession: YP_383636
Protein GI: 78221889
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
            [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
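
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The record does not state either convention, but both are consistent with its numbers. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 733828
end_bp = 736587
gene_length_bp = 2760
protein_length_aa = 919

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions, 736587 - 733828 + 1 = 2760 bp, and 2760 / 3 = 920 codons, which leaves 919 residues once the stop codon is excluded.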


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.961568
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAC CAGACTACTA CGACGTATCC GAATGCGAAA CCCACGAAAA AGGTGCCCCC 
AAGTTCTGCA AGAAATCGGA GCCGGGGGAG GGGGCCGAGC GCTCCTGTGC CTACGACGGC
GCCCGGGTGG TGCTCATGCC GATCACCGAT GTGATCCATC TGGTGCACGG CCCCATCGCC
TGCGCCGGCA ATTCCTGGGA CAACCGGGGG GCCCGCTCCT CGGACTCCCA GCTCTACCGC
CGGGGTTTCA CAACCGAGAT GCTGGAGAAC GACGTGGTTT TCGGTGGCGA GAAGAAGCTC
TACCGGGCCA TCCTGGAGCT GGCCGAGCGC TACGAAGGCC AGGCGAAGGC CATGTTCGTC
TACGCCACCT GCGTCACCGC CATGACCGGC GACGACGTGG AGGCGGTCTG CGCCGCGGCC
GGCAAGAAGG TCGCCATCCC CCTCATCCCC GTCAACACCC CCGGTTTCAT CGGCGACAAG
AACATCGGCA ACCGCTTGGC CGGCGAGGTG CTCTTCAAGC ATGTCATCGG CACCGCCGAG
CCGCCGGTTC TGGGGGAGTA TCCGATAAAT CTCATTGGCG AGTACAACAT TGCCGGCGAC
CTCTGGGGGA TGCTGCCGCT CTTTGAGCGC CTCGGCATCC AGGTCCTCTC CTGCTTCAGC
GGCGACGCCA CCTTCGAGGA ACTCCGCTAC GCCCACCGGG CAAAGCTCAA TATTATCATC
TGCTCCAAGA GCCTCACCAA CCTCGCCAGG AAGATGCAGA AGAACTACGG CATGCCCTAT
CTGGAGGAGT CCTTCTACGG CATGACCGAC ACGGCCAAAG CCCTGCGGGA CATCGCCCGG
GAGCTGGACG ACGCCGTGGG GGGGCTGGAG AAGCGGATCA TGCAGGACCG GGTGGAGAAG
CTACTGGAGG AGGAAGAGGC GACGTGCCGC GAGCGCCTTG CCCCCTATCG GGCGCGGCTG
GAGGGGAAGC GGTCGGTCCT CTTCACCGGC GGGGTCAAGA CCTGGTCCAT GGTGAACGCC
CTGCGGGAGC TGGGGGTGGA GATCCTGGCC GCCGGCACCC AGAACTCCAC CCTGGAGGAC
TTCTACCGGA TGAAGGCCCT CATGCACCAG GATGCCCGGA TCATCGAGGA CACCTCCAGC
GCCGGGCTCC TCCAGGTCAT GTACGACAAG ATGCCGGATC TCATTGTAGC CGGGGGGAAG
ACCAAGTTCC TGGCCCTTAA GACCAAGACC CCCTTCCTGG ATATCAACCA CGGCCGCTCC
CACCCCTACG CCGGGTACGA GGGGATGGTG ACCTTTGCGA AGCAGCTGGA CCTCACGGTC
AACAACCCCA TCTGGCCGGT GCTGAACGCC AAGGCGCCCT GGGAGAAGAC GGAGGAAGAG
TTGACGGCCG CGGTGGCGCT TGCCGCCGGT CACGCCCGAG CCTGCCTGGA TGAGGATCTG
AAGGATTCCA CGGTGAAGGT GCCGGCCAAG AACGCCACGG TGAACCCCCA GAAGAACTCC
CCGGCCCTTG GCGCGACCCT GGCCTATCTC GGCATCGACC AGATGCTGGC CCTTCTCCAT
GGCGCCCAGG GATGCTCCAC CTTCATCCGG CTGCAGCTCT CGCGCCATTT CAAGGAGCCC
GTCGCCCTGA ACTCCACCGC CATGAGCGAG GACACCGCCA TCTTCGGCGG GTGGGAGAAC
CTGAAAAAGG GGCTGAAAAA GGTGATCGAG AAGTTCAGCC CGGAGGTGGT GGGGGTCATG
ACCTCGGGGC TCACCGAGAC CATGGGGGAT GATGTCCGGA GCGCCATCGT CCACTTTCGG
CAGGAGTACC CCGAGCACGA CGGGGTCCCC GTCGTTTGGG CTTCGACCCC TGACTACTGC
GGCTCGCTGC AGGAGGGGTA CGCGGCAACG GTGGAGGCCA TCGTGAGGAG CGTCCCCGAG
CCGGGGGAGA CGATCCCGGG CCAGGTGACG GTCCTGCCCG GTGCCCACCT GACCCCGGCC
GACGTGGAGG AGGTGCGGGA GCTCTGCGAG GCCTTCGGGC TCGACCCCAT CATCGTCCCC
GACATCGCCA ACGCCCTGGA TGGCCACATC GACGAGACAG TGTCGCCGCT TTCAACCGGC
GGGGTTTCCA TGGCCCGGAT CAGGCAGGCG GGGCAAAGCG CGGCGACCCT CTTCATCGGC
GATTCCCTGG CCAAGGCTGC CGAGGCCATG ACGGAGCGGT GCGGCATGCC CAGCTATGGC
TTCACCTCCC TCACGGGGCT TGCCCAGGTG GACCGCTTCA TGGAGACCCT CGCGGCCATC
GCGGGGCGGC CGATCCCCGA GAAGTTCCGC CGCTGGCGGA GCCGCCTCAT GGACGCCATG
GTGGACAGCC ACTACCAGTT CGGCCTCAAG AAAGTCACGG TGGCCCTGGA GGGAGACAAC
CTGAAAACGC TGGTGAACTT CCTGGCGGGG ATGGGGTGCG AGATACAGGC GGCCATCGCG
GCGACACGGG TCCGGGGGCT CGACGGGCTG CCGGCTAGGG ATATCTTCGT GGGGGATCTG
GAGGACCTGG AAACGGCGGC TAGGGGAAGC GACCTGATCG TGGCCAACTC CAATGGCCGT
CAGGCCGCGG CAAAGCTGGG GATCAAGGCC CACCTGCGGG CGGGGCTTCC GGTCTTCGAT
CGCCTTGGCG CCCACCAGAA GATGTGGGTG GGGTACCGGG GGACCATGAA CCTTCTGTTC
GAGACGGCGA ACCTGTTCCA GGCCAACGCG GGAGAGGGGC AGAAGCTGGC GCATAACTGA
 
Protein sequence
MAKPDYYDVS ECETHEKGAP KFCKKSEPGE GAERSCAYDG ARVVLMPITD VIHLVHGPIA 
CAGNSWDNRG ARSSDSQLYR RGFTTEMLEN DVVFGGEKKL YRAILELAER YEGQAKAMFV
YATCVTAMTG DDVEAVCAAA GKKVAIPLIP VNTPGFIGDK NIGNRLAGEV LFKHVIGTAE
PPVLGEYPIN LIGEYNIAGD LWGMLPLFER LGIQVLSCFS GDATFEELRY AHRAKLNIII
CSKSLTNLAR KMQKNYGMPY LEESFYGMTD TAKALRDIAR ELDDAVGGLE KRIMQDRVEK
LLEEEEATCR ERLAPYRARL EGKRSVLFTG GVKTWSMVNA LRELGVEILA AGTQNSTLED
FYRMKALMHQ DARIIEDTSS AGLLQVMYDK MPDLIVAGGK TKFLALKTKT PFLDINHGRS
HPYAGYEGMV TFAKQLDLTV NNPIWPVLNA KAPWEKTEEE LTAAVALAAG HARACLDEDL
KDSTVKVPAK NATVNPQKNS PALGATLAYL GIDQMLALLH GAQGCSTFIR LQLSRHFKEP
VALNSTAMSE DTAIFGGWEN LKKGLKKVIE KFSPEVVGVM TSGLTETMGD DVRSAIVHFR
QEYPEHDGVP VVWASTPDYC GSLQEGYAAT VEAIVRSVPE PGETIPGQVT VLPGAHLTPA
DVEEVRELCE AFGLDPIIVP DIANALDGHI DETVSPLSTG GVSMARIRQA GQSAATLFIG
DSLAKAAEAM TERCGMPSYG FTSLTGLAQV DRFMETLAAI AGRPIPEKFR RWRSRLMDAM
VDSHYQFGLK KVTVALEGDN LKTLVNFLAG MGCEIQAAIA ATRVRGLDGL PARDIFVGDL
EDLETAARGS DLIVANSNGR QAAAKLGIKA HLRAGLPVFD RLGAHQKMWV GYRGTMNLLF
ETANLFQANA GEGQKLAHN