Gene Ppro_3491 details

Gene Information

Locus tag: Ppro_3491
Symbol:
ID: 4572658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelobacter propionicus DSM 2379
Kingdom: Bacteria
Replicon accession: NC_008609
Strand:
Start bp: 3854111
End bp: 3856870
Gene Length: 2760 bp
Protein Length: 919 aa
Translation table: 11
GC content: 65%
IMG OID: 639757550
Product: bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN
Protein accession: YP_903141
Protein GI: 118581891
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
            [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
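
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3854111, 3856870)
print(length)                  # 2760
print(protein_length(length))  # 919
```

Both values match the Gene Length and Protein Length fields reported for Ppro_3491.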


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAC CTGACTACTA CGATGTAACC GATTGCGACA CCCACGACAA GGGCGCCCCC 
AAGTTCTGCA AGAAATCCGA GCCGGGCGAG GGGACCGAGC GGAGCTGCGC CTACGACGGC
GCCCGGGTGG TGCTGATGCC GATCACCGAC GTGATCCACC TGGTACACGG CCCCATCGCC
TGCGCCGGCA ACTCCTGGGA CAACCGCGGG GCCCGCTCCT CCGGCTCCCA GCTCTACCGG
CGCGGCTTCA CCACCGAGAT GCTGGAGAAC GACGTGGTCT TCGGCGGCGA GAAGAAGCTG
TACAAGGCCA TACTTGACCT GGCCGAGCGC TACCGCGACC AGGCCAAGGC CATCTTCGTC
TACGCCACCT GCGTCACCGC CATGACCGGC GACGACGTGG AGGCGGTCTG CAAGGCTGCC
CAGCCCAAGG CGGGTATGCC GGTCATACCG ATCAACGCCC CCGGCTTCAT CGGCGACAAG
AACATCGGCA ACCGCCTGGC CGGGGAGATC ATGTTCAAGC ATGTCATCGG AACGGCAGAG
CCGCCTGAAC TGGGGGAGTA CCCCATCAAC CTGATCGGCG AGTACAACAT TGCCGGCGAC
CTGTGGGGCA TGCTGCCGCT GTTCCAGCGG TTGGGGATCC AGATCCTCTC CTGCTTCAGC
GGAGACGCGA AATTCGAGGA GTTGCGCTAC GCCCACCGGG CCAAGCTGAA CGTGATCATC
TGCTCCAAGA GCCTGACCAA CCTGGCCAAG AAGATGCAGA AAACCTACGG TATGCCCTAT
CTGGAGGAAT CATTCTACGG CATGACCGAC ACGGCCAAGG CGCTGCGCGA CATCGCCCGG
GAGCTGGACA ACACGGTCAA CGGCCTGGAG AAGCGGGTCA TGCAGGACCG GGTGGAGAAG
CTCCTGGCAG AGGAAGAGGA AAAATGCCGC AAACGCATCG CCCCCTACCG GGCCAGGCTG
GAGGGGAAAC GGGCGGTGCT GTTCACCGGC GGGGTCAAGA CCTGGTCCAT GGTCAACGCC
CTGCGGGAGC TGGGGGTGGA GATCCTGGCC GCCGGCACCC AGAACTCGAC CCTGGAGGAC
TTCTACCGCA TGAAGGCGCT GATGCACGCC GATGCCGGGA TCATCGAGGA TACCAGCACC
GCGGGGCTTT TGGCCGTGAT GCGGGAGAAG ATGCCCGACC TGATCGTGGC CGGCGGCAAG
ACCAAGTTCC TGGCGCTGAA GACCAAGACC CCCTTCCTGG ACATCAATCA TGGCCGCAGC
CACCCCTACG CCGGCTACGA AGGGATGGTC ACCTTTGCCA AACAGCTGGA CCTGACGGTG
AACAACCCGA TCTGGCCGCT CCTGAACGCC CCGGCCCCCT GGGAGAAGAG CCCCGACCAG
CTGGCCGAAG ACCTGGTGGA GGTGGCCGGC CATGGCGAGC GCTTCCTGGC CGAGGATCTC
TCCGCCTCAA GAGTCAGGGT CAGCACCAAG AGTGCGGTGG TCAACCCCCA GAAGAACTCG
CCGGCCCTGG GGGCGACCCT GGCCTACCTG GGGATCGACA ACATGCTGGG GCTCCTCCAC
GGCGCCCAGG GGTGCTCCAC CTTCATCAGG TTGCAGCTCT CCCGCCACTT CAAGGAGTCC
ATCGCCCTGA ATGCCACCGC CATGAGCGAG GAGTCGGCCA TCTTCGGCGG CTGGGAGAAC
CTGAAGCAGG GGATCAAGCG GGTGATGGAG AAGTTCCACC CCGGTGTGGT GGGGGTGATG
ACCTCGGGCC TGACCGAAAC CATGGGGGAC GACGTGCAGA GCGCCATCGT CCACTTCCGC
CGGGAAAACC CGGAACTGGC CGACACACCG GTCATCCACG CCTCCACCCC CGATTACTGC
GGCTCGCTGC AGGAGGGGTA CGCCGCGGCG GTTGAGGCGA TCCTGTCCAC CCTGCCCGAG
GGGGGCACGG CCGTTCCCGG CCAGGTCAAC CTGCTCCCCG GCAGCCACCT GACCCCGGCG
GACGTGGAGC AGATCAGGGA GCTGATGGAG GAATTCGGTC TGACGGTGCT GACCATCCCG
GACATATCCT GCGCCATGGA CGGCCACATC GACGAGCAGG TATCGCCCCT CTCCACCGGC
GGGATTGCGG TGGAGGCCAT CCGGCGGGCC GGTCGCAGCG TTGCCACCAT CTACGTGGGG
GATTCCCTGG CCAGGGCTGC CCTGAAAATG AAGGAGAAAT TCGCAATCCC GGCCTATGGT
TTTACCTCTC TCTCCGGGCT GGGAGAGACC GACCTGTTCA TGGAGACCAT GAGCGCACTC
TCCGGTCGCC CCATCCCGGA GAAGCAGCAG CGCTGGCGCA GCCGCCTCAT GGACGCCATG
GTGGACAGTC ACTACCAGTT CGGCGCCAAG AGGGTTGCCC TGGCCCTGGA GTCGGACAAC
CTGAAGAGCC TGACCACCTT CCTGGCCGGC ATGGGCTGCC AGATCCAGGC AGCACTGAGC
GCCACCCGCA CCCGTGGCCT GGACAGCCTT CCCTGCGACA ACGTTTTTGT GGGGGACCTG
GAGGATCTGG AGACCGCCGC CCAGGGGGTG GACCTGCTGG TGGCCAACAG CAACGGCCGC
CAGGCTGCGG CACGGCTGAA GATCGGCGCC CACCTGCGGG CAGGTCTGCC GGTGTTCGAC
CGCCTGGGCG CCCACCAGAA GATGTGGGTC GGCTACCGGG GAAGCATGAA CCTGCTGTTC
GAGGTGGCCA ACCTGTTCCA GGCCAATGCC AAGGAGGCGC AGAAGCTGGC GCATAATTGA
 
Protein sequence
MAKPDYYDVT DCDTHDKGAP KFCKKSEPGE GTERSCAYDG ARVVLMPITD VIHLVHGPIA 
CAGNSWDNRG ARSSGSQLYR RGFTTEMLEN DVVFGGEKKL YKAILDLAER YRDQAKAIFV
YATCVTAMTG DDVEAVCKAA QPKAGMPVIP INAPGFIGDK NIGNRLAGEI MFKHVIGTAE
PPELGEYPIN LIGEYNIAGD LWGMLPLFQR LGIQILSCFS GDAKFEELRY AHRAKLNVII
CSKSLTNLAK KMQKTYGMPY LEESFYGMTD TAKALRDIAR ELDNTVNGLE KRVMQDRVEK
LLAEEEEKCR KRIAPYRARL EGKRAVLFTG GVKTWSMVNA LRELGVEILA AGTQNSTLED
FYRMKALMHA DAGIIEDTST AGLLAVMREK MPDLIVAGGK TKFLALKTKT PFLDINHGRS
HPYAGYEGMV TFAKQLDLTV NNPIWPLLNA PAPWEKSPDQ LAEDLVEVAG HGERFLAEDL
SASRVRVSTK SAVVNPQKNS PALGATLAYL GIDNMLGLLH GAQGCSTFIR LQLSRHFKES
IALNATAMSE ESAIFGGWEN LKQGIKRVME KFHPGVVGVM TSGLTETMGD DVQSAIVHFR
RENPELADTP VIHASTPDYC GSLQEGYAAA VEAILSTLPE GGTAVPGQVN LLPGSHLTPA
DVEQIRELME EFGLTVLTIP DISCAMDGHI DEQVSPLSTG GIAVEAIRRA GRSVATIYVG
DSLARAALKM KEKFAIPAYG FTSLSGLGET DLFMETMSAL SGRPIPEKQQ RWRSRLMDAM
VDSHYQFGAK RVALALESDN LKSLTTFLAG MGCQIQAALS ATRTRGLDSL PCDNVFVGDL
EDLETAAQGV DLLVANSNGR QAAARLKIGA HLRAGLPVFD RLGAHQKMWV GYRGSMNLLF
EVANLFQANA KEAQKLAHN
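
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch of how the two listings relate: it strips the blocking, computes GC content, and translates codons until the first stop. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Amino-acid assignments are
# assumed identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above (60 nt, 20 codons).
first_row = "ATGGCAAAAC CTGACTACTA CGATGTAACC GATTGCGACA CCCACGACAA GGGCGCCCCC"
print(translate(clean(first_row)))  # MAKPDYYDVTDCDTHDKGAP
```

The printed peptide matches the first twenty residues of the protein sequence. Passing the full cleaned gene sequence to `gc_content` gives a value that can be compared with the GC content field in the Gene Information section.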