Gene Information |
Locus tag | Ppro_3491 |
Symbol | |
ID | 4572658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelobacter propionicus DSM 2379 |
Kingdom | Bacteria |
Replicon accession | NC_008609 |
Strand | - |
Start bp | 3854111 |
End bp | 3856870 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639757550 |
Product | bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN |
Protein accession | YP_903141 |
Protein GI | 118581891 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
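The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3854111
END_BP = 3856870
GENE_LENGTH_BP = 2760
PROTEIN_LENGTH_AA = 919


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)            # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Because the gene is on the minus strand, the listed sequence is the reverse complement of the replicon over this interval; the length check is unaffected by strand.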
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAC CTGACTACTA CGATGTAACC GATTGCGACA CCCACGACAA GGGCGCCCCC AAGTTCTGCA AGAAATCCGA GCCGGGCGAG GGGACCGAGC GGAGCTGCGC CTACGACGGC GCCCGGGTGG TGCTGATGCC GATCACCGAC GTGATCCACC TGGTACACGG CCCCATCGCC TGCGCCGGCA ACTCCTGGGA CAACCGCGGG GCCCGCTCCT CCGGCTCCCA GCTCTACCGG CGCGGCTTCA CCACCGAGAT GCTGGAGAAC GACGTGGTCT TCGGCGGCGA GAAGAAGCTG TACAAGGCCA TACTTGACCT GGCCGAGCGC TACCGCGACC AGGCCAAGGC CATCTTCGTC TACGCCACCT GCGTCACCGC CATGACCGGC GACGACGTGG AGGCGGTCTG CAAGGCTGCC CAGCCCAAGG CGGGTATGCC GGTCATACCG ATCAACGCCC CCGGCTTCAT CGGCGACAAG AACATCGGCA ACCGCCTGGC CGGGGAGATC ATGTTCAAGC ATGTCATCGG AACGGCAGAG CCGCCTGAAC TGGGGGAGTA CCCCATCAAC CTGATCGGCG AGTACAACAT TGCCGGCGAC CTGTGGGGCA TGCTGCCGCT GTTCCAGCGG TTGGGGATCC AGATCCTCTC CTGCTTCAGC GGAGACGCGA AATTCGAGGA GTTGCGCTAC GCCCACCGGG CCAAGCTGAA CGTGATCATC TGCTCCAAGA GCCTGACCAA CCTGGCCAAG AAGATGCAGA AAACCTACGG TATGCCCTAT CTGGAGGAAT CATTCTACGG CATGACCGAC ACGGCCAAGG CGCTGCGCGA CATCGCCCGG GAGCTGGACA ACACGGTCAA CGGCCTGGAG AAGCGGGTCA TGCAGGACCG GGTGGAGAAG CTCCTGGCAG AGGAAGAGGA AAAATGCCGC AAACGCATCG CCCCCTACCG GGCCAGGCTG GAGGGGAAAC GGGCGGTGCT GTTCACCGGC GGGGTCAAGA CCTGGTCCAT GGTCAACGCC CTGCGGGAGC TGGGGGTGGA GATCCTGGCC GCCGGCACCC AGAACTCGAC CCTGGAGGAC TTCTACCGCA TGAAGGCGCT GATGCACGCC GATGCCGGGA TCATCGAGGA TACCAGCACC GCGGGGCTTT TGGCCGTGAT GCGGGAGAAG ATGCCCGACC TGATCGTGGC CGGCGGCAAG ACCAAGTTCC TGGCGCTGAA GACCAAGACC CCCTTCCTGG ACATCAATCA TGGCCGCAGC CACCCCTACG CCGGCTACGA AGGGATGGTC ACCTTTGCCA AACAGCTGGA CCTGACGGTG AACAACCCGA TCTGGCCGCT CCTGAACGCC CCGGCCCCCT GGGAGAAGAG CCCCGACCAG CTGGCCGAAG ACCTGGTGGA GGTGGCCGGC CATGGCGAGC GCTTCCTGGC CGAGGATCTC TCCGCCTCAA GAGTCAGGGT CAGCACCAAG AGTGCGGTGG TCAACCCCCA GAAGAACTCG CCGGCCCTGG GGGCGACCCT GGCCTACCTG GGGATCGACA ACATGCTGGG GCTCCTCCAC GGCGCCCAGG GGTGCTCCAC CTTCATCAGG TTGCAGCTCT CCCGCCACTT CAAGGAGTCC ATCGCCCTGA ATGCCACCGC CATGAGCGAG GAGTCGGCCA TCTTCGGCGG CTGGGAGAAC CTGAAGCAGG GGATCAAGCG GGTGATGGAG AAGTTCCACC CCGGTGTGGT GGGGGTGATG ACCTCGGGCC TGACCGAAAC CATGGGGGAC GACGTGCAGA GCGCCATCGT CCACTTCCGC 
CGGGAAAACC CGGAACTGGC CGACACACCG GTCATCCACG CCTCCACCCC CGATTACTGC GGCTCGCTGC AGGAGGGGTA CGCCGCGGCG GTTGAGGCGA TCCTGTCCAC CCTGCCCGAG GGGGGCACGG CCGTTCCCGG CCAGGTCAAC CTGCTCCCCG GCAGCCACCT GACCCCGGCG GACGTGGAGC AGATCAGGGA GCTGATGGAG GAATTCGGTC TGACGGTGCT GACCATCCCG GACATATCCT GCGCCATGGA CGGCCACATC GACGAGCAGG TATCGCCCCT CTCCACCGGC GGGATTGCGG TGGAGGCCAT CCGGCGGGCC GGTCGCAGCG TTGCCACCAT CTACGTGGGG GATTCCCTGG CCAGGGCTGC CCTGAAAATG AAGGAGAAAT TCGCAATCCC GGCCTATGGT TTTACCTCTC TCTCCGGGCT GGGAGAGACC GACCTGTTCA TGGAGACCAT GAGCGCACTC TCCGGTCGCC CCATCCCGGA GAAGCAGCAG CGCTGGCGCA GCCGCCTCAT GGACGCCATG GTGGACAGTC ACTACCAGTT CGGCGCCAAG AGGGTTGCCC TGGCCCTGGA GTCGGACAAC CTGAAGAGCC TGACCACCTT CCTGGCCGGC ATGGGCTGCC AGATCCAGGC AGCACTGAGC GCCACCCGCA CCCGTGGCCT GGACAGCCTT CCCTGCGACA ACGTTTTTGT GGGGGACCTG GAGGATCTGG AGACCGCCGC CCAGGGGGTG GACCTGCTGG TGGCCAACAG CAACGGCCGC CAGGCTGCGG CACGGCTGAA GATCGGCGCC CACCTGCGGG CAGGTCTGCC GGTGTTCGAC CGCCTGGGCG CCCACCAGAA GATGTGGGTC GGCTACCGGG GAAGCATGAA CCTGCTGTTC GAGGTGGCCA ACCTGTTCCA GGCCAATGCC AAGGAGGCGC AGAAGCTGGC GCATAATTGA
|
Protein sequence | MAKPDYYDVT DCDTHDKGAP KFCKKSEPGE GTERSCAYDG ARVVLMPITD VIHLVHGPIA CAGNSWDNRG ARSSGSQLYR RGFTTEMLEN DVVFGGEKKL YKAILDLAER YRDQAKAIFV YATCVTAMTG DDVEAVCKAA QPKAGMPVIP INAPGFIGDK NIGNRLAGEI MFKHVIGTAE PPELGEYPIN LIGEYNIAGD LWGMLPLFQR LGIQILSCFS GDAKFEELRY AHRAKLNVII CSKSLTNLAK KMQKTYGMPY LEESFYGMTD TAKALRDIAR ELDNTVNGLE KRVMQDRVEK LLAEEEEKCR KRIAPYRARL EGKRAVLFTG GVKTWSMVNA LRELGVEILA AGTQNSTLED FYRMKALMHA DAGIIEDTST AGLLAVMREK MPDLIVAGGK TKFLALKTKT PFLDINHGRS HPYAGYEGMV TFAKQLDLTV NNPIWPLLNA PAPWEKSPDQ LAEDLVEVAG HGERFLAEDL SASRVRVSTK SAVVNPQKNS PALGATLAYL GIDNMLGLLH GAQGCSTFIR LQLSRHFKES IALNATAMSE ESAIFGGWEN LKQGIKRVME KFHPGVVGVM TSGLTETMGD DVQSAIVHFR RENPELADTP VIHASTPDYC GSLQEGYAAA VEAILSTLPE GGTAVPGQVN LLPGSHLTPA DVEQIRELME EFGLTVLTIP DISCAMDGHI DEQVSPLSTG GIAVEAIRRA GRSVATIYVG DSLARAALKM KEKFAIPAYG FTSLSGLGET DLFMETMSAL SGRPIPEKQQ RWRSRLMDAM VDSHYQFGAK RVALALESDN LKSLTTFLAG MGCQIQAALS ATRTRGLDSL PCDNVFVGDL EDLETAAQGV DLLVANSNGR QAAARLKIGA HLRAGLPVFD RLGAHQKMWV GYRGSMNLLF EVANLFQANA KEAQKLAHN
|
| |
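As a spot check that the gene and protein sequences above agree, the following minimal sketch translates the first and last codons of the gene sequence. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code for elongation codons (table 11 differs in the start codons it allows). The `translate` function and the compact codon table are illustrative, not taken from any particular library.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring spaces and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 nt of the gene sequence, spaces kept as they appear in the record.
    print(translate("ATGGCAAAAC CTGACTACTA CGATGTAACC"))  # MAKPDYYDVT
    # Last 9 nt of the gene sequence, ending in the TGA stop codon.
    print(translate("CATAATTGA"))  # HN
```

The two results match the first ten residues (MAKPDYYDVT) and the final two residues (HN) of the listed protein sequence.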