Gene Information |
Locus tag | GSU2806 |
Symbol | nifEN |
ID | 2686928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 3084293 |
End bp | 3087052 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637127496 |
Product | bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN |
Protein accession | NP_953850 |
Protein GI | 39997899 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE; [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
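The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3084293
end_bp = 3087052
reported_gene_length = 2760     # bp
reported_protein_length = 919   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon
# that is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2760 0 919
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.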
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.853873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGAC CCGATTATTA CGACACAACC GACTGCGAAA CCCACGAAAA GGGTGCCCCG AAATTCTGCA AGAAATCGGA ACCGGGGGAG GGCACCGAGC GCTCCTGTGC CTATGACGGC GCCCGGGTGG TGCTCATGCC GGTCACCGAC GTGATCCATC TGGTGCACGG CCCCATCGCC TGCGCCGGCA ACTCCTGGGA TAACCGGGGT GCCCGCTCTT CGGGCTCCCA GCTCTACCGC AGGGGCTTTA CCACCGAGAT GCTGGAAAAC GACGTGATCT TCGGCGGCGA GAAGAAGCTC TACCGGGCCA TCCTGGAGCT GGCGGAGCGC TATGCGGGCC AGGCAAAGGC CATGTTCGTC TATGCCACCT GCGTCACCGC CATGACCGGC GACGACGTGG AGGCGGTCTG CAAGGCGGCC CAGAAAAAGG TCGCCATCCC CCTGATCCCG GTCAACACTC CCGGCTTCAT CGGCGACAAG AACATCGGCA ACCGCCTGGC CGGCGAGGTG CTCCTGAAGC ACGTCATCGG CACCGCCGAG CCGCCGGTGC TGGGAGAGTA TCCCATCAAC CTGATCGGCG AATACAACAT CGCCGGCGAC CTTTGGGGGA TGCTGCCGCT GTTCGACCGG CTCGGTATCC AGATTCTCTC CTGTTTCAGC GGTGACGCCA AATTCGAGGA TCTCCGCTAC GCCCACCGGG CGAAGCTGAA CGTCATCATC TGCTCCAAGA GTCTCACCAA CCTCGCCAGG AAGATGCAGA AGAACTACGG CATGCCCTAC CTGGAGGAGT CCTTTTACGG CATGACCGAC ACGGCCAAGG CCCTGCGGGA CATCGCCCGG GAGCTGGATG ACGCCGTGGG CGGGCTGGAG AAGCGGATCA TGCAGGACCG GGTGGAGAAG CTCCTGGAAG AGGAAGAGGC GCAATGCCGG GAGCGGCTTG CCCCCTACCG GGCACGGCTC GAAGGGAAGC GCTCGGTTCT TTTCACCGGC GGGGTCAAGA CCTGGTCCAT GGTGAACGCC CTGCGGGAGC TGGGGGTGGA GATCCTGGCC GCCGGCACCC AGAACTCGAC CCTGGAGGAT TTCTACCGGA TGAAGGCCCT CATGCACCAG GATGCCGGGA TCATCGAGGA CACTTCCAGC GCCGGGCTCC TCAAGGTCAT GTATGACAAG ATGCCCGATC TGATCGTTGC CGGCGGGAAG ACCAAGTTCC TGGCCCTGAA AACCAAGACG CCGTTCCTGG ATATCAACCA TGGGCGTTCC CATCCCTATG CGGGGTACGA GGGGATGGTG ACCTTTGCGA AGCAGCTCGA CCTGACGGTA AACAACCCCA TCTGGCCAGT GTTGAACGCC AAGTCCCCCT GGGAGAAGTC CGATGAGGAA CTGGCTGCGA GCGTGGCCAG GGCGGCCGGC CATGCCCGGG CCTACCTGGA CGAGGACCTG AAGGATTCAC GGGTCAAGGT CCCCACCAAG CCGGCCACGG TGAACCCCCA GAAGAATTCG CCTGCCCTGG GCGCAACGTT GGCTTACCTG GGCATCGACC AGATGCTGGC GCTCCTGCAC GGAGCCCAGG GCTGCTCCAC CTTCATCCGG CTCCAGCTCT CCCGCCACTT CAAGGAGCCC ATCGCCCTCA ACTCCACCGC CATGAGCGAG GACACCGCCA TCTTCGGCGG CTGGGAGAAC CTGAAGGCGG GGCTCGCGCG GGTCATGGAG AAATTCAGGC CAACGGTGGT CGGCGTCATG ACCTCAGGTC TCACGGAAAC CATGGGGGAT GACGTCCGGA GCGCCATCCA CCAGTTCCGG 
GAGGAGCACC CGGAGCATGA CAACGTGCCC GTGGTTTGGG CGTCCACGCC CGACTACTGC GGTTCGCTCC AGGAGGGATA CGCGGCGGCG GTGGAGGCCA TTGTCCGGAG CGTGCCGGAG CCCGGCGCAT CCATCCCGGA CCAGGTGACG GTTCTCCCCG GCGCCCACCT GACCCCCGCC GATGTGGAGG AGGTCCGGGA GATCTGCGAG GCCTTCGGGC TCGATCCCAT CATTGTGCCG GACATCGCCA ATGCCCTGGA CGGCCATATC GACGAAACGG TTTCGCCCCT GTCAACCGGG GGTGTCTCCC CCGAGCGGAT CCGTCAGGCC GGCCGCAGCG CCGCCGCCAT CTTCATCGGC GACTCGCTGG CAAAGGCGGC AGAAGCGCTC ACCACCACCT GCGGCATGCC CAACTACGGT TTCACCTCCC TCACGGGCCT GGCCGAGGTC GACCGCTTCA TGGAAACCCT GTCGGCCATT TCGGGCCGGC CGATCCCGGA GAAGTTCAAC CGCTGGCGGA GCCGGCTCAT GGACGCCATG GTCGACTCCC ACTACCAGTT CGGGCTGAAG AAGGTTACCG TTGCCCTGGA AGGCGACAAC CTGAAGGTTC TGACGAATTT CCTGGCAGGC ATGGGATGCG AGATCCGGGC GGCCATTGCC GCCACCAGGG TCCGGGGGCT GGACGCCCTC CCGGCCGGGG ACGTCTTCGT GGGCGACCTG GAAGACCTGG AGACGACCGC CAAGGGGTGC GACCTGATCG TAGCCAACTC CAACGGCCGC CAGGCCGCGG CCAAACTCGG CATCAAGGCC CATCTGCGGG CCGGGCTTCC GGTTTTCGAC CGCCTCGGCG CCCACCAGAA AATGTGGGTT GGATACCGGG GAACCATGAA TCTGCTGTTC GAAACGGCCA ACCTGTTCCA GGCCAACGCG GGGGATGCCC AGAAGCTGGC CCATAACTGA
|
Protein sequence | MARPDYYDTT DCETHEKGAP KFCKKSEPGE GTERSCAYDG ARVVLMPVTD VIHLVHGPIA CAGNSWDNRG ARSSGSQLYR RGFTTEMLEN DVIFGGEKKL YRAILELAER YAGQAKAMFV YATCVTAMTG DDVEAVCKAA QKKVAIPLIP VNTPGFIGDK NIGNRLAGEV LLKHVIGTAE PPVLGEYPIN LIGEYNIAGD LWGMLPLFDR LGIQILSCFS GDAKFEDLRY AHRAKLNVII CSKSLTNLAR KMQKNYGMPY LEESFYGMTD TAKALRDIAR ELDDAVGGLE KRIMQDRVEK LLEEEEAQCR ERLAPYRARL EGKRSVLFTG GVKTWSMVNA LRELGVEILA AGTQNSTLED FYRMKALMHQ DAGIIEDTSS AGLLKVMYDK MPDLIVAGGK TKFLALKTKT PFLDINHGRS HPYAGYEGMV TFAKQLDLTV NNPIWPVLNA KSPWEKSDEE LAASVARAAG HARAYLDEDL KDSRVKVPTK PATVNPQKNS PALGATLAYL GIDQMLALLH GAQGCSTFIR LQLSRHFKEP IALNSTAMSE DTAIFGGWEN LKAGLARVME KFRPTVVGVM TSGLTETMGD DVRSAIHQFR EEHPEHDNVP VVWASTPDYC GSLQEGYAAA VEAIVRSVPE PGASIPDQVT VLPGAHLTPA DVEEVREICE AFGLDPIIVP DIANALDGHI DETVSPLSTG GVSPERIRQA GRSAAAIFIG DSLAKAAEAL TTTCGMPNYG FTSLTGLAEV DRFMETLSAI SGRPIPEKFN RWRSRLMDAM VDSHYQFGLK KVTVALEGDN LKVLTNFLAG MGCEIRAAIA ATRVRGLDAL PAGDVFVGDL EDLETTAKGC DLIVANSNGR QAAAKLGIKA HLRAGLPVFD RLGAHQKMWV GYRGTMNLLF ETANLFQANA GDAQKLAHN
|
| |
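The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in the coding orientation even though the gene lies on the minus strand. As a rough consistency check, the sketch below translates the first three 10-base blocks and the last 18 bases of the listed gene sequence and compares them with the ends of the listed protein sequence. The codon table is built from the standard NCBI amino-acid ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits. The helper name `translate` is hypothetical and written for this illustration.

```python
from itertools import product

# Standard NCBI ordering: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces between blocks are ignored."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Excerpts copied from the Gene sequence field.
first_blocks = "ATGGCAAGAC CCGATTATTA CGACACAACC"
last_bases = "AAGCTGGCCCATAACTGA"

print(translate(first_blocks))  # MARPDYYDTT
print(translate(last_bases))    # KLAHN*
```

The first result matches the start of the listed protein sequence (MARPDYYDTT) and the second matches its end (KLAHN), followed by the stop codon shown as `*`.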