Gene GSU2806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2806 
SymbolnifEN 
ID2686928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3084293 
End bp3087052 
Gene Length2760 bp 
Protein Length919 aa 
Translation table11 
GC content64% 
IMG OID637127496 
Productbifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN 
Protein accessionNP_953850 
Protein GI39997899 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.853873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGAC CCGATTATTA CGACACAACC GACTGCGAAA CCCACGAAAA GGGTGCCCCG 
AAATTCTGCA AGAAATCGGA ACCGGGGGAG GGCACCGAGC GCTCCTGTGC CTATGACGGC
GCCCGGGTGG TGCTCATGCC GGTCACCGAC GTGATCCATC TGGTGCACGG CCCCATCGCC
TGCGCCGGCA ACTCCTGGGA TAACCGGGGT GCCCGCTCTT CGGGCTCCCA GCTCTACCGC
AGGGGCTTTA CCACCGAGAT GCTGGAAAAC GACGTGATCT TCGGCGGCGA GAAGAAGCTC
TACCGGGCCA TCCTGGAGCT GGCGGAGCGC TATGCGGGCC AGGCAAAGGC CATGTTCGTC
TATGCCACCT GCGTCACCGC CATGACCGGC GACGACGTGG AGGCGGTCTG CAAGGCGGCC
CAGAAAAAGG TCGCCATCCC CCTGATCCCG GTCAACACTC CCGGCTTCAT CGGCGACAAG
AACATCGGCA ACCGCCTGGC CGGCGAGGTG CTCCTGAAGC ACGTCATCGG CACCGCCGAG
CCGCCGGTGC TGGGAGAGTA TCCCATCAAC CTGATCGGCG AATACAACAT CGCCGGCGAC
CTTTGGGGGA TGCTGCCGCT GTTCGACCGG CTCGGTATCC AGATTCTCTC CTGTTTCAGC
GGTGACGCCA AATTCGAGGA TCTCCGCTAC GCCCACCGGG CGAAGCTGAA CGTCATCATC
TGCTCCAAGA GTCTCACCAA CCTCGCCAGG AAGATGCAGA AGAACTACGG CATGCCCTAC
CTGGAGGAGT CCTTTTACGG CATGACCGAC ACGGCCAAGG CCCTGCGGGA CATCGCCCGG
GAGCTGGATG ACGCCGTGGG CGGGCTGGAG AAGCGGATCA TGCAGGACCG GGTGGAGAAG
CTCCTGGAAG AGGAAGAGGC GCAATGCCGG GAGCGGCTTG CCCCCTACCG GGCACGGCTC
GAAGGGAAGC GCTCGGTTCT TTTCACCGGC GGGGTCAAGA CCTGGTCCAT GGTGAACGCC
CTGCGGGAGC TGGGGGTGGA GATCCTGGCC GCCGGCACCC AGAACTCGAC CCTGGAGGAT
TTCTACCGGA TGAAGGCCCT CATGCACCAG GATGCCGGGA TCATCGAGGA CACTTCCAGC
GCCGGGCTCC TCAAGGTCAT GTATGACAAG ATGCCCGATC TGATCGTTGC CGGCGGGAAG
ACCAAGTTCC TGGCCCTGAA AACCAAGACG CCGTTCCTGG ATATCAACCA TGGGCGTTCC
CATCCCTATG CGGGGTACGA GGGGATGGTG ACCTTTGCGA AGCAGCTCGA CCTGACGGTA
AACAACCCCA TCTGGCCAGT GTTGAACGCC AAGTCCCCCT GGGAGAAGTC CGATGAGGAA
CTGGCTGCGA GCGTGGCCAG GGCGGCCGGC CATGCCCGGG CCTACCTGGA CGAGGACCTG
AAGGATTCAC GGGTCAAGGT CCCCACCAAG CCGGCCACGG TGAACCCCCA GAAGAATTCG
CCTGCCCTGG GCGCAACGTT GGCTTACCTG GGCATCGACC AGATGCTGGC GCTCCTGCAC
GGAGCCCAGG GCTGCTCCAC CTTCATCCGG CTCCAGCTCT CCCGCCACTT CAAGGAGCCC
ATCGCCCTCA ACTCCACCGC CATGAGCGAG GACACCGCCA TCTTCGGCGG CTGGGAGAAC
CTGAAGGCGG GGCTCGCGCG GGTCATGGAG AAATTCAGGC CAACGGTGGT CGGCGTCATG
ACCTCAGGTC TCACGGAAAC CATGGGGGAT GACGTCCGGA GCGCCATCCA CCAGTTCCGG
GAGGAGCACC CGGAGCATGA CAACGTGCCC GTGGTTTGGG CGTCCACGCC CGACTACTGC
GGTTCGCTCC AGGAGGGATA CGCGGCGGCG GTGGAGGCCA TTGTCCGGAG CGTGCCGGAG
CCCGGCGCAT CCATCCCGGA CCAGGTGACG GTTCTCCCCG GCGCCCACCT GACCCCCGCC
GATGTGGAGG AGGTCCGGGA GATCTGCGAG GCCTTCGGGC TCGATCCCAT CATTGTGCCG
GACATCGCCA ATGCCCTGGA CGGCCATATC GACGAAACGG TTTCGCCCCT GTCAACCGGG
GGTGTCTCCC CCGAGCGGAT CCGTCAGGCC GGCCGCAGCG CCGCCGCCAT CTTCATCGGC
GACTCGCTGG CAAAGGCGGC AGAAGCGCTC ACCACCACCT GCGGCATGCC CAACTACGGT
TTCACCTCCC TCACGGGCCT GGCCGAGGTC GACCGCTTCA TGGAAACCCT GTCGGCCATT
TCGGGCCGGC CGATCCCGGA GAAGTTCAAC CGCTGGCGGA GCCGGCTCAT GGACGCCATG
GTCGACTCCC ACTACCAGTT CGGGCTGAAG AAGGTTACCG TTGCCCTGGA AGGCGACAAC
CTGAAGGTTC TGACGAATTT CCTGGCAGGC ATGGGATGCG AGATCCGGGC GGCCATTGCC
GCCACCAGGG TCCGGGGGCT GGACGCCCTC CCGGCCGGGG ACGTCTTCGT GGGCGACCTG
GAAGACCTGG AGACGACCGC CAAGGGGTGC GACCTGATCG TAGCCAACTC CAACGGCCGC
CAGGCCGCGG CCAAACTCGG CATCAAGGCC CATCTGCGGG CCGGGCTTCC GGTTTTCGAC
CGCCTCGGCG CCCACCAGAA AATGTGGGTT GGATACCGGG GAACCATGAA TCTGCTGTTC
GAAACGGCCA ACCTGTTCCA GGCCAACGCG GGGGATGCCC AGAAGCTGGC CCATAACTGA
 
Protein sequence
MARPDYYDTT DCETHEKGAP KFCKKSEPGE GTERSCAYDG ARVVLMPVTD VIHLVHGPIA 
CAGNSWDNRG ARSSGSQLYR RGFTTEMLEN DVIFGGEKKL YRAILELAER YAGQAKAMFV
YATCVTAMTG DDVEAVCKAA QKKVAIPLIP VNTPGFIGDK NIGNRLAGEV LLKHVIGTAE
PPVLGEYPIN LIGEYNIAGD LWGMLPLFDR LGIQILSCFS GDAKFEDLRY AHRAKLNVII
CSKSLTNLAR KMQKNYGMPY LEESFYGMTD TAKALRDIAR ELDDAVGGLE KRIMQDRVEK
LLEEEEAQCR ERLAPYRARL EGKRSVLFTG GVKTWSMVNA LRELGVEILA AGTQNSTLED
FYRMKALMHQ DAGIIEDTSS AGLLKVMYDK MPDLIVAGGK TKFLALKTKT PFLDINHGRS
HPYAGYEGMV TFAKQLDLTV NNPIWPVLNA KSPWEKSDEE LAASVARAAG HARAYLDEDL
KDSRVKVPTK PATVNPQKNS PALGATLAYL GIDQMLALLH GAQGCSTFIR LQLSRHFKEP
IALNSTAMSE DTAIFGGWEN LKAGLARVME KFRPTVVGVM TSGLTETMGD DVRSAIHQFR
EEHPEHDNVP VVWASTPDYC GSLQEGYAAA VEAIVRSVPE PGASIPDQVT VLPGAHLTPA
DVEEVREICE AFGLDPIIVP DIANALDGHI DETVSPLSTG GVSPERIRQA GRSAAAIFIG
DSLAKAAEAL TTTCGMPNYG FTSLTGLAEV DRFMETLSAI SGRPIPEKFN RWRSRLMDAM
VDSHYQFGLK KVTVALEGDN LKVLTNFLAG MGCEIRAAIA ATRVRGLDAL PAGDVFVGDL
EDLETTAKGC DLIVANSNGR QAAAKLGIKA HLRAGLPVFD RLGAHQKMWV GYRGTMNLLF
ETANLFQANA GDAQKLAHN