Gene GSU2820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2820 
SymbolnifD 
ID2687142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3102400 
End bp3103839 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content61% 
IMG OID637127510 
Productnitrogenase molybdenum-iron protein, alpha chain 
Protein accessionNP_953864 
Protein GI39997913 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAATG CAGCCAAAAG AGTTGAAGGA ATAACCAAAG AGTCGACTCA GGAGATGATC 
GACAAGGTGT TGGAGGTTTA CCCGGAAAAA GGGCGTAAGA AGCGCGCTCC ACACCTGGCC
CCCAACGACG GCGTCAGTTC CTGCGTAAAG TCCAACCGCA AGACCGTTCC CGGCGTCATG
AGCGCCCGGG GCTGCGCCTA CGCCGGCGCC AAGGGGGTCG TCTGGGGCCC CATCCGCGAC
ATGGTTCACA TCTCCCATGG TCCGGTGGGG TGCGGCTGGT ACTCCTGGGG TACTCGCCGC
AACCTGATGA GCGGCATCAA CGGGGTCACC AACTTCGGCA TGCAGTTCAC GTCGGATTTC
CAGGAAAAGG ATATCGTCTA CGGCGGGGAC AAGAAGTTGA AGCAACTGCT GGAGGAGGCC
AAGGGACTGT TCCCGCTGGC CAAGGGGATT TCGGTCCTGT CCGAGTGTCC GGTGGGCCTC
ATCGGCGACG ACATCAACGC CGTGGCCAAG CAGTCCGCCA AGGAGCTGGA CATCCCGGTC
ATACCCTGTA ACTGCGAAGG GTTCCGCGGG GTCAGCCAGT CCCTGGGCCA CCACATTTCC
AACGACACCA TCCGCGACTT CATCATCGGC ACCCGGGAGT TTGCCGAGCC CGAGAGCCCG
TACGACATCG CCCTTATCGG CGACTACAAC ATCGGCGGTG ACGTCTGGAG CGCCAAGGCG
CTGCTGGAGG AGATCGGCCT CAACGTGAAG GCCACCTGGA CCGGCGACGG CGAGTTGGAC
CGGATCGCCG CCACCCACAC GGTGAAGCTC AACCTCATCC ACTGCTACCG CTCCATGAAC
TACATGTGCC GCGTGATGGA AGAGAAGTAC GGCATCCCCT GGCTGGAGTT CAACTTCTTC
GGCCCCAGCA AGATCAAGGA GAGCCTGCGC GCGATTGGTG AGCGGTTCGA CGACAGGATC
AAGGAGAACG TGGAAAAGGT CATTGCCAAG TACGACCCGA TCATGCAGGC GGTGATCGAC
GAGTATCGGC CGCGGCTCGA AGGGAAGAAG GTCATGATCT ACGTCGGCGG TCTCCGTCCC
CGCCACACCG TGGGCGCCTA CGAGGACCTG GGGATGGTGG TTGTCGGCAC CGGCTACGAG
TTCGCCCACG GCGACGACTA CGAGCGGACC TCCCCCGAGA TGCCGGTCGA TACGGTCATT
TTTGACGATG CCTCCGAGTT CGAGCTGGAG AAGTTCGCCC ATCAGATCAA GCCCGACCTG
GTTGCCTCCG GCATCAAAGA GAAGTACGTG TTCCAGAAAA TGGGGCTTCC CTTCCGCCAG
ATGCACAGCT GGGACTACTC CGGTCCCTAT CACGGCTACC AGGGATTCCC GATCTTTGCC
CGGGACATTG ACATGGCGGT GAATAGCCCC ACGTGGTCGC TGATCAAGTC GCCGTTCTAG
 
Protein sequence
MSNAAKRVEG ITKESTQEMI DKVLEVYPEK GRKKRAPHLA PNDGVSSCVK SNRKTVPGVM 
SARGCAYAGA KGVVWGPIRD MVHISHGPVG CGWYSWGTRR NLMSGINGVT NFGMQFTSDF
QEKDIVYGGD KKLKQLLEEA KGLFPLAKGI SVLSECPVGL IGDDINAVAK QSAKELDIPV
IPCNCEGFRG VSQSLGHHIS NDTIRDFIIG TREFAEPESP YDIALIGDYN IGGDVWSAKA
LLEEIGLNVK ATWTGDGELD RIAATHTVKL NLIHCYRSMN YMCRVMEEKY GIPWLEFNFF
GPSKIKESLR AIGERFDDRI KENVEKVIAK YDPIMQAVID EYRPRLEGKK VMIYVGGLRP
RHTVGAYEDL GMVVVGTGYE FAHGDDYERT SPEMPVDTVI FDDASEFELE KFAHQIKPDL
VASGIKEKYV FQKMGLPFRQ MHSWDYSGPY HGYQGFPIFA RDIDMAVNSP TWSLIKSPF