Gene GSU2819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2819 
SymbolnifK 
ID2686860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3100763 
End bp3102232 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content62% 
IMG OID637127509 
Productnitrogenase molybdenum-iron protein, beta subunit 
Protein accessionNP_953863 
Protein GI39997912 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACC AACTCGGACT CGCCGTCAAG CCGGTCACCG AATACGATGA CGCAGAAGTA 
AAGAGAGTCG CCGAATGGAT CAACACTGAA GAGTACAAGG AGAAGAACTT CGCCCGCCAG
GCCCTGGTGA TCAACCCGGC CCACGCCTGT CAGCCCCTGG GGGCCGAACT GGTGGCCCAC
GCCTTCGAGG GGACCCTGCC ATTCGTTCAC GGTTCCCAGG GGTGCGCCTC CTACTACCGC
TCCACCCTCA ACCGGCACTT CCGGGAGCCG GCGCCGGCCG TTTCCGATGC CATGACCGAG
GACGGCGCCG TGTTCGGCGG CCAGAACAAC CTCCACGAGG GGCTGGAAAA CGCCATCGCT
CTCTACAAGC CCAAGATGGT CGCCGTCTTC ACCTCGTGCA TGCCGGAGAT CATCGGCGAC
GACCTGACCG CGTTCCTGAA GAACGCCCGC AACAAGGGGA TCATCCCGGC GGACATGCCG
ACCCCGTACG CCAACACCCC GAGCTTCAAC GGCTCACACA TCCACGGCTA CGACGCCATG
CTCCTTTCCA TCCTGCAGAC TCTGACCGCG GGCAAACAGG TGGAGGGTCG CTGCACGGGC
AAGCTTAACC TGATCCCCGG CTTCGACGCC AATACCGGCA ACTTCAGGGA GTACAAGCGG
ATTCTCGAGG CCTTCGGCAT TCCCTACACC ATCCTCGGCG ACATCTCCGA CGTGTTCGAT
TCGCCCCTGG ACGGCACGTA CCGCCCCTAT CCGGGCGGCA CCACGCTGGA TGACGCCGCC
GACTCCATCA ACGGCAAGGC CACCCTCAAC CTGGGGCCCT ATTCGGCGGC AAAGACCTTC
TCTTGGGTTA AAGACTCCTA TTCCGGTAAG CATGCGTCCC TTCCCATGCC CATGGGAGTC
ACCAAGACCG ACGACTTCCT CAAGAAGCTG TCGGAGCTCT TCGGCAAGCC GGTCCCCGAG
AGTCTGAAGG AGGAGCGGGG CCGGGCCGTG GACGCCATGA CCGATGCCCA CCAGTACATC
CACAACAAAA AGTTCGCCGT CTACGGCGAT CCCGACCAGC TCCTCGGCTA CGTCTCCTTT
CTGCTGGAGA TGGGCGCCAA GCCCTATCAC ATCCTCTGCA GCAAGGGGAC AAAGAAGCTG
GAGAAGGAAA TCCAGGCGTT GCTCGATACC TCTCCCTACG GCGCCGGCTG CAAGATCTAC
ATCAACAAGG ATCTCTGGCA CATGCGGAGC CTGCTCATGA CCGACCCGGT GGACGCCATG
ATCGGTGACA CCCACGGCAA GTTCGCGGCC CGCGACGCCG GTATCCCGCT TTTCCGCTTC
GGCTTCCCGA TCTTCGACCG GGTCAACAAG CACCGCTACC CGATCATCGG CTACCAGGGC
GTGGTCAATA TGCTGACCGA GATCTGCAAC AAGTTCCTCG ACATCACCGA CGAGACTTGT
GAGGACCGGT TCTTCGAGAT GATGCGGTAA
 
Protein sequence
MSNQLGLAVK PVTEYDDAEV KRVAEWINTE EYKEKNFARQ ALVINPAHAC QPLGAELVAH 
AFEGTLPFVH GSQGCASYYR STLNRHFREP APAVSDAMTE DGAVFGGQNN LHEGLENAIA
LYKPKMVAVF TSCMPEIIGD DLTAFLKNAR NKGIIPADMP TPYANTPSFN GSHIHGYDAM
LLSILQTLTA GKQVEGRCTG KLNLIPGFDA NTGNFREYKR ILEAFGIPYT ILGDISDVFD
SPLDGTYRPY PGGTTLDDAA DSINGKATLN LGPYSAAKTF SWVKDSYSGK HASLPMPMGV
TKTDDFLKKL SELFGKPVPE SLKEERGRAV DAMTDAHQYI HNKKFAVYGD PDQLLGYVSF
LLEMGAKPYH ILCSKGTKKL EKEIQALLDT SPYGAGCKIY INKDLWHMRS LLMTDPVDAM
IGDTHGKFAA RDAGIPLFRF GFPIFDRVNK HRYPIIGYQG VVNMLTEICN KFLDITDETC
EDRFFEMMR