Gene Rsph17025_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1248 
Symbol 
ID5084421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1289700 
End bp1291220 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content64% 
IMG OID640482806 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_001167454 
Protein GI146277295 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.836121 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCAAT CCGCCGAGAA CATTCTCGAC CACAAGGATC TGTTCAAGGA ACCCGAATAC 
CAGGCGATGC TGGAGAAGAA GCGCGCCACC TACGAGAACG CGACCCCGGC CGAGAAGGTC
GAGGAAGTCG CCGACTGGAC GAAGTCCTGG GACTACCGCG AGAAGAACCT CGCCCGCTCC
TGCGTCACGA TCAACCCGGC CAAGGCCTGC CAGCCGCTCG GCGCCGTCTT CGCCGCCGCG
GGCTATGACA GCACCATGAG CTTCGTGCAC GGCTCGCAGG GATGCGTGGC CTACTACCGC
TCGCACCTCG CCCGCCACTT CAAGGAGCCC TCGTCGGCGG TCTCGTCCTC GATGACCGAG
GATGCGGCGG TGTTCGGCGG CCTGAACAAC ATGATCGAGG GCCTGGCCAA CACCTACGCG
CTCTACAGCC CGAAGATGAT CGCGGTTTCG ACCACCTGCA TGGCCGAAGT CATCGGCGAC
GACCTCAACT CCTTCATCCT GAAGTCGAAG GAAAAGGAAA GCGTCCCGGC CGACTTCCCG
GTGCCCTTCG CCCACACCCC GGCCTTCGTG GGCAGCCACG TCGATGGCTA CGACAACATG
CAGAAGGGCA TCCTGTCCTG CTTCTGGAAG GACGCGCCCC GCACCGCCGG CGAGGGGATC
AACATCATCC CCGGCTTCGA TGGCTATGTG GTCGGCAACA TCCGCGAGAT GAAGCGGATG
CTGGGCCTGA TGGGCGTCGA GGCGACCGTT CTGGGCGATG CCTCGGACGT CTATGACACG
CCCTCGGACG GCGAGTTCCG CATGTATGCC GGCGGCACCA CGCAGGAGGA GATCAAGGCG
GGCCTGAACG CCAAGGCCAC GATCTCGCTG CAGGAATATT GCACCCGCAA GACGCTCGCC
TTCTGCGAGG AAGTCGGGCA GCAGACCGCC TCGTTCCACT ATCCGATGGG TGTGCAGGCC
ACCGACGAGT TCCTGATGAA GGTCGCCGAG CTGACCGGCA AGGAAATCCC GGAACAGCTG
CGGCTGGAGC GTGGCCGTCT GATCGACGCC ATGGCCGACA GCCAAGCTTA TCTGCACGGC
AAGACCTACG CGATCTACGG CGACCCGGAC TTCGTCCATG CGATGGCCCG TTTCGTGATG
GAGATGGGCG GCGAGCCGAA GCACTGCCTC GCCACCAACG GCGGCAAGGA CTGGGAAGCG
CAAATGATGG CGCTGCTGGC CTCGTCGCCC TTCGGCGAAG GCTGCCAGGT CTGGGCGGGC
AAGGACCTGT GGCACCTGCG CTCGATCCTT GCGACCGAGC CTGCGGACCT GCTGATCGGC
AACAGCTATG GCAAGTATCT GGAGAAGGAC TGCAACATCC CGCTGATCCG CCTGACCTTC
CCGATCTTCG ACCGTCACCA CCACCACCGC TTCCCGACCT TCGGCTATCA GGGCGCGATC
GGGGTGCTGG TGAAGATCCT CGACACGATC TTCGACAAGC TCGACACGGA ATCGGACATC
TCGTTCGACC TGACCCGCTG A
 
Protein sequence
MPQSAENILD HKDLFKEPEY QAMLEKKRAT YENATPAEKV EEVADWTKSW DYREKNLARS 
CVTINPAKAC QPLGAVFAAA GYDSTMSFVH GSQGCVAYYR SHLARHFKEP SSAVSSSMTE
DAAVFGGLNN MIEGLANTYA LYSPKMIAVS TTCMAEVIGD DLNSFILKSK EKESVPADFP
VPFAHTPAFV GSHVDGYDNM QKGILSCFWK DAPRTAGEGI NIIPGFDGYV VGNIREMKRM
LGLMGVEATV LGDASDVYDT PSDGEFRMYA GGTTQEEIKA GLNAKATISL QEYCTRKTLA
FCEEVGQQTA SFHYPMGVQA TDEFLMKVAE LTGKEIPEQL RLERGRLIDA MADSQAYLHG
KTYAIYGDPD FVHAMARFVM EMGGEPKHCL ATNGGKDWEA QMMALLASSP FGEGCQVWAG
KDLWHLRSIL ATEPADLLIG NSYGKYLEKD CNIPLIRLTF PIFDRHHHHR FPTFGYQGAI
GVLVKILDTI FDKLDTESDI SFDLTR