Gene Rsph17029_2190 details

Gene Information

Locus tag: Rsph17029_2190
Symbol:
ID: 4895849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2320133
End bp: 2321683
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 64%
IMG OID: 640112784
Product: nitrogenase molybdenum-iron protein beta chain
Protein accession: YP_001044065
Protein GI: 126462951
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01286] nitrogenase molybdenum-iron protein beta chain
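
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(2320133, 2321683)
print(length, protein_length(length))
```

With the values from this record, the sketch reproduces the listed gene length (1551 bp) and protein length (516 aa).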


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0387614
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.12949
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGACGG CCAGCAGAAG GATCATGCTC ATGCCGCAGT CGGCCGAAAA GGTTCTGGAT 
CACAAGGATC TGTTCAAGGA ACCCGAATAT CAGGCGATGC TCGAGAAGAA GCGCGCCACC
TACGAGAATG CGACGCCCGC CGAGACGGTG GCCGAAACCG CGGACTGGAC GAAGTCCTGG
GACTATCGCG AGAAGAACCT CGCCCGCTCC TGCGTGACCA TCAACCCGGC CAAGGCCTGC
CAGCCGCTGG GCGCGGTCTT CGCCGCCGCC GGCTATGACA GCACCATGAG CTTCGTGCAC
GGCTCGCAGG GCTGCGTGGC CTACTATCGC TCGCACCTCG CCCGCCACTT CAAGGAGCCG
TCCTCGGCGG TGTCCTCCTC GATGACCGAG GATGCGGCGG TGTTCGGCGG CCTGAACAAC
ATGGTGGAGG GCCTCGCCAA CACCTATGCG CTCTATTCGC CGAAGATGAT CGCGGTTTCC
ACCACCTGCA TGGCGGAAGT CATCGGCGAC GACCTCAACT CCTTCATCAT CAAGTCGAAG
GAAAAGGAAA GCGTCCCGGC CGACTTCCCG GTGCCCTTCG CCCATACGCC GGCCTTCGTG
GGCAGCCACG TCGACGGCTA CGACAACATG CAGAAGGGCA TCCTGTCGAA CTTCTGGAAG
GACGCGCCGC GCACCGCGGG CGAAGGCCTG AACATCATCC CGGGCTTTGA CGGCTACTGC
GTGGGCAACG TCCGCGAGAT GAAGCGCATG CTCGGCCTGA TGGGCGTCGA GGCGACCGTT
CTGGGCGATG CCTCGGATGT CTACGACACC CCCTCCGATG GCGAATACCG CATGTATGCG
GGCGGCACCA CGCAGGAGGA GATCAAGGAG GCCCTGAACG CGAAGGCCAC CCTCTCGCTG
CAGGAATATT GCACCCGCAG GACGCTCGCC TTCTGCGAGG AAGTGGGCCA GGAAACCGCC
TCGTTCCACT ATCCGATGGG CGTCAAGGCC ACCGACGAGT TCTTGATGAA GGTCTCGGAC
CTGACCGGCA AGGAAATCCC GGAAGCGCTC CGCCTCGAGC GCGGCCGCCT GATCGACGCC
ATGGCCGACA GCCAGGCCTA TCTGCACGGC AAGACCTACG CCATCTTCGG CGATCCCGAC
TTCGTCTATG CGATGGCCCG CTTCGTGATG GAGATGGGCG GCGAGCCGAA GCACTGCCTC
GCCACCAACG GCGGCAAGGA CTGGGAAGTG CAGATGAAGG AGCTGCTGGC CTCCTCGCCC
TTCGGCGAAG GCTGCCAGGT CTGGGCGGGC AAGGACCTCT GGCACCTGCG CTCGATCCTC
GCCACGGAAC CGGCGGACCT GCTGATCGGC AGCAGCTACG GCAAGTATCT CGAGCGCGAC
TGCAACGTGC CGCTGATCCG CCTGACCTTC CCGATCTTCG ACCGCCACCA CCACCACCGC
TTCCCGACCT TCGGCTATCA GGGCGCGATC CAGGTGCTGG TGAAGATCCT CGACAAGATC
TTCGACAAGC TCGACGACGA GTCCGACATC TCGTTCGACC TGACCCGCTG A
 
Protein sequence
MRTASRRIML MPQSAEKVLD HKDLFKEPEY QAMLEKKRAT YENATPAETV AETADWTKSW 
DYREKNLARS CVTINPAKAC QPLGAVFAAA GYDSTMSFVH GSQGCVAYYR SHLARHFKEP
SSAVSSSMTE DAAVFGGLNN MVEGLANTYA LYSPKMIAVS TTCMAEVIGD DLNSFIIKSK
EKESVPADFP VPFAHTPAFV GSHVDGYDNM QKGILSNFWK DAPRTAGEGL NIIPGFDGYC
VGNVREMKRM LGLMGVEATV LGDASDVYDT PSDGEYRMYA GGTTQEEIKE ALNAKATLSL
QEYCTRRTLA FCEEVGQETA SFHYPMGVKA TDEFLMKVSD LTGKEIPEAL RLERGRLIDA
MADSQAYLHG KTYAIFGDPD FVYAMARFVM EMGGEPKHCL ATNGGKDWEV QMKELLASSP
FGEGCQVWAG KDLWHLRSIL ATEPADLLIG SSYGKYLERD CNVPLIRLTF PIFDRHHHHR
FPTFGYQGAI QVLVKILDKI FDKLDDESDI SFDLTR
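
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons; the start codon here is ATG, so that difference does not matter). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record (60 bases, 20 codons).
first_line = "ATGAGGACGG CCAGCAGAAG GATCATGCTC ATGCCGCAGT CGGCCGAAAA GGTTCTGGAT"
print(translate(first_line))
```

Applied to the first line of the gene sequence, this yields the first 20 residues of the protein sequence shown above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content listed in the Gene Information block.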