Gene Rsph17029_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2191 
Symbol 
ID4895103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2321754 
End bp2323235 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content63% 
IMG OID640112785 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_001044066 
Protein GI126462952 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.395448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.437472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAG ATATCGCTGA CTCTGCCGAG ACCAACATGA AGCTGATCGA GGAGGTGCTG 
GCCGCCTACC CCGACAAGGC CAGGAAGAAG CGCGCCAAGC ACCTGAATGT CGCAGCGCCC
GTCGCCGAGG CCGAACCCGG CCTCCAGTCG AGATGCGACA ATGTGAAATC GAACATCAAG
TCGGTCCCCG GCGTGATGAC CATCCGCGGC TGCGCCTATG CCGGCTCGAA GGGCGTGGTC
TGGGGCCCGG TCAAGGACAT GCTGCACATC AGCCACGGCC CGGTCGGCTG CGGCCACTAC
AGCTGGTCCC AGCGCCGCAA CTACTACACC GGCACGACGG GCGTGGATTC GTTCGTGACG
ATGCAGGTCA CCACCGACTT CCAGGAAAAC GACATCGTCT TCGGCGGTGA CAAGAAGCTG
GAAAAGACCA TCGACGAGCT GAACATGCTC TTCCCGCTGA ACAAGGGGAT CTCGATCCAG
TCGGAATGCC CGATCGGCCT GATCGGCGAC GACATCGAGG CGGTGTCGAA GAAGAAGGCC
AAGGACATCG GCAAGCGCGT CGTTCCGGTG CGCTGCGAGG GATTCCGCGG CGTGTCGCAG
TCGCTCGGCC ACCATATCGC GAACGACATG ATCCGCGACT GGGTGCTGGA AGCGGGCGAG
GGCGCGCGCG CGGGCTACGA GCCCGGCCCC TATGACGTGA ACATCATCGG CGACTACAAC
ATCGGCGGCG ACGCCTGGTC GAGCCGGATC CTGCTGGAAG AGATCGGCCT CAACGTCATC
GCGCAGTGGT CGGGCGACGC GACCATCGCC GAGATGGAGC GCGCTCCGGC GGCCAAGCTG
AACCTCATCC ACTGCTACCG CTCGATGAGC TACATCTGCC GGCACATGGA AGAGAACCAC
GGCGTGCCGT GGATGGAATA CAACTTCTTC GGCCCCTCTC AGATCGCGGC CTCGCTGCGC
GCCATCGCCG CGAAGTTCGA CGACAGGATC CAGGCCAATG CCGAAGCGGT CATCGCGAAA
TACCAGCCGC TCGTCGACGC GGTGAACGCG AAATACAAGC CGCGCCTCGA AGGCAAGAAG
GTGATGCTCT ATGTGGGCGG CCTGCGTCCG CGCCACGTCG TCGACGCCTA CCATGACCTG
GGCATGGAGA TCGTGGGCAC CGGCTACGAA TTCGCCCACA ATGACGACTA CAAGCGCACC
GGCCATTACA TCAAGGAAGG CACGCTGATC TTCGACGACG TCTCGGGCTA CGAGCTGGAG
AAATTCGTCG AGGCGATCCG TCCCGATCTC GTGGGCTCGG GCATCAAGGA GAAATACAAC
ACGCAGAAGA TGGGCATCCC GTTCCGTCAG ATGCACTCCT GGGATTATTC CGGCCCCTAC
CACGGCTACG ACGGCTACGC GATCTTCGCG CGCGACATGG ATCTCGCGAT CAACAACCCC
GTCTGGGGCA TGTTCGACGC GCCCTGGAAG AAGACGGCCT GA
 
Protein sequence
MAKDIADSAE TNMKLIEEVL AAYPDKARKK RAKHLNVAAP VAEAEPGLQS RCDNVKSNIK 
SVPGVMTIRG CAYAGSKGVV WGPVKDMLHI SHGPVGCGHY SWSQRRNYYT GTTGVDSFVT
MQVTTDFQEN DIVFGGDKKL EKTIDELNML FPLNKGISIQ SECPIGLIGD DIEAVSKKKA
KDIGKRVVPV RCEGFRGVSQ SLGHHIANDM IRDWVLEAGE GARAGYEPGP YDVNIIGDYN
IGGDAWSSRI LLEEIGLNVI AQWSGDATIA EMERAPAAKL NLIHCYRSMS YICRHMEENH
GVPWMEYNFF GPSQIAASLR AIAAKFDDRI QANAEAVIAK YQPLVDAVNA KYKPRLEGKK
VMLYVGGLRP RHVVDAYHDL GMEIVGTGYE FAHNDDYKRT GHYIKEGTLI FDDVSGYELE
KFVEAIRPDL VGSGIKEKYN TQKMGIPFRQ MHSWDYSGPY HGYDGYAIFA RDMDLAINNP
VWGMFDAPWK KTA