Gene Rsph17025_1247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1247 
Symbol 
ID5084420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1288119 
End bp1289600 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content63% 
IMG OID640482805 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_001167453 
Protein GI146277294 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.867808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAG ACATTGCAGA TTCTGCGGAA GCCAACATGA AGCTGATCGA GGAGGTGCTT 
GCCGCCTATC CCGACAAGGC CAAGAAGAAA CGCGCCAAGC ACCTTGGCGT CGCCGAGACG
ATTGCCGATG CCGAGCCCGG CATCCAGTCG AAATGCGACA CCGTCAAGTC GAACATCAAG
TCGGTTCCCG GCGTGATGAC CATCCGCGGC TGCGCCTACG CCGGCTCGAA GGGCGTGGTC
TGGGGTCCGG TCAAGGACAT GCTGCACATC AGCCACGGCC CGGTGGGCTG CGGCCACTAC
AGCTGGTCCC AGCGCCGCAA CTACTACACC GGCACCACGG GCATCGACAG CTTCGTGACC
ATGCAGGTCA CCACCGACTT CCAGGAAAAC GACATCGTCT TCGGCGGTGA CAAGAAGCTG
GAAAAGACCA TCGACGAGCT GAACACGCTC TTCCCGCTGA ACAAGGGCAT CTCGATCCAG
TCCGAATGCC CGATCGGCCT GATCGGCGAC GACATCGAGG CGGTGTCGAA GAAGAAGGCC
AAGGACATCG GCAAGCGCGT GATCCCGGTG CGCTGCGAGG GCTTCCGCGG CGTGTCGCAG
TCGCTCGGCC ACCATATCGC GAACGACATG ATCCGCGACT GGGTGCTCGA GGCGGGCGAG
GGCGCGCGGG CGGGTTTCGA GGCCGGCCCC TATGATGTCA ACATCATCGG CGACTACAAC
ATCGGCGGCG ACGCCTGGTC GAGCCGGATC CTCCTGGAAG AGATCGGCCT GAACGTGATC
GCCCAATGGT CGGGCGACGC CACCATCGCC GAGATGGAGC GGGCGCCGGC GGCCAAGCTG
AACCTGATCC ACTGCTACCG CTCCATGAGC TACATCTGCC GGCACATGGA AGAGAAGCAC
GGCGTTCCGT GGATGGAATA CAACTTCTTC GGGCCGAGCC AGATCGCCGC GTCCCTGCGC
GCCATCGCGG CGAAGTTCGA CGCCACCATC CAGGCCAATG CCGAGGCGGT GATCGCGAAA
TATCAGCCGC TCGTCGATGC GGTGAACGCG AAATACAAGC CGCGCCTCGA AGGCAAGAAG
GTGATGCTCT ACGTCGGCGG CCTGCGTCCG CGCCACGTCG TCGATGCCTA CCACGACCTC
GGCATGGAGA TCGTCGGCAC CGGCTACGAG TTCGCCCACA ACGACGACTA CAAGCGCACC
GGCCACTACA TCAAGGAAGG CACGCTGATC TACGACGACG TGTCGGGCTA CGAGCTGGAG
AAGTTCGTCG AGGCGATCCG TCCCGATCTC GTCGGCTCGG GCATCAAGGA GAAGTACAAC
ACGCAGAAGA TGGGCATCCC GTTCCGTCAG ATGCACTCGT GGGACTACTC CGGCCCCTAC
CACGGCTACG ACGGCTACGC GATCTTCGCG CGCGACATGG ATCTTGCGAT CAACAACCCG
GTCTGGGGCA TGTTCGACGC GCCCTGGAAG AAGACGGCCT GA
 
Protein sequence
MAKDIADSAE ANMKLIEEVL AAYPDKAKKK RAKHLGVAET IADAEPGIQS KCDTVKSNIK 
SVPGVMTIRG CAYAGSKGVV WGPVKDMLHI SHGPVGCGHY SWSQRRNYYT GTTGIDSFVT
MQVTTDFQEN DIVFGGDKKL EKTIDELNTL FPLNKGISIQ SECPIGLIGD DIEAVSKKKA
KDIGKRVIPV RCEGFRGVSQ SLGHHIANDM IRDWVLEAGE GARAGFEAGP YDVNIIGDYN
IGGDAWSSRI LLEEIGLNVI AQWSGDATIA EMERAPAAKL NLIHCYRSMS YICRHMEEKH
GVPWMEYNFF GPSQIAASLR AIAAKFDATI QANAEAVIAK YQPLVDAVNA KYKPRLEGKK
VMLYVGGLRP RHVVDAYHDL GMEIVGTGYE FAHNDDYKRT GHYIKEGTLI YDDVSGYELE
KFVEAIRPDL VGSGIKEKYN TQKMGIPFRQ MHSWDYSGPY HGYDGYAIFA RDMDLAINNP
VWGMFDAPWK KTA