Gene RPD_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1074 
Symbol 
ID4021550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1225007 
End bp1226470 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content61% 
IMG OID637961266 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_568213 
Protein GI91975554 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.811234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG CAGTCGCAGA ATCTCCCGCG GACCTCAAGG AACGCAACAA GCAGCTGATC 
GGCGAAGTCC TGGAAGCCTA TCCGGACAAG TCCGCCAAGC GCCGCGCCAA GCACCTCAAC
ACCTACGAGG CCGAAAAGGC CGAGTGCTCG GTCAAGTCCA ACATCAAGTC GATCCCGGGC
GTGATGACGA TCCGCGGCTG CGCCTACGCC GGCTCGAAGG GCGTGGTGTG GGGTCCGATC
AAGGACATGG TCCACATCAG CCACGGCCCG GTCGGCTGCG GCCAGTATTC CTGGGGCTCG
CGCCGCAACT ATTACAAGGG CGTCACCGGC ATCGACACCT TCGGCACGAT GCAGTTCACC
TCCGATTTCC AGGAGAAGGA CATCGTCTTC GGTGGCGACA AGAAGCTCGG CAAGATCATC
GACGAAATTC AGGACCTGTT CCCGCTGAAC CGCGGCATTT CGGTGCAATC GGAATGCCCG
ATCGGTCTGA TCGGCGACGA CATCGAGGCG GTCTCCAAGG CCAAGACGAA ACAGTATGAC
GGCAAGCCTA TCATCCCGGT GCGCTGCGAA GGCTTCCGCG GCGTGTCGCA GTCGCTCGGC
CATCACATCG CCAACGACGT GATCCGCGAC TGGGTGTTCG ACAAGGCCGG CGACAAGGTC
GCCACGTTCG AATCGACGCC CTACGACGTC GCGATCATCG GCGACTACAA CATCGGCGGC
GATGCCTGGG CCTCGCGCAT CCTGCTTGAG GAGATGGGGC TCCGCGTGAT CGCGCAGTGG
TCCGGCGACG GCACCATCGC CGAGCTCGAG AACACTCCAA AAGCGAAGCT GAACATCCTG
CACTGCTACC GCTCGATGAA CTACATCACG CGGCACATGG AAGAGAAGTT CGGGATCCCG
TGGGTCGAGT ACAATTTCTT CGGTCCGACC AAGATCGAAG CCTCGTTGCG CGAGATCGCT
GCGAAGTTCG ACGACAAGAT CAAGGAAGGC GCCGAGCGCG TCATCGCCAA GTACAAGCCG
CGGATGCAGG CGATCGTCGA TCGCTATCGT CCGCGCCTGG AAGGCAAGAA GGTGATGCTC
TATGTCGGCG GCCTGCGTCC GCGCCACGTG ATCGGCGCCT ATGAAGACCT CGGCATGGAA
GTGGTCGGCA CCGGCTATGA ATTCGGCCAC AACGACGACT ATCAGCGCAC CACCCACTAC
GTGAAGGACG GCACGCTGAT CTACGACGAC GTCACCGGCT ACGAATTCGA GAAATTCGTG
GAGAAGGTGC GGCCCGATCT GGTCGGCTCC GGCATCAAGG AAAAGTACAT CTTCCAGAAG
ATGGGTGTGC CGTTCCGCCA GATGCATTCG TGGGACTATT CCGGCCCGTA TCACGGCTAT
GACGGCTTCG CCATCTTCGC CCGTGACATG GACATCGCCA TCAACGCTCC GATCTGGAAG
CTGACCAAGG CACCTTGGAG CTGA
 
Protein sequence
MSTAVAESPA DLKERNKQLI GEVLEAYPDK SAKRRAKHLN TYEAEKAECS VKSNIKSIPG 
VMTIRGCAYA GSKGVVWGPI KDMVHISHGP VGCGQYSWGS RRNYYKGVTG IDTFGTMQFT
SDFQEKDIVF GGDKKLGKII DEIQDLFPLN RGISVQSECP IGLIGDDIEA VSKAKTKQYD
GKPIIPVRCE GFRGVSQSLG HHIANDVIRD WVFDKAGDKV ATFESTPYDV AIIGDYNIGG
DAWASRILLE EMGLRVIAQW SGDGTIAELE NTPKAKLNIL HCYRSMNYIT RHMEEKFGIP
WVEYNFFGPT KIEASLREIA AKFDDKIKEG AERVIAKYKP RMQAIVDRYR PRLEGKKVML
YVGGLRPRHV IGAYEDLGME VVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV
EKVRPDLVGS GIKEKYIFQK MGVPFRQMHS WDYSGPYHGY DGFAIFARDM DIAINAPIWK
LTKAPWS