Gene Rpal_5100 details

Gene Information

Locus tag: Rpal_5100
Symbol:
ID: 6412794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5485218
End bp: 5486681
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 61%
IMG OID: 642714985
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_001994064
Protein GI: 192293459
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
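
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes a single terminal stop codon; both are assumptions about how this record was generated, and the function name is hypothetical.

```python
def check_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes inclusive 1-based coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length, codons - 1


# Values taken from the record above.
gene_length, protein_length = check_lengths(5485218, 5486681)
print(gene_length, protein_length)  # 1464 487
```

Under those assumptions the result agrees with the listed Gene Length (1464 bp) and Protein Length (487 aa).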


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCG CAGTCGCAGA ATCCCCCGCG GACATCAAGG AACGTAACAA GAAGCTGATC 
GGCGAAGTCC TGGAGGCCTA TCCGGACAAG TCGGCCAAGC GTCGCGCCAA GCATCTCAAT
ACGTACGACG CCGAAAAGGC GGAGTGCTCG GTCAAGTCCA ACATCAAGTC GATCCCGGGC
GTGATGACGA TCCGCGGTTG CGCCTACGCC GGCTCGAAGG GCGTGGTATG GGGCCCGATC
AAGGACATGG TCCACATCAG CCACGGTCCG GTCGGCTGCG GCCAGTATTC CTGGGGCTCG
CGCCGCAACT ACTACAAGGG CACCACCGGC GTCGACACCT TCGGCACGAT GCAGTTCACC
TCCGACTTCC AGGAGAAGGA CATCGTTTTC GGCGGTGACA AGAAGCTCGG CAAGATCATC
GACGAGATCC AGGAGCTGTT CCCGCTCTCC AAGGGCATCT CGGTGCAGTC GGAATGCCCG
ATCGGTCTGA TCGGTGACGA CATCGAGGCG GTCTCGAAGG CCAAGTCGAA GCAGTACGAC
GGCAAGCCGA TCATCCCGGT GCGCTGCGAA GGCTTCCGCG GCGTGTCGCA GTCGCTCGGC
CACCACATTG CCAACGACGT GATCCGTGAC TGGGTGTTCG ACAAGGCTGG CGAGAAGAAT
GCCGGCTTCC AGTCGACCCC CTACGACGTC GCGATCATCG GCGACTACAA TATCGGCGGC
GACGCCTGGG CCTCGCGCAT CCTGCTCGAG GAGATGGGCC TCCGCGTGAT CGCGCAGTGG
TCCGGCGACG GCACCATTGC CGAGCTGGAG AATACCCCGA AGGCGAAGCT GAACATCCTG
CACTGCTACC GCTCGATGAA CTACATCACG CGGCACATGG AAGAGAAGTT CGGGATTCCG
TGGGTGGAAT ACAACTTCTT CGGTCCCACC AAGATCGAAG CGAGCCTGCG CGAGATCGCG
TCGAAGTTCG ACGACAAGAT CAAGGAAGGC GCCGAGCGCG TCATCGCTAA GTACAAGCCG
CAGATGGAAG CGGTGATCGC CAAATATCGC CCGCGCCTCG AAGGCAAGAA GGTGATGCTG
TATGTCGGCG GTCTGCGTCC GCGCCACGTC ATCGGCGCCT ACGAAGATCT CGGCATGGAA
GTGGTCGGAA CCGGCTACGA ATTTGGCCAT AACGACGATT ATCAGCGCAC CACCCATTAC
GTGAAGGACG GCACGCTGAT CTACGACGAC GTCACCGGCT ACGAATTCGA GAAGTTCGTC
GAGAAGGTGC GTCCCGACCT GGTCGGCTCG GGCGTCAAGG AAAAGTACAT CTTCCAGAAG
ATGGGCGTTC CGTTCCGCCA GATGCACTCC TGGGACTACT CGGGCCCGTA TCATGGCTAC
GACGGGTTCG GCATCTTCGC CCGCGACATG GACATCGCGA TCAATGCCCC GGTCTGGAAA
CTGACCAAGG CGCCTTGGAG CTGA
 
Protein sequence
MSTAVAESPA DIKERNKKLI GEVLEAYPDK SAKRRAKHLN TYDAEKAECS VKSNIKSIPG 
VMTIRGCAYA GSKGVVWGPI KDMVHISHGP VGCGQYSWGS RRNYYKGTTG VDTFGTMQFT
SDFQEKDIVF GGDKKLGKII DEIQELFPLS KGISVQSECP IGLIGDDIEA VSKAKSKQYD
GKPIIPVRCE GFRGVSQSLG HHIANDVIRD WVFDKAGEKN AGFQSTPYDV AIIGDYNIGG
DAWASRILLE EMGLRVIAQW SGDGTIAELE NTPKAKLNIL HCYRSMNYIT RHMEEKFGIP
WVEYNFFGPT KIEASLREIA SKFDDKIKEG AERVIAKYKP QMEAVIAKYR PRLEGKKVML
YVGGLRPRHV IGAYEDLGME VVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV
EKVRPDLVGS GVKEKYIFQK MGVPFRQMHS WDYSGPYHGY DGFGIFARDM DIAINAPVWK
LTKAPWS
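
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool used to produce the record: it builds the codon table from the conventional TCAG ordering, relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 assigns amino acids as the standard code does.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGAGCACCG CAGTCGCAGA ATCCCCCGCG GACATCAAGG AACGTAACAA GAAGCTGATC"
last_line = "CTGACCAAGG CGCCTTGGAG CTGA"
print(translate(first_line))  # MSTAVAESPADIKERNKKLI
print(translate(last_line))   # LTKAPWS
```

The two outputs correspond to the first 20 and last 7 residues of the protein sequence shown above.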