Gene Rpal_5099 details

Gene Information

Locus tag: Rpal_5099
Symbol:
ID: 6412793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5483570
End bp: 5485129
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 62%
IMG OID: 642714984
Product: nitrogenase molybdenum-iron protein beta chain
Protein accession: YP_001994063
Protein GI: 192293458
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01286] nitrogenase molybdenum-iron protein beta chain
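
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that start and end are 1-based, inclusive positions on the replicon and that the final codon is a stop codon not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length_bp = gene_span(5483570, 5485129)
print(length_bp, expected_protein_length(length_bp))
```

With the values from this record, the span matches the listed gene length of 1560 bp, and 520 codons minus one stop codon gives the listed 519 aa.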


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAGA CCGCAGAGAA GATCCGCGAT CACTTCGAAC TCTTCCGTGA GCCCCAGTAC 
GAAGAGTTGA TGGAGAACAA GCGGAAGAAT TTCGAGAACT ATGTTGGCGA TGCCGAGGTC
ACGCGCGTCG CGGACTGGAC CAAGACCAAG GAATACCAGG ACAAGAACTT CGCTCGCGAG
GCTCTCGTCA TCAACCCGGC CAAGGCCTGC CAGCCGCTCG GTGCAGTGTT CGCCGCGGTC
GGCTTCGAGA AGACGCTGCC GTTCGTGCAC GGCTCGCAGG GCTGCGTTGC CTATTACCGC
AGCCACTTCA CCCGCCACTT CAAGGAGCCG ACCTCGTGCG TCTCCTCGTC GATGACCGAA
GACGCCGCGG TGTTCGGCGG CCTCAACAAC ATGATCGACG GCCTGGCCAA CGCCTATGCG
CTGTACAAGC CGAAGATGAT CGCGGTCTCG ACCACCTGCA TGGCCGAGGT CATCGGTGAC
GACCTCAACG CGTTCATCAA GAACGCCAAG GAAAAGGGCT CGGTCCCGCA GGAATTCGAC
GTCACCTACG CCCACACCCC GGCGTTCGTC GGCAGCCACA TCACCGGCTA CGACAACACC
ATGAAGGGCA TCGTCGAGCA CTTCTGGGAC GGCAAGTCCG GCACCGTGGA AAAGCTCGAG
CGCAAGCCGA ACGAGTCGAT CAACTTCCTC GGTGGGTTCG ACGGCTACAC CGTCGGCAAC
ATCCGCGAGA TCAAGCGGAT CTTCGAACTG ATGGGCGTCG ATTACACCAT CTTCGGCGAC
AACAGCGACG TCTGGGATAC CCCGGCCGAC GGTGAGTTCC GGATGTACGA CGGCGGTACC
ACGCTGGAGC AGGCCGCCAA CGCGGTCCAC GCCAAGGCGA CGATCTCGAT GCAGGAGTTC
TGCACCGAGA AGACCCTGGC GACGATCGCC GATCACGGCC AGGAAGTGGT CGCCTTCAAC
CACCCGGTCG GCATCGCCGG CACCGATCGC TTCCTGCAGG CGGTGTCGCG GATCACCGGC
AAGGCGATCC CGGAAGCGCT GACCAAGGAG CGCGGCCGTC TGGTTGACGC CATCGGCGAC
TCCTCGGCCC ACATCCACGG CAAGAAGTTC GCGATCTACG GCGATCCGGA CCTCTGCTAC
GGCCTCGCCG AATTCATCCT CGAACTCGGC GGCGAGCCGG TCCACATCCT GGCGACCAAC
GGCAACAAGA CCTGGGAAGC CAAGGTTCAG GCTCTGCTCG ACTCGTCGCC GTTCGGCGCG
GGCTGCAAGG TCTACGCCGG CAAGGATCTG TGGCACCTGC GGTCGCTGCT GTTCACCGAA
CCGGTGGACT TCATGATCGG TAACACCTAC GGCAAGTATC TCGAGCGCGA CACGGGCACC
CCGCTGATCC GTCTCGGCTT CCCGGTGTTC GACCGCCACC ACCACCACCG CTCGCCGGTG
TGGGGCTATC AGGGGTCGAT GAACGTCCTG GTCAAGATCC TCGACAAGAT CTTCGACGAA
ATGGACAAGG CGACCAACAC TGCCGGCAAG ACCGACGTCA GCTTCGATAT CATCCGCTGA
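
The GC content field in the record (62%) can be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as plain text in the space-separated 10-base blocks shown above; `gc_content` is a hypothetical helper name, and only the first two blocks are used here as a small demonstration.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First two 10-base blocks of the gene sequence, for illustration only.
first_blocks = "ATGACCGAGA CCGCAGAGAA"
print(gc_content(first_blocks))
```

Passing the full 1560 bp sequence instead of the short excerpt gives the value to compare against the GC content field.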
 
Protein sequence
MTETAEKIRD HFELFREPQY EELMENKRKN FENYVGDAEV TRVADWTKTK EYQDKNFARE 
ALVINPAKAC QPLGAVFAAV GFEKTLPFVH GSQGCVAYYR SHFTRHFKEP TSCVSSSMTE
DAAVFGGLNN MIDGLANAYA LYKPKMIAVS TTCMAEVIGD DLNAFIKNAK EKGSVPQEFD
VTYAHTPAFV GSHITGYDNT MKGIVEHFWD GKSGTVEKLE RKPNESINFL GGFDGYTVGN
IREIKRIFEL MGVDYTIFGD NSDVWDTPAD GEFRMYDGGT TLEQAANAVH AKATISMQEF
CTEKTLATIA DHGQEVVAFN HPVGIAGTDR FLQAVSRITG KAIPEALTKE RGRLVDAIGD
SSAHIHGKKF AIYGDPDLCY GLAEFILELG GEPVHILATN GNKTWEAKVQ ALLDSSPFGA
GCKVYAGKDL WHLRSLLFTE PVDFMIGNTY GKYLERDTGT PLIRLGFPVF DRHHHHRSPV
WGYQGSMNVL VKILDKIFDE MDKATNTAGK TDVSFDIIR
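
The protein sequence corresponds to the gene sequence read in frame from the first base under translation table 11. The sketch below builds the codon table from the usual TCAG ordering (the amino acid assignments of table 11 are the same as those of the standard code) and translates the first 30 bases as a check; `translate` is an illustrative helper, not a tool used by the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACCGAGA CCGCAGAGAA GATCCGCGAT"))  # MTETAEKIRD
```

The output matches the first block of the protein sequence, and the last five codons of the gene (GAT ATC ATC CGC TGA) translate to the final residues DIIR followed by the stop codon.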