Gene Rpal_5098 details


Gene Information

Locus tag: Rpal_5098
Symbol:
ID: 6412792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5482067
End bp: 5483530
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 64%
IMG OID: 642714983
Product: nitrogenase molybdenum-cofactor biosynthesis protein NifE
Protein accession: YP_001994062
Protein GI: 192293457
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
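
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(5482067, 5483530)
print(length)                  # 1464
print(protein_length(length))  # 487
```

Both results agree with the Gene Length and Protein Length fields of the record.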


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCC TCGCAGACAA GATTCAAGAC GTCTTCAACG AGCCCGGCTG TGCGGCCAAT 
CAGGCCAAGT CCGACAAGCA ACGCAAGAAG GGCTGCAGCA AACCGCTGCA GCCGGGGGGC
GCAGCCGGTG GCTGCGCCTT CGACGGCGCC AAGATCGCGC TGCAGCCGAT CGTCGACGTC
GCGCACCTGG TGCACGGCCC GATCGCCTGC GAAGGCTCGT CCTGGGACAA TCGCGGCACC
AAGTCGTCCG GCTCGAAGCT GTATCGCACC GGCTTCACCA CCGACATGGG CGAGAACGAT
GTGATCTTCG GCGGCGAGAA GCGGCTGTTT AGGTCGATTC GCGAGATCAT CGAGAAGTAC
GATCCGCCGG CCGTGTTCGT GTATCAGACC TGTGTTCCGG CGATGATGGG CGACGACATC
GTCGCCGTCT GCAAGGTCGC CTCCGAGAAA TTCGGCAAGC CCTGCGTTCC GATCATCTCT
CCCGGCTTCG TCGGCCCGAA GAATCTCGGC AACAAGCTCG CCGGCGAGGC GATGCTCGAT
TACGTGATCG GCACTCAGGA GCCGGAGTTC ACGACCCCCT ACGACATCAA CATCATCGGC
GAATACAACG TCGCGGGCGA ATTGTGGCAG GTGAAGCCGC TGCTCGACGA ACTCGGGATC
CGCATTCTGT CGTGCCTGTC GGGCGATGCG CGCTATCACG AAGTGGCGCA ATCGCACCGC
GCCCGCGCCG CCATGATGGT GTGCTCGACC GCGATGATCA ATGTCGCGCG CAAGATGGAA
GAGCGCTACG GCATCCCGTA TTTCGAAGGC TCGTTCTACG GCATCAGCGA CACCTCCGAG
TCGCTTCGCC AGATTGCGCG GTTGCTGATC GCCCGCGGCG CGCCGGACGA GCTGATGGCC
CGCACCGAGG CGCTGATCGC CCGCGAAGAG GCCAAGGCCT GGGCGGCGAT CAAGGCCTAC
ACCCCGCGGC TGGAAGGCAA GAAGGTGCTG CTGATCACCG GCGGCGTAAA GTCGTGGTCG
GTGGTGGCGG CGCTGCAGGA AGCGGGGCTG TCCATCGTCG GCACCAGCGT CAAGAAGTCG
ACCAAGGAAG ACAAGCTGCG GCTCAAGGAG ATGAGCCCGG ACGTCCACCA GATCGACGAT
CTGCGCCCAC GCGAAATGTA CAAGATGCTC AAAGATGCGC AGGCCGACAT CATGCTGTCG
GGCGGCCGCT CGCAGTTCGT CGCCTTGAAG GCGCGGATGC CCTGGATGGA TATCAACCAG
GAGCGCACCT ACGCGTATTG CGGCTATGTC GGCATCGTCG AGATGGTTCG GCAGATCGAC
AAATCGCTGT CCAATCCGAT CTGGGCTCAG GTGCGCAGCG CCCCGCCGTG GGACGAGGTC
ACCTGGGAGC AGCGCGCGGA CGCCGCCAAC GCCGCCGACG ATCGCCAACG CGCGATCTTC
GGGCGTTCGG CTCGAGTGGC GTGA
 
Protein sequence
MSRLADKIQD VFNEPGCAAN QAKSDKQRKK GCSKPLQPGG AAGGCAFDGA KIALQPIVDV 
AHLVHGPIAC EGSSWDNRGT KSSGSKLYRT GFTTDMGEND VIFGGEKRLF RSIREIIEKY
DPPAVFVYQT CVPAMMGDDI VAVCKVASEK FGKPCVPIIS PGFVGPKNLG NKLAGEAMLD
YVIGTQEPEF TTPYDINIIG EYNVAGELWQ VKPLLDELGI RILSCLSGDA RYHEVAQSHR
ARAAMMVCST AMINVARKME ERYGIPYFEG SFYGISDTSE SLRQIARLLI ARGAPDELMA
RTEALIAREE AKAWAAIKAY TPRLEGKKVL LITGGVKSWS VVAALQEAGL SIVGTSVKKS
TKEDKLRLKE MSPDVHQIDD LRPREMYKML KDAQADIMLS GGRSQFVALK ARMPWMDINQ
ERTYAYCGYV GIVEMVRQID KSLSNPIWAQ VRSAPPWDEV TWEQRADAAN AADDRQRAIF
GRSARVA
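
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon assignments of NCBI translation table 11 (identical to the standard code for elongation codons) and translates the first and last stretches of the gene. The helper name `translate` is hypothetical, and the sketch does not handle alternative start codons, which this gene does not need because it begins with ATG.

```python
from itertools import product

# NCBI genetic code 11 amino acid string, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 24 bases of the gene sequence in this record.
print(translate("ATGAGCCGCC TCGCAGACAA GATTCAAGAC"))  # MSRLADKIQD
print(translate("GGGCGTTCGG CTCGAGTGGC GTGA"))        # GRSARVA
```

The two outputs match the first ten and last seven residues of the protein sequence above.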