Gene RPC_4460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4460 
Symbol 
ID3973086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4964308 
End bp4965978 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content65% 
IMG OID637927571 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifE 
Protein accessionYP_534302 
Protein GI90425932 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCC TGGCCGACAA GATCCAAGAC GTCTTCAACG AGCCAGGCTG CTCCACCAAC 
ACCGGCAAAG CCGCCAAGGA GCGCAAGAAG GGCTGCAGCA AACCGCTGCA GCCCGGCGCC
GCTGCCGGCG GCTGTGCCTT TGACGGCGCC AAGATCGCGC TGCAGCCGGT CACCGACGTC
GCCCATCTGG TGCACGGCCC GATCGCCTGC GAAGGCAGTT CGTGGGACAA CCGCGGCGTC
AAGTCGTCGG GCTCCAAGCT CTATCGCACC AGCTTCACCA CCGACATGAG CGAGAACGAC
GTGGTGTTCG GCGGCGAGAA GCGGCTGTTC AAGTCGATCC GCGAAATCAT CGAGAAATAC
GATCCGCCGG CGGTGTTCGT CTATCAGACC TGCGTGCCGG CGATGATGGG CGACGACATC
GCCGCGGTCT GCAAGGTGGC GAGCGAGAAG TTCGGCAAGC CGTGCATCCC GATCATCTCC
CCGGGCTTCG TCGGGCCGAA GAATCTCGGC AACAAGCTCG CCGGCGAGGC GATGCTGGAT
TACGTGATCG GCACCCAAGA GCCAGATTAC ACCACGCCCT ACGACATCAA CATCATCGGC
GAATACAACG TCGCCGGCGA ACTCTGGCAG GTCAAGCCGC TGCTCGACGA ACTCGGCATC
CGCATCCTGT CGTGCCTGTC GGGCGACGCC CGCTACCACG AGATCGCCCG CTCGCACCGC
GCCCGCGCCG CGATGATGGT GTGCTCGACC GCGATGATCA ACGTCGCCCG CAAGATGGAA
GAGCGCTACG GCATTCCGTA TTTCGAAGGC TCGTTCTACG GCATCACCGA CACCTCGGAT
TCGCTGCGGC AGATCGCCAG GCTCTTGATC GCGCGCGGCG CCGACGCCGA GTTGATGGAC
CGCGTCGAAG CGGTGATCGC CCGCGAGGAA GCCCGCGCTT GGGAGGCGCT CAAGGCGTTC
ACGCCGCGCC TGCAAGGCAA GAAGGTTTTG TTGATCACCG GCGGCGTCAA ATCCTGGTCG
GTGGTGGCGG CGCTGCAGGA GGCCGGGCTC AGCATCGTGG GCTCAAGCGT CAAGAAGTCG
ACCAAGGAGG ACAAGGAGCG GCTCAAGGAG ATGAGCCCCG ACGTCCACCA GATCGACGAC
CTGCGGCCGC GCGAAATGTA CAAGATGCTG AAAGAGGCCA AGGCCGACAT CATGCTGTCC
GGCGGCCGCT CGCAATTCGT GGCGCTGAAG GCCAAGATGC CCTGGCTCGA TATCAACCAG
GAGCGGCACT ACGCCTATGC GGGCTATGTC GGCATCATCG AGATGGTTCG GCAGATCGAT
AAGGCGTTGT CCAATCCGGT GTGGCAGCAG GTCCGCATGG CGCCGCCGTG GGACGAGGTC
AGCTGGGAAG ACCGCGCCGA CGCCGCCAAT GCGGCGGATG CAGCAGCGCT CGCCGCCGAT
CCGGAGCGCG CCGAGGCCGA GCGCCGGGCG ACCATCGTTT GCAAATGCAA GGAGATCAGC
GTCGGCACCA TCGAGGATGC GATCCGCAGC CACGACCTCA CCGTGATCGC GCAAGTCACC
CAGCTCACCC AGGCCGGCGG TGGGTGCGGT TCTTGCAAGG AGACCATCGC GGCGATGCTG
GCGCGCGAAG CGGCGACCGC CACGGGCGCG CCCTCCGTTC AGGTCGCATG A
 
Protein sequence
MSSLADKIQD VFNEPGCSTN TGKAAKERKK GCSKPLQPGA AAGGCAFDGA KIALQPVTDV 
AHLVHGPIAC EGSSWDNRGV KSSGSKLYRT SFTTDMSEND VVFGGEKRLF KSIREIIEKY
DPPAVFVYQT CVPAMMGDDI AAVCKVASEK FGKPCIPIIS PGFVGPKNLG NKLAGEAMLD
YVIGTQEPDY TTPYDINIIG EYNVAGELWQ VKPLLDELGI RILSCLSGDA RYHEIARSHR
ARAAMMVCST AMINVARKME ERYGIPYFEG SFYGITDTSD SLRQIARLLI ARGADAELMD
RVEAVIAREE ARAWEALKAF TPRLQGKKVL LITGGVKSWS VVAALQEAGL SIVGSSVKKS
TKEDKERLKE MSPDVHQIDD LRPREMYKML KEAKADIMLS GGRSQFVALK AKMPWLDINQ
ERHYAYAGYV GIIEMVRQID KALSNPVWQQ VRMAPPWDEV SWEDRADAAN AADAAALAAD
PERAEAERRA TIVCKCKEIS VGTIEDAIRS HDLTVIAQVT QLTQAGGGCG SCKETIAAML
AREAATATGA PSVQVA