Gene RSP_0538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_0538 
SymbolnifE 
ID3718047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp2273554 
End bp2275017 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content69% 
IMG OID640071747 
ProductnifE, nitrogenase molybdenum-cofactor synthesis protein 
Protein accessionYP_353611 
Protein GI77464107 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.510346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGAAG CCTTGAAGCA GAAGATCCAG GACGCCTTTC ACGAGCCGGG CTGCGCCACG 
AACACCGCCA AGTCCGAGGG CGAGCGCCGG AAGGGCTGCG CGAAACAGCT GACGCCCGGC
GCGGCGGCCG GGGGCTGCGC CTTCGACGGG GCGATGATCG CGCTGCAGCC CATCACCGAC
GTGGCCCATC TCGTCCATGC CCCGCTCGCC TGCTGGGGCA ACGGCTGGGA CAACCGCGGC
TCGGCCTCGT CGGGCTCCGA CCTCTACCGT CGCGGCTTCA CCACCGACCT CTCCGAGCTC
GACATCGTGA TGGGCCGCGG CGAGGCCAGG CTCTTCCGTG CCATCCGCGA AGTGATCGCG
CAGGAGAACC CGGCCGCAGT CTTCGTCTAT GCCACCTGCG TGACGGCGCT CATCGGCGAC
GACATCGGCG CCGTCTGCAA GGCCGCCGCC GAACGGTTCG GCCGCCCGGT GATCCCGATC
AACGTGCCGG GCTATGTGGG CTCGAAGAAC CTCGGCAACA AGCTGGGGGT GGACGCGCTG
GTCGAACATG TCGTGGGGAC GATGGAGCCC GCGACGGCGA CCGATTGCGA CATCAACATC
CTCGGCGACT TCAACCTGTC GGGCGAACTC TGGCAGGTGA AGCCGCTGCT CGACCGCCTC
GGCATCCGCA TCCTCGGCTC GGTCTCGGGG GATGCGCGCT ATGCGCAGGT GGCCATGATG
CACCGGGCGC GGGTGACGAT GCTCGTCTGC TCGCACGCCT TCCTGGGCAT CGCCCGCAAG
CTCGAGGACC GCTACGGCAT CCCGTGGTTC GAGGGCAGCT TCTACGGCAT CTCCGACACG
TCCGACGCGC TGCGGACCCT GTGCCGGATG CTGGTCGAGC GCGGCGCGCC CGCGGACCTC
GTGACCCGCT GCGAGGCGCT GATCGCCGAG GAGGAGGCCC GCACCTGGGC CGCGCTGGAA
CCGCTCCGCC CCGCCGTCGC CGGCCGGCGC GTGCTCCTTT ACACCGGCGG GCACAAGACC
TGGTCGGTGG TCTCGGCGCT GCAGGAACTC GGCATGGAGG TGGTCGGCAC CTCGATGCGC
AAGGCCACGC CCGGCGACCG CGCGCGCGTC ACCGAGATCA TGGGCACCGA GGCCCACATG
TACGAGAACA TGGCGCCGAA GGAGATGTAT CGGATGCTGC GGGACGCGCG GGCCGATGTG
CTTATGTCGG GGGGGCGGTC GCAGTTCGTG GCGCTGAAGG CCCGCGTGCC CTGGATCGAC
GTGAATCAGG AAAAGCACGA GCCCTACGCA GGCTACATGG GCATGGTCGA TCTCGTGCGC
GCCATCGACC GGTCGATCAA CAACCCGATG TGGGCCGAGC TGCGCGACCC CGCGCCTTGG
GACGTGCCGG CCGAAGAAGC CGCCGTGACG CCCTTCAGCC TCGCGGCCGT TCCCGGCTCG
AAAGCCGATT TCGAGGATTG CTGA
 
Protein sequence
MSEALKQKIQ DAFHEPGCAT NTAKSEGERR KGCAKQLTPG AAAGGCAFDG AMIALQPITD 
VAHLVHAPLA CWGNGWDNRG SASSGSDLYR RGFTTDLSEL DIVMGRGEAR LFRAIREVIA
QENPAAVFVY ATCVTALIGD DIGAVCKAAA ERFGRPVIPI NVPGYVGSKN LGNKLGVDAL
VEHVVGTMEP ATATDCDINI LGDFNLSGEL WQVKPLLDRL GIRILGSVSG DARYAQVAMM
HRARVTMLVC SHAFLGIARK LEDRYGIPWF EGSFYGISDT SDALRTLCRM LVERGAPADL
VTRCEALIAE EEARTWAALE PLRPAVAGRR VLLYTGGHKT WSVVSALQEL GMEVVGTSMR
KATPGDRARV TEIMGTEAHM YENMAPKEMY RMLRDARADV LMSGGRSQFV ALKARVPWID
VNQEKHEPYA GYMGMVDLVR AIDRSINNPM WAELRDPAPW DVPAEEAAVT PFSLAAVPGS
KADFEDC