Gene Rsph17025_1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1249 
Symbol 
ID5084422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1291327 
End bp1292949 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content68% 
IMG OID640482807 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_001167455 
Protein GI146277296 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGGCA GGGGACAAGC CGTGGCGCAT GACGCTGCTT TCGCATCGGT GGTTCTCGCC 
GCAGCGCGGG ATTTCCCTTC CCCTCCAGCG CCCTCCTGCC GTTTCCGCAA CCTGGCACGC
TGCGTGCTTT CCCCTTCGGT GACGGGACAG GAGCCGCGCA TGTCGGAAGC CTTGAAGCAG
AAGATCCAGG ACGCCTTTCA CGAGCCCGGC TGCGCGACCA ACGCCGCCAA GCCCGAGGCC
GAGCGCAAGA AGGGCTGCGC CAAGCAGCTG ACCCCCGGCG CCGCGGCGGG CGGCTGCGCC
TTCGACGGCG CCATGATCGC GCTGCAACCG ATCACCGATG TGGCCCATCT GGTCCATGCC
CCGCTCGCCT GCTGGGGCAA CGGCTGGGAC AACCGCGGCT CGGCCTCGTC GGGCTCCGAA
CTTTACCGCA AGGGCTTTAC CACCGACCTG ACCGAACTCG ACATCGTGAT GGGCAACGGC
GAGAAGAAGC TCTTCCGCGC CATCCGCGAG GTGATCGCGC AGGAGAACCC GGCCGCCGTC
TTCGTCTATG CCACCTGCGT GACCGCGCTG ATCGGCGACG ATCTCGGCGC GGTCTGCAAG
GCCGCCGCCG AACGGTTCGG GCGGCCGGTG ATCCCGGTCA ACGTGCCGGG CTATGTCGGC
TCGAAGAACC TCGGCAACAA GCTGGGCGTG GATGCGCTGG TCGAGCATGT GGTGGGCACG
ATGGAGCCCG CCGAGCCGGG CCTGACCGAC ATCAACATCA TCGGCGACTT CAACCTGTCG
GGCGAGCTCT GGCAGGTGAA GCCGCTGCTC GACCGGCTGG GCATCCGCAT CCTCGGCAGC
GTCTCGGGCG ATGCGCGCTA TGCGCAGGTG GCGATGATGC ACCGCGCGCG GGTGACGATG
CTGGTCTGCT CGCACGCCTT CATGGCCATC GCCCGCAAGC TGGAGGAACG CCACGGCATC
CCGTGGTTCG AGGGCAGCTT CTATGGCATC TCCGACACCT CCGCCGCGCT GCGCACCCTG
TGCCGGATGC TGGTGGAGCG CGGCGCCCCC GCCGACCTTC TTCCCCGCTG TGAGGCCCTC
ATCGCCGAGG AAGAGGCACG GACCCGGGCG GAGCTTGCTC CCCTTCGCCC GCGGGTCGAG
GGGCGCCGCG TGCTTCTCTA TACCGGCGGG CACAAGACCT GGTCGGTGGT CTCGGCCCTG
CAAGAACTGG GGATCGAGGT GGTGGGCACC TCGATGCGCA AGGCGACCGA CGGCGACCGC
GGGCGCGTCA CCGAGATCAT GGGCACCGAC GCTCACATGT ATGAGAACAT GGCGCCGGCC
GAGATGTATC GCCTGCTGCG CGAGGCGCGA GCGGACGTGC TGATGTCGGG CGGGCGGTCG
CAGTTCGTGG CGCTGAAGGC ACGCGTGCCC TGGATCGACG TGAACCAGGA AAAGCACGAA
CCCTACGCCG GTTACATGGG CATGGTCGAA CTGGTCCGCG CCATCGACCG CGCGGTGAAC
AACCCGATGT GGGCCGACCT GCGGGAGCCC GCGCCGTGGG AGATGCCGGC CTGCGAGGCT
CCCGACGCAC CTTTCGTGCT GGCCGCCGTG CCCGGCTCGA AAGCCGATTT CGAGGATTGC
TGA
 
Protein sequence
MSGRGQAVAH DAAFASVVLA AARDFPSPPA PSCRFRNLAR CVLSPSVTGQ EPRMSEALKQ 
KIQDAFHEPG CATNAAKPEA ERKKGCAKQL TPGAAAGGCA FDGAMIALQP ITDVAHLVHA
PLACWGNGWD NRGSASSGSE LYRKGFTTDL TELDIVMGNG EKKLFRAIRE VIAQENPAAV
FVYATCVTAL IGDDLGAVCK AAAERFGRPV IPVNVPGYVG SKNLGNKLGV DALVEHVVGT
MEPAEPGLTD INIIGDFNLS GELWQVKPLL DRLGIRILGS VSGDARYAQV AMMHRARVTM
LVCSHAFMAI ARKLEERHGI PWFEGSFYGI SDTSAALRTL CRMLVERGAP ADLLPRCEAL
IAEEEARTRA ELAPLRPRVE GRRVLLYTGG HKTWSVVSAL QELGIEVVGT SMRKATDGDR
GRVTEIMGTD AHMYENMAPA EMYRLLREAR ADVLMSGGRS QFVALKARVP WIDVNQEKHE
PYAGYMGMVE LVRAIDRAVN NPMWADLREP APWEMPACEA PDAPFVLAAV PGSKADFEDC