Gene Rpal_5111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5111 
Symbol 
ID6412805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5492122 
End bp5493678 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content64% 
IMG OID642714996 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_001994075 
Protein GI192293470 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC TGCTGCAACT GCACGATTTC AGCGCGCTGG GAACGACGTC GTTCGACGAG 
ATGCGCAAGA GCGCGGCGCA ATCGGGCTGC AGCAGCAAAA GCGGCGCCGG CAAGACCGGC
TGCGGCTCGG CCGCTGGCCC CAGCGATCTG CCGCCGGAAG TCTGGGAGAA GGTGAAGAAT
CATCCCTGCT ACAGCGAGCA GGCGCATCAT CACTTCGCCC GCATGCACGT CGCGGTCGCG
CCCGCGTGCA ACATCCAGTG CAATTACTGC AATCGCAAAT ACGATTGCGC CAATGAATCC
CGTCCCGGCG TGGTCAGCGA GAAGCTGACC CCAGAGCAGG CCGCGCGCAA AGTCGTCGCA
GTGGCCTCGA CCATCCCGCA GATGACAGTA CTCGGCATCG CCGGTCCGGG TGATGCGCTT
GCCAATCCGG CCAAGACCTT CAAGACCTTC GAGCTGGTCA CCGAGACCGC GCCCGACATC
AAGCTGTGCC TGTCGACCAA CGGCCTGATG CTGCCCGACT ATGTCGAGCA GATCGCCGCC
ATGAAGGTCG ATCACGTCAC CATCACGATC AATATGGTCG ATCCGGAGGT CGGCGCGAAG
ATCTACCCGT GGATCTTCTA CAATCACCGC CGTTACACCG GCGTCGAGGC GTCGAAGATC
CTCAGCGAGC GGCAGTTGCT CGGACTGGAG ATGCTGGTCG CACGCGGCAT CCTGGTGAAG
GTCAACTCGG TGATGATCCC GGGCATTAAC GACGAGCACC TGATCGAGGT CAACAAGGCG
GTGAAGTCGC GCGGCGCCTT CCTGCACAAC ATCATGCCGC TGATCTCCGA AGCCGAGCAC
GGCACTGCGT TCGGCCTGTC GGGCCAGCGC GGCCCGACCG CGCAGGAGCT GAAGGCGCTG
CAGGACGCCT GCGAAGGCGA GATGAACATG ATGCGGCACT GCCGGCAGTG CCGCGCCGAC
GCGGTCGGCC TGCTCGGCGA GGATCGCAGC GCCGAGTTCA CCACCGAAAA GGTGATGGCG
ATGGACGTCG AATACGACCT CGCCGCGCGC CAGGCCTACC AGGCCAAGGT CGAGGCCGAG
CGCGACGCGA TCGCGGTCGC CAAGCAGCGC GAGCTGGAGA AGCTCGCCGA CGAGACGGCG
ACCATCAAGA TCCAGGTGGC GATCGCCACC AAGGGCGGCG GCGTCATCAA CGAGCACTTC
GGCCACGCCC ACGAGTTCCA GATCTACGAG GTGTCGACCG CCGGTGCGAA GTTCGTCGGC
CACCGCCGTG TCGATCTGTA TTGCGAAGGC GGTTACGCCA GCGAAACCGG TATCGAGCCG
ATCCTCAAGG CGCTGAATGA CTGCACCGCC GTGCTGGTCG CCAAGATCGG CATGTGCCCG
AAGGACTCGC TCGCCGGTGC CGGCATCGAG GCAGTCGAGA CCTACGCATT CGAATACATC
GAGCAGTCGG TGATCGCGTA TTTCAAGGAA TACCTGGAAC GCGTCGGCAA GTCGGAGATT
CGCCACGTCG CGCGAGGCGA TGCCACGATC CGCCAGGGCG CGTTCACCGA GGCCTAG
 
Protein sequence
MSKLLQLHDF SALGTTSFDE MRKSAAQSGC SSKSGAGKTG CGSAAGPSDL PPEVWEKVKN 
HPCYSEQAHH HFARMHVAVA PACNIQCNYC NRKYDCANES RPGVVSEKLT PEQAARKVVA
VASTIPQMTV LGIAGPGDAL ANPAKTFKTF ELVTETAPDI KLCLSTNGLM LPDYVEQIAA
MKVDHVTITI NMVDPEVGAK IYPWIFYNHR RYTGVEASKI LSERQLLGLE MLVARGILVK
VNSVMIPGIN DEHLIEVNKA VKSRGAFLHN IMPLISEAEH GTAFGLSGQR GPTAQELKAL
QDACEGEMNM MRHCRQCRAD AVGLLGEDRS AEFTTEKVMA MDVEYDLAAR QAYQAKVEAE
RDAIAVAKQR ELEKLADETA TIKIQVAIAT KGGGVINEHF GHAHEFQIYE VSTAGAKFVG
HRRVDLYCEG GYASETGIEP ILKALNDCTA VLVAKIGMCP KDSLAGAGIE AVETYAFEYI
EQSVIAYFKE YLERVGKSEI RHVARGDATI RQGAFTEA