Gene Rru_A0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A0994 
Symbol 
ID3833553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp1182865 
End bp1184376 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content66% 
IMG OID637825083 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_426082 
Protein GI83592330 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0535] Predicted Fe-S oxidoreductases
[COG1433] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGGAAC AGGTTATATC CCTGGACAGC ATATCGCGCC TTGGCGTTCT ATCGGCCGCC 
TCCCGCGACC AAGTGCCGCC CCCGGCCGCC GCCGAGGGCT GCGCTTCGCA GGGGCGCTGC
GGCGCCTCGG CCGGGCCCGA TGACATGCCC GCCGAAGTCT GGGAAAAGGT GAAGAACCAT
CCCTGCTATT CGGAAGAGGC CCACCATTTC TTCGCCCGCA TGCATGTGGC GGTGGCGCCG
GCCTGTAACA TCCAGTGCAA TTATTGCAAC CGCAAGTACG ATTGCGCCAA TGAAAGCCGC
CCCGGAGTCG TCTCCGAACG GCTGACGCCC GAGCAGGCGG CGCGCAAGGT GGCCGCCGTC
GCCAACGAAC TGCCCCAGCT TTCGGTGCTT GGCGTCGCCG GTCCGGGCGA CAGCCTGTTT
GACGCCCGCA AGACCTTCGA GACCTTCGCC CGGGTCGGCG CGATGCTGCC CGATCTGAAG
TTCTGCATTT CGACCAATGG CTTGGCCCTT CCCGATCATC TTGACGAACT GACCCGCCAC
GCCATTGATC ACGTCACCAT CACCCTGAAC TGCGTCGATC CCGAAATCGG CGCGCGGATC
TATCCCTGGA TCTATTTCAA GGGCCGGCGC TGGACCGGCG TCGATGGCGC GAAGATCCTG
CTCGAGCGCC AGATGGAAGG GCTTGAAGGG CTGATCGCCC GCAAGATCCT GGTCAAGATC
AACTCGGTGA TGATCCCGGG CATCAACGAC CACCATCTGC CCGAGGTCAA CCGGGTGATC
CGCGAGAAGG GGGCGTTCTT GCACAACGTC ATGCCGCTGA TCTCGGCGCC CGAGCACGGC
ACCCACTTCG GGCTGACCGG CCAGCGCGGG CCGACGCCAG CGGAACTCAA GGACCTGCAG
GATCGGCTTG GCGGCGGCGC CAATCTGATG AAGCATTGTC GCCAGTGCCG GGCCGATGCC
ATCGGCCTGC TGGGCGAGGA TCGCGGGCAG GAGTTCACCA TGGACAAAAT TCCCGAAACG
GTCGTAGTCG ACCCCTCGCG GCGCGAGGCT TACCGGGCTT TCGTTGAAGA GGAGCGCGCC
GATCGTCGGG CCGCCGCCGT CCAGACCGAA TCGGCTCTGG CGCGTGACGC CGCCGTGGGC
ACGCCCGCCC GACTGGTGGC GATCACCACC AAGGGTGGCG GGCGGATCAA CGCCCATTTC
GGTAAGGTCA CCGAGTTCCA GATCTACGAG GTGGATGGGG CCGGGGTGCG CTTCGTCGGC
GCCCGGCGTT TGCCCGATAA ATACTGCCTG GGCGGCTTCG ATAATGCCCC GGTGATGCCG
GCGATCCTTG ACGCCCTGGA GGGGGTCGAC GCGGTGTTGA CGGCCAAGAT CGGTCCGGGA
CCGCGCGGGC GCCTGGAAGC GGCCGGCATC GAGGTGTACG AGGACCGCAC GCTGGAGTAT
ATCGAGCCCG CCATCGGAAG TTGGTACGCG GGCCGGATGG GAGCGGCGCT CTCTGAACGG
TCTTCGGCCT GA
 
Protein sequence
MSEQVISLDS ISRLGVLSAA SRDQVPPPAA AEGCASQGRC GASAGPDDMP AEVWEKVKNH 
PCYSEEAHHF FARMHVAVAP ACNIQCNYCN RKYDCANESR PGVVSERLTP EQAARKVAAV
ANELPQLSVL GVAGPGDSLF DARKTFETFA RVGAMLPDLK FCISTNGLAL PDHLDELTRH
AIDHVTITLN CVDPEIGARI YPWIYFKGRR WTGVDGAKIL LERQMEGLEG LIARKILVKI
NSVMIPGIND HHLPEVNRVI REKGAFLHNV MPLISAPEHG THFGLTGQRG PTPAELKDLQ
DRLGGGANLM KHCRQCRADA IGLLGEDRGQ EFTMDKIPET VVVDPSRREA YRAFVEEERA
DRRAAAVQTE SALARDAAVG TPARLVAITT KGGGRINAHF GKVTEFQIYE VDGAGVRFVG
ARRLPDKYCL GGFDNAPVMP AILDALEGVD AVLTAKIGPG PRGRLEAAGI EVYEDRTLEY
IEPAIGSWYA GRMGAALSER SSA