Gene Rru_A1388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1388 
Symbol 
ID3834803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp1639371 
End bp1640972 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content62% 
IMG OID637825478 
ProductNifA subfamily transcriptional regulator 
Protein accessionYP_426476 
Protein GI83592724 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTATGG ACTATACCTA TGCGATTGAT CTCGACGATT TCGTCGATGA TTCCACCGAC 
TGCAAGACCG GCGAATGTCG TGTCGCCGTT CTGCCGATCC TGTTCCAGAT CAGCCAGATC
ATTTCCGAGA GCGAGGATTT GCCCCGGTCT CTGGCGATCA TTCTCAAGGT CATGCAGCAG
CGCATGCGCA TCGGCCGCGG AACCGTCAGT CTTTATGACC GCGAAAGCGG CAAGATCTTC
GTTCACGAAA GCTTTGGCGG CCGGGACGAC CAGCAGGCCC TGGGCGCCAG CGGGCCGGGG
CGCGGCATCA CCGCCAAGGT GGTGGACTCG GGCAAGGCGA TCATCGTGCC CGAACTGCGC
GACAGCCCGA CCAAGCCCAA CCGCACCCAG CTTCAGGCCG ATGGCAGCGA TTTGGCGGTG
TCGTTCTTCT GCGTGCCGAT CCTGCGCGGT CGCAAGGTTC TGGGCACCAT CAGCGCCGAG
CGCGTCTATG CCAATCGCCG GCTGTTGAAG CAGGACGTGG AACTGATCGC CACCATCGCC
TCGATGATCG CCCCGGCGGT GGAGCTGTAT CTTTTGGAGA ACATCGACAA GGTGCGGCTG
GAAAACGAGA ATCGCCGTCT GCACGACGCC CTGAAAAGCC GCTTCAAACC TACCAATATC
ATCGGCACCT CCAAGCCGAT GCAAGAGGTC TATGATCTGA TCCACAAGGT GGCGATGACC
AAGGCCACCG TGCTGATCCT CGGGGAAAGC GGCGTCGGCA AGGAATTGGT CGCCAGCGCC
ATCCATTACA ACGGCGCCAC CGCCGAAGGG CCGTTCATCA AGTTCAATTG CGCCGCCTTG
CCCGAAAGCC TAGGCGAGTC CGAGTTGTTC GGTCACGAGA AAGGCGCCTT CACCGGCGCC
ATCGCCCAGC GCAAGGGGCG CTTCGAGATG GCCGACGGCG GCACCATCTT TCTCGACGAG
GTTGGCGAGC TGAGTTTGGC CATGCAGGCC AAGCTGTTGC GGGTGCTCCA GGAAAAGACC
TTCGAGCGCG TCGGCGGCGG CCGGTCGGTG CGCGTCGACG TGCGGATCAT CGCCGCCACC
AACCGCAATC TGCCCGAGAT GGTCGAGAAG GGCACCTTCC GCGAGGATCT GTTCTACCGG
CTCAATGTCT TCCCCATCAC CTTGCCGCCG CTGCGCGACC GGGGCAGCGA CGTGATCTTG
CTGGTCGATC ATTTCATCGC CCGCCACGCC GCCGAGGGTG GGCGCGAGGC CAAACGGGTG
TCGACCCCGG CCTTGACCAT GCTGATGGCC TATCACTGGC CGGGCAATGT GCGCGAACTG
GAAAACGTTA TCGAACGCTC GGTGATCTTG TCGGAAGACG GGGTGATCCA CGGCTATAAT
CTGCCGCCCT CGCTGCAGAC GGCAACGGAG ACCGGCACGT CGTTTGGCTG CGGCCTGGAA
GCCAAGCTGC AGGCGGTGGA ATACGAAATG ATCGTCGAGG CCCTGAAAAC CCATGGCGGC
AACGCCACCG AGGCGGCCAA GGAGTTGGGG CTGACCCGGC GCATCCTTGG CCTGCGCATG
GAAAAATACG CCCTCAATTA CAAGACTTAT CGCAAGCGCT GA
 
Protein sequence
MLMDYTYAID LDDFVDDSTD CKTGECRVAV LPILFQISQI ISESEDLPRS LAIILKVMQQ 
RMRIGRGTVS LYDRESGKIF VHESFGGRDD QQALGASGPG RGITAKVVDS GKAIIVPELR
DSPTKPNRTQ LQADGSDLAV SFFCVPILRG RKVLGTISAE RVYANRRLLK QDVELIATIA
SMIAPAVELY LLENIDKVRL ENENRRLHDA LKSRFKPTNI IGTSKPMQEV YDLIHKVAMT
KATVLILGES GVGKELVASA IHYNGATAEG PFIKFNCAAL PESLGESELF GHEKGAFTGA
IAQRKGRFEM ADGGTIFLDE VGELSLAMQA KLLRVLQEKT FERVGGGRSV RVDVRIIAAT
NRNLPEMVEK GTFREDLFYR LNVFPITLPP LRDRGSDVIL LVDHFIARHA AEGGREAKRV
STPALTMLMA YHWPGNVREL ENVIERSVIL SEDGVIHGYN LPPSLQTATE TGTSFGCGLE
AKLQAVEYEM IVEALKTHGG NATEAAKELG LTRRILGLRM EKYALNYKTY RKR