Gene Rpal_1622 details


Gene Information

Locus tag: Rpal_1622
Symbol:
ID: 6409279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1735753
End bp: 1737321
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 62%
IMG OID: 642711511
Product: nitrogenase alpha chain
Protein accession: YP_001990626
Protein GI: 192290021
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01284] nitrogenase alpha chain; [TIGR01861] nitrogenase iron-iron protein, alpha chain; [TIGR01862] nitrogenase component I, alpha chain
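
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record.
length_bp = gene_length(1735753, 1737321)
print(length_bp)                  # 1569
print(protein_length(length_bp))  # 522
```

Both results agree with the Gene Length and Protein Length fields listed in the record.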


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATATC ACGAGTTCGA CTGTTCGAAA TGTCTGCCCG AACGCCAGAA GCACGCCGTC 
ACCAAGGGCG CCGGCGATAA TCTCGCCACC GCGCTGCCGC TCGGCTATCT CAACACGATC
CCGGGATCGA TCTCGGAGCG CGGCTGCGCC TATTGCGGCG CAAAGCATGT GATCGGCACG
CCGATGAAGG ACGTGATTCA CATGAGTCAC GGCCCCGTCG GCTGCACCTA CGACACCTGG
CAGACCAAGC GCTATATCTC CGACAACAAC AACTTCCAGC TCAAATACAC CTTCGCCACC
GACGTCCGCG AGAAGCACAT CGTATTCGGC GCCGAAGGCC TTCTGAAGCA GAACATCATC
GAGGCTTTCA AAGCCTGCCC CGACATCAAG CGGATGACGA TCTACCAGAC CTGCGCCACC
GCGCTGATCG GCGACGACAT CAATGCTGTC GCCGCCGAGG TGATGGAGGA GATGCCGGAC
GTCGACATCT TCACCTGCAA CTCGCCAGGC TTCGGCGGCC CCAGCCAGTC CGGCGGCCAC
CACAAGATCA ACATCGCCTG GATCAACGAC AAGGTCGGCA CCGTCGAGCC CGAGATCACC
TCGGATTACG TCATCAACTA TGTGGGCGAA TACAACATCC AGGGCGACCA GGAGGTGATG
CTCGACTATT TCACCCGGAT GGGTATCCAG GTGCTGTCGA CCTTCACCGG CAATGGCACC
TATGACGGCC TGCGGGCGAT GCACCGCGCG CATCTCAACG TGCTCGAATG CGCCCGTTCG
GCCGAATACA TCTGCAACGA GCTGCGCGTC CGCTACGGGA TTCCGCGGCT CGACATCGAC
GGCTTCGGCT TCGAGCCGTT GTCGCAGTCG CTGCGCAAGA TCGGGATGTT CTTCGGCATC
GAAGACCGCG CCGAAGCGAT CATCGCCGAA GAGACCGCGC GCTGGAAGCC GGAGCTCGAC
TGGTACAAGG AACGGCTGAA GGGCAAAAAG GTCTGCCTGT GGCCAGGCGG CTCCAAGCTG
TGGCACTGGG CGCACGCCAT CGAAGAGGAG ATGGGCGTCA AGGTCGTCTC GGTCTACACT
AAGTTCGGCC ACCAGGGCGA CATGGAAAAG GGCATCGCGC GCTGCGGCGA GGATGCGCTG
GCGATCGACG ATCCCAACGA ACTCGAGGGC CTCGAGGCGC TGGAGAAGCT GCAGCCGGAC
ATCATCTTCA CCGGCAAGCG TCCCGGCGAA GTCGCCAAGA AGGTCCGCGT TCCGTACCTC
AACGCCCACG CCTATCACAA CGGCCCATAC AAGGGCTTCG AAGGCTGGGT GCGGTTCGCC
CGCGACATCT ACAACGGCAT CTACTCGCCG ATGCACCAGC TCTCCGGGCT GGACATCAGC
AAGGACGAGA TTCCGGCCGA TCGCGGTTTC GTCACGCAGC GCATGCTGTC CGACGCGAAG
CTGCCGGAAG AGATCGCCAA GTCGGAGACG CTGCGGCGCT ACACCGGCAA GGACGACATC
ATCTCCGACC TGCGCAAGAA GAACGCGCCC TACTTCACCC CGATCGTCAA AGCCGAAGCG
GCCGAGTGA
 
Protein sequence
MPYHEFDCSK CLPERQKHAV TKGAGDNLAT ALPLGYLNTI PGSISERGCA YCGAKHVIGT 
PMKDVIHMSH GPVGCTYDTW QTKRYISDNN NFQLKYTFAT DVREKHIVFG AEGLLKQNII
EAFKACPDIK RMTIYQTCAT ALIGDDINAV AAEVMEEMPD VDIFTCNSPG FGGPSQSGGH
HKINIAWIND KVGTVEPEIT SDYVINYVGE YNIQGDQEVM LDYFTRMGIQ VLSTFTGNGT
YDGLRAMHRA HLNVLECARS AEYICNELRV RYGIPRLDID GFGFEPLSQS LRKIGMFFGI
EDRAEAIIAE ETARWKPELD WYKERLKGKK VCLWPGGSKL WHWAHAIEEE MGVKVVSVYT
KFGHQGDMEK GIARCGEDAL AIDDPNELEG LEALEKLQPD IIFTGKRPGE VAKKVRVPYL
NAHAYHNGPY KGFEGWVRFA RDIYNGIYSP MHQLSGLDIS KDEIPADRGF VTQRMLSDAK
LPEEIAKSET LRRYTGKDDI ISDLRKKNAP YFTPIVKAEA AE