Gene Rpal_2887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2887 
Symbol 
ID6410556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3149107 
End bp3150465 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content66% 
IMG OID642712767 
Productoxidoreductase/nitrogenase component 1 
Protein accessionYP_001991870 
Protein GI192291265 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.513277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGAAGG CACTTCAGCC GCCGACCCGG GAGCGGAGAC TCAATGGCGT CAGTGCCTAT 
TTGGGCGACG TCGCGTCCTT GATCGCTGAA CTCGAAGCCG GGCCACCCGC GTCACGGATT
CGCACGTTCT CGCAAGTTGC CAGCGATGAC GTGTCGATCG CGTTGCGCGT TCTGGCAGAA
GTTGAGGGGC TGGGGATCAT CGTACACGGT GCGCGGGGCT GCGCCGGAGC GCTGGTGCGC
AGTCGCCCGG GCCGTCCATG GGCGGTCACC AACCTCGATC AGCGCGACAC CATTCTCGGC
GCAGACGGAA ATCTCGCTCG GACGCTACGT CGACTGCATC AACGCCATCG CCCCTGGGGA
ATTGTCGTGG TGGCGACGCC CGTCGTTGCG ATCAATAACG ACGACATCCA GGCGGTGGCT
CAAGAGCTGA GTGACGAACT CGGCATTCCG GTCGTCGAGT TGCGGACGGA TGGATTCCGT
TCGCGGATCG GTGCCACGGG CTACGATATT GCGAGCGCGG CTCTTGCGTC CCTGGTGCCG
CCACAGCACG GCCGTCGGCG CGACCTGATC AATCTGCTGG CTTTTGAGCG GGGCCCGGGC
CTCACCGCTG TGGTGCAACA GCTTGCCGGG CTTGGGCTCG AGGTGAATCT GGTGCCGGCG
GGGGCCGGCC AGGAGGCGTT CGCCAAGGCG GCGCAAGCGG TCCTCAGCGT CGCGGTGTTT
CAGGACGAGG CCGACGTGCT GCGCCGCGAA CTCGACCGGC TGCACCGCGT GCCGTTCCTG
CATCTGCCGC CGCCGATCGG TTCTACTGGT GCGCTGCGGT TCGTCGAAGC CGTCGCTGAG
GCAACCGAGC GGCCGCTCCC GATCCGGATC GATGATCGCA TCAGCACCGA CCTGCTGGAG
GACCGCCGTG TGGTGATTGC GCTGCCGCCA TCTCAGGCGC TGGCGGTCGC GGAGCTTGTG
GTGCAGTTCG GCGGCCGGAT TGCCGGGATC AGCGTCGACT GGATCGACAG CCTTCATGTC
GAGGGGCTGA AGGCCCTGAC CAATGCAGCG ACCGTCGCTC TGCATGTCGG GGCGGGGCAG
GCGTTTGAAT TGGTGAACTG GCTCGGCAAG ATTGAGCCGG ATCTGCTCAT CGGAACGCCG
ACCGCTGCGG CGACCGCGAC CCGCGTTGGG ATTGCGGCTG TCGCAATCGA AGGTGACGAT
TTGCTTGGCA CGGCCGGAGA GACACGGCTT GCAACGCGCA TCGGCCGGGC GCTGGCCGCA
CAGCAAATTG GTTTCACCGC TGGGGCCGGG ACGTCTCCGT ATCGGGCAGG TTGGCTGAAG
CGCAGTCCGG ATTGGCACAT CAAGCGTGAG GTGAGATGA
 
Protein sequence
MEKALQPPTR ERRLNGVSAY LGDVASLIAE LEAGPPASRI RTFSQVASDD VSIALRVLAE 
VEGLGIIVHG ARGCAGALVR SRPGRPWAVT NLDQRDTILG ADGNLARTLR RLHQRHRPWG
IVVVATPVVA INNDDIQAVA QELSDELGIP VVELRTDGFR SRIGATGYDI ASAALASLVP
PQHGRRRDLI NLLAFERGPG LTAVVQQLAG LGLEVNLVPA GAGQEAFAKA AQAVLSVAVF
QDEADVLRRE LDRLHRVPFL HLPPPIGSTG ALRFVEAVAE ATERPLPIRI DDRISTDLLE
DRRVVIALPP SQALAVAELV VQFGGRIAGI SVDWIDSLHV EGLKALTNAA TVALHVGAGQ
AFELVNWLGK IEPDLLIGTP TAAATATRVG IAAVAIEGDD LLGTAGETRL ATRIGRALAA
QQIGFTAGAG TSPYRAGWLK RSPDWHIKRE VR