Gene RPC_4681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4681 
Symbol 
ID3972387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5239648 
End bp5241036 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content61% 
IMG OID637927793 
Productnitrogenase 
Protein accessionYP_534522 
Protein GI90426152 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR02931] Fe-only nitrogenase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTGCC AAGTCAAAGT CAAGGACCGC GTCGGCACCA TCAACCCGAT CTTCACCTGC 
CAGCCGGCCG GCGCTCAGTT CGCCTCGATC GGCATCAAGG ACTGCATCGG CATCGTGCAT
GGCGGTCAGG GCTGCGTGAT GTTCGTCCGG CTTTTGATCT CCCAGCACCT CAAGGAGAGC
TTCGAGATCG CCTCGTCCTC AGTGCACGAG GACGGCGCGG TGTTCGGCGC GCTCGACCGG
GTCGAGACCG CGGTCGACGT GCTGTTGATG CGCTATCCGC ATGTCAAGGT GGTGCCGATC
ATCACCACCT GCTCGACCGA AGTGATCGGC GACGACGTCG ATGGCCTGAT CACTAAGCTG
GAGGAGGGCC TGCTCGCCGA GAAATATGCC GACCGCGAAG TGCATCTGCT GGCCATCCAT
TCGCCGAGCT TCGTCGGCTC GATGGTGTCG GGCTATGACG TCGCGGTGCG CGACTTCGTC
AAGAAGTTCG CCAAGAAGGG TGAACCGTCG GGCAGGATCA ACCTGTTGAC CGGCTGGGTG
AACCCGGGCG ACGTCAAGGA GCTCAAACAC CTGGTCTCCG AACTCGGCAT CGAGGCCAAC
GTGCTGTTCG AGATCGAGAG CTTCGACAGC CCGCTGATGC CCGACGGCGC CGGCATCTCG
CATGGCAACA CCACCGTCGC CGACCTCGAA GCCACCGGCT CGGCGATCCA CACCTTCGCG
CTGAACCGCT ACGAGGGCGG CAAGGCGGCG CAACTTCTGG AAAAGAAGTT CAAAATCCCC
TCGACCATCG GCCCGACCCC GATCGGCATC CGCAACACCG ACGCGCTGTT GAAGAAGCTC
TCGCAGGTCA CCGGCAAGCC GATCCCTGCG AGCCTGGTCA AGGAACGCGG CATCGCGCTC
GACGCCATCT CTGACGTCGC GCACATGTTC CTCGCCGATA AGAAGGTGGC GATCTACGGC
AATGCGGATC TGGTGATCGG ATTGGCGGAA TTCTGCCTCG ACCTGGAGAT GAAGCCGCAG
CTGTTGCTGC TCGGCGACGA CAATGCCGGC TATGTCAACG ATCCCCGCAT CAAGGCGCTG
CAGGAGAACG TCGATTATCC GATGGAGATC ATCACCAACG CCGACTTCTG GGAGCTGGAA
AACCGGATCA AGAATGAGGG ACTCGAGCTC GATCTGATCC TCGGTCACTC CAAGGGCCGC
TTCATCTCGA TCGACTACAA TATCCCGATG GTGCGGGTGG GCTTCCCGAC CTACGACCGC
GCCGGGCTCT ATCGTCACCC GGTCGTCGGC TATGGCGGCG CCACTTGGCT CGCCGAACAG
ATGGCCAATG CGCTGTTCAC CGACATGGAG CACAAGAAGA ACAAGGAATG GCTGCTGAAC
GTGTGGTGA
 
Protein sequence
MNCQVKVKDR VGTINPIFTC QPAGAQFASI GIKDCIGIVH GGQGCVMFVR LLISQHLKES 
FEIASSSVHE DGAVFGALDR VETAVDVLLM RYPHVKVVPI ITTCSTEVIG DDVDGLITKL
EEGLLAEKYA DREVHLLAIH SPSFVGSMVS GYDVAVRDFV KKFAKKGEPS GRINLLTGWV
NPGDVKELKH LVSELGIEAN VLFEIESFDS PLMPDGAGIS HGNTTVADLE ATGSAIHTFA
LNRYEGGKAA QLLEKKFKIP STIGPTPIGI RNTDALLKKL SQVTGKPIPA SLVKERGIAL
DAISDVAHMF LADKKVAIYG NADLVIGLAE FCLDLEMKPQ LLLLGDDNAG YVNDPRIKAL
QENVDYPMEI ITNADFWELE NRIKNEGLEL DLILGHSKGR FISIDYNIPM VRVGFPTYDR
AGLYRHPVVG YGGATWLAEQ MANALFTDME HKKNKEWLLN VW