Gene RPB_3359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3359 
Symbol 
ID3911161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3840944 
End bp3842905 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content63% 
IMG OID637885262 
Productradical SAM family protein 
Protein accessionYP_486966 
Protein GI86750470 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.39571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0330371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTGC CAAGGCGTGC CCGTGGTGAC GAACTATTGG CCCCCGGCAA GATGCTGGCT 
CTGCGGGAGC GGCTGCGACA TCTTTCCGGA AGCCACGACC TGACGACCGT CATCGTCAGT
GCCTTTGATC ACAGAACGAG GGTCCTTCCC TTCATTCTTG CCGATACGCG GATGGCCCCC
GGCGGCGTGC GGGCGATCGG GTCGGCGCTC GTGGACGTCG GCTTCGACAA AACGCGGATC
GTCCTCCAGC AGTGGAATCG CAATTTCAGC CCGTTGCACA TGCGCCTTGA TGGCCGCATT
CCGGATCTGT TCCTGGTTTC GAGCATGCAT CTGCATTCGG CGGAGTGCGA CCGACTGATC
CGGGAGGCCT GCCGGATCGA GCCGGACAAG CGCCCGCTGA TCATCGTGGG CGGTCCGCGG
ATCCGCTACG AGCCCTGGCA CGTCTTCGGT GCAGACCCGA ACGCGAACTG GGCCGCGGAC
GTCGCCGTCA CCGGCGAAGA GTTTGTTTTT CTGGAATTGC TGGAAGTGCT GCTGTCGATG
CGCGCGGGCG GCGAATCGAT GCGCTCGGTG TTCGCCCGCG CGCGCGATAG CGGCGCCCTC
GACCACGTCG CGGGGATCGT CTACGCGCGC AGCGCGCCGC GCGGCGGACC GATCGAGGAA
CTGATCGACA CCGGCGTCCA GCGCCTGCTC GGCGATCTCG ATGAACTTCC CGACTCGGTC
CACGGCTACA AACTGCTGGA GACGCCGAGC AACGAGACGA CGCTGGCGAG CCATGCCTTG
CCCGCCAATC GTGTGAGGAA GTTCAGCCCG ATCGCCGGGA TCGTCATGAC GGCCGGCTGC
AAGTTTCGTT GCTCCTACTG TCCGATACCT GCCTACAACC AGTCGCAGTT CCGGGCCAAG
AGCGGTGAGC GCATCGCTGA GGAAATCGGG CAGATCGCCA CCACCTTCGG CCTCTATAAT
TTCTTCGGCG CCGACGACAA TTTCTTCAAC AACACCAAAC GCACGCTTGA TATCGCCGGG
ACCCTGGCCC GCAAAGCCCG GGCCGGCCGA CCGTTCTGCA AGATCAGGAT CGGAACGGAA
GTCACGGTGC ACGACACGGT CGGAATGCGC GATCACCTGC CGCTGATCCG CGACGCCGGT
TTCGCAGCAG TCTGGCTCGG CGTCGAGGAC ATGACCGCGA CGCTGGTCAA GAAGGCGCAA
GACAAGGACA AGACCGAACT CGCGTTCCGA TTGCTGCGGG AAAACAACAT TCTGCCGATG
CCGATGATGA TGCATCACGA CAGCCAGCCG CTGGTGACCT GGAAATCGAA CTACGGTCTC
ATCAACCAGA TCAGGTTGTT GCGCAAGGCG GGATCGATCA CCACGCAAGT GATGATGCTG
ACGCCGGCGC CCGGATCGAA ATGGTACGAG AACGTGTTCG ATTCGGGGAT GGCTTTCAGC
AAGGCGGGCG ATGTCGCGGT CGAGCCGCAC ATCATGGACG GCAATTATGT CGTCGCCTCG
AAACACGCCC GGCCTTGGCT CAAGCAGATG AATCTGCTGC TCGCCTACAC CTACTTCTTC
AATCCTGTTC GCTTTCTGAG CGCGCTGATC TTCTCGAAGT CGGCGGGGGT GTTCTCGAGC
GCGGAAACCA GGCCGGCAGA AGAGGTCGAG GATTACTCGG CCGCGAAGAA GATGCTTCGC
CGCGCCGAAC TCAGAGCACG GGCTCATCTG ATCGATGCCG GCGCGCAGAT CTACGGCATG
ATCGGGCTGA TCCAAACCTA CCGCCGAACG GCGGGATGGG CATGGCGGCT GTTTCGCGAT
GATATTCAGC GATACGACCG GGCTCCCGTC AGTCCGATCC CAATGCGCGG CGTCGGCGAC
AAGCCTTCCG ATCACGCGCT CCCGGGAACG ATTGATCTGC GCGACCTCGG AAAGCCCGTC
GTTCGCGAAG CGTCGGCGCG GGACATGGCG GGGACTGCCT GA
 
Protein sequence
MELPRRARGD ELLAPGKMLA LRERLRHLSG SHDLTTVIVS AFDHRTRVLP FILADTRMAP 
GGVRAIGSAL VDVGFDKTRI VLQQWNRNFS PLHMRLDGRI PDLFLVSSMH LHSAECDRLI
REACRIEPDK RPLIIVGGPR IRYEPWHVFG ADPNANWAAD VAVTGEEFVF LELLEVLLSM
RAGGESMRSV FARARDSGAL DHVAGIVYAR SAPRGGPIEE LIDTGVQRLL GDLDELPDSV
HGYKLLETPS NETTLASHAL PANRVRKFSP IAGIVMTAGC KFRCSYCPIP AYNQSQFRAK
SGERIAEEIG QIATTFGLYN FFGADDNFFN NTKRTLDIAG TLARKARAGR PFCKIRIGTE
VTVHDTVGMR DHLPLIRDAG FAAVWLGVED MTATLVKKAQ DKDKTELAFR LLRENNILPM
PMMMHHDSQP LVTWKSNYGL INQIRLLRKA GSITTQVMML TPAPGSKWYE NVFDSGMAFS
KAGDVAVEPH IMDGNYVVAS KHARPWLKQM NLLLAYTYFF NPVRFLSALI FSKSAGVFSS
AETRPAEEVE DYSAAKKMLR RAELRARAHL IDAGAQIYGM IGLIQTYRRT AGWAWRLFRD
DIQRYDRAPV SPIPMRGVGD KPSDHALPGT IDLRDLGKPV VREASARDMA GTA