Gene RPB_1829 details


Gene Information

Locus tag: RPB_1829
Symbol:
ID: 3908988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2091628
End bp: 2092941
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 65%
IMG OID: 637883723
Product: hypothetical protein
Protein accession: YP_485448
Protein GI: 86748952
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
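
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes the start and end positions are inclusive, 1-based coordinates and that the gene length includes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2091628, 2092941)
print(length)                  # 1314
print(protein_length(length))  # 437
```

Both values agree with the Gene Length and Protein Length fields of the record.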


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.751801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.279512
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCG AATTCAGCCT CGCCCAGCTC GCCGATCCGG ACATCGCCGA GGCCGACAAG 
ATCCTGCGGG CCTGCGTGCA CTGCGGATTC TGCACGGCGA CGTGCCCGAC CTATGTGCTG
CTCGGCGACG AACTCGACTC ACCGCGCGGT CGAATCTACC TGATCAAGGA GATGCTGGAG
AGCAACAAGC CGCCGACCGC GGACGTGGTC AAGCACATCG ACCGCTGCCT GTCGTGCCTC
GCCTGCATGA CCACGTGTCC CTCCGGCGTG AACTACATGC ATCTGGTCGA TCAGGCGAGG
GCGCGGATCG AGAAGGACTA CACGCGGCCG TTCTGGGATC GGCTCATCCG CGAAGCGCTG
TCGTGGATGA TGCCGCATCC CGGCATGTTC CGGCTCGGCA TGTGGGCGGC GCGGATCGTG
CGCCCGATTG CGGGGCTGCT GCCCGGCTCG CACGATCTCG CCCATCCGAC GTTCCTGAGC
CGGGTCAAGG CGATGTTGGC GCTGGCGCCG AAGCATCTGC CGGAGCGCGG ACCGGACTCC
GGAACCACGT TCCCGGCGAT CGGGACAAAG CGCGGGCGCG TCGCGCTGCT GCACGGCTGC
GCCCAGCAGG TGCTGGCGCC GCGAATCAAC CGGGCCGCGA TCAATCTGCT GACCCGCCAC
GGCATCGAGG TCGTCCTCGC GGCCGACGAG GCCTGTTGCG GTGCCCTGAT CCATCACCTC
GGCCGCGATA CCCGCACGCT CGAATATGCC CGCACCAACA TCAAGGCGTG GCTCGCCGAG
ATCGAGCGCG GCGGCCTCGA CGCGATTCTG GTGACGACGT CGGGCTGCGG CACGGTGATC
AAGGACTACG GCTACATGCT GCGCGAAGAC CCGGCGTTCG CCGCATCGGC CGCGAGGGTC
TCGGCGCTCG CCAAGGATAT CAGCGAATAT GTCGGCGATC TGGAGCTGTC GGTGCCGCGG
CCGCAGCGCG ATGTCGTGGT CGCTTATCAC TCCGCCTGTT CGCTGCAGCA CGGTCAAAAA
GTCACGCGGC TGCCCAAAGA ATTGCTTTCC AAGACCGGAT TCGTGGTGAA AGATATCCCG
GAGAGTCATT TGTGTTGTGG TTCGGCGGGC ACGTACAACA TTCTCCAGCC TGACATCGCG
ACCCGATTGC GCGACCGCAA GGTCGCCAAC ATCGCTGCCG TCAAGCCGGA CATGATCGCC
GCTGGGAACA TCGGCTGCAT GGTGCAGATC GCCAGCGGAA CGGAAGTCCC TGTGGTGCAC
ACGATTGAAC TTCTCGATTG GGCGACAGGC GGTCCCCGGC CTGCGAGTAC CTGA
 
Protein sequence
MKTEFSLAQL ADPDIAEADK ILRACVHCGF CTATCPTYVL LGDELDSPRG RIYLIKEMLE 
SNKPPTADVV KHIDRCLSCL ACMTTCPSGV NYMHLVDQAR ARIEKDYTRP FWDRLIREAL
SWMMPHPGMF RLGMWAARIV RPIAGLLPGS HDLAHPTFLS RVKAMLALAP KHLPERGPDS
GTTFPAIGTK RGRVALLHGC AQQVLAPRIN RAAINLLTRH GIEVVLAADE ACCGALIHHL
GRDTRTLEYA RTNIKAWLAE IERGGLDAIL VTTSGCGTVI KDYGYMLRED PAFAASAARV
SALAKDISEY VGDLELSVPR PQRDVVVAYH SACSLQHGQK VTRLPKELLS KTGFVVKDIP
ESHLCCGSAG TYNILQPDIA TRLRDRKVAN IAAVKPDMIA AGNIGCMVQI ASGTEVPVVH
TIELLDWATG GPRPAST
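
The relationship between the two sequence blocks above can be reproduced with a minimal sketch that strips the 10-character grouping, translates codons using the amino acid assignments of translation table 11, and computes GC content. The helper names are hypothetical. The sketch assumes the sequence is given in the coding direction and starts with ATG, so it does not handle the alternative start codons that table 11 permits.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGAAAACCG AATTCAGCCT CGCCCAGCTC GCCGATCCGG ACATCGCCGA GGCCGACAAG"
print(translate(first_row))  # MKTEFSLAQLADPDIAEADK
```

Each 60-base row of the gene sequence is in frame, so the first row translates to the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be checked in the same way.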