Gene RPB_2501 details


Gene Information

Locus tag: RPB_2501
Symbol:
ID: 3910290
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2863041
End bp: 2864807
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 68%
IMG OID: 637884400
Product: hypothetical protein
Protein accession: YP_486117
Protein GI: 86749621
COG category: [S] Function unknown
COG ID: [COG1376] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
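
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2863041, 2864807)
print(length)                     # 1767
print(protein_length_aa(length))  # 588
```

Both results match the Gene Length and Protein Length fields of this record.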


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.732185
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGTGA ACGAGTTTCG CGGTTGCGAC ACTCATCTGA GATCAATCAA AAGATTCGAG 
GACCACCGCG TGCGCCTGGA ACAACCGACG ATGCCGCGCG CAAGCGCTCC GCGAACCGGA
CCGATTTTCA AAGCCGCATG GCGCGCCGGG ATTCTGACGG CGGCGGGCGT CTTCGCACTG
TCCTCCGAGG CTCATGCCGC GAACTGGTGG TGGCCCGAGA ACGACGACGC GGTCTACATC
CCGGCGCAGC CGGCGCCCAA GCGGTATCAA CGGAAGCAGC CGCATCTCGA TACGCGGCAG
CAGAAGCTGA TCGAGAAGCC GACCGCCAAG CCGCAGGGGC CGCTGGTGAT CGCGGTGTCG
ATCGAGCAGC AAAAGATGCG CGTCTACGAC GCCAACGGCT TCTTCGCCGA GGCGCCGATC
TCCACCGGGA TGCGCGGCCA CGCCACGCCG ATGGGCGTGT TCAGCGTCAT CCAGAAAAAC
AAATGGCACC GCTCCAACAT CTACAGCGGC GCGCCGATGC CCTACATGCA GCGCCTGACA
TGGTCGGGCA TCGCGCTCCA TGCCGGCGTG GTGCCGGGCT ACCCGGCTTC GCATGGCTGC
ATCCGGATGC CGACATCGTT TGCGACGAAG ATGTGGGGCT GGACCCGGAT GGGGGCCCGC
GTGATCGTCA CGCCCGGCGA CATCACGCCG ACCCACTTCA CCCACCCATT GCTGACAGCC
AAACGGCCCG CGCCGGCGGA CGCGCCGATG GCCTCCGATC CGCAGAAGCC GGGCACGGCG
CCGAAATCCG ACAAGGCCGC GACGGCCGAA CCGGCCGCCG CCCCCGCCGC CGGACTGCAT
CCGGAACTGC GTGCCGGCAT CCTGGGCGAC GCCCCCCGCC CGCCGGTCCA GACCGCCGAC
GCCAGCGCGG CGGACCCTGG CGCCGCGCTC GTGCTGTCCG ACTCGCCGGC GCGAGAAAAC
GCCTCGGCAG AGCCAGCCCC CGCGACGAGC CCGGCCGACG CCCACGCCGA AGCGACAGTC
GACTCGATCG CCAAGGACGA GGCCGCCGAG CCCACCGCCG CCGTTGCCAA GGACATCGCC
GAAGCGACCG AAGCCCGGCC GGACGACCTC GATCCCGCGA CCACCGCCAC CGTCGTCATC
GCTCCCGGAT CCCCTCCCCA AGCGACCGAA AAAGCGCCTT CTTCCAATGA GAAGGCGCCA
TCCGTTGAGT CCGCGGACAA GCTCGACGGC AAGCCGGATG CCGCCAGCCA GAGCAGCGCC
CCGGGCAAGG ATCAGTCGCG CCCGGGCGAT CCGGCTGCGC CGCCGGCGGC AGAATTGCCG
GCAGCCAAGC GCGGCGGCGG CCAGATTGCG ATGTTCGTCA GCGCCAAGGA CCAGAAGCTC
TACGTCCGTC AGAACATGAC ACCGTTGTTC GACGTTCCGG TGGTGATCGC GGCAGGCGAG
CGGCCGCTGG GGACGCATGT GTTTACCGCC GAGCTGGCGA AGGATGACGG CGTTCGCTGG
ACGGTGGTGT CGCTGCCGGC ACCGCAACGG GTCAATGATG CGGGCAATTC CCGCCGATCC
AAGAAGACCT TGCCGACACC TGTGAAAGCA TCGGTGGAAA CAGAAGGCCC CGCCGCAGCG
CTCGATCGCC TGACAATTCC CGCCGAAGCC ATGGCGCGGA TCGCCGACGC GATCACCACC
GGAACATCGT TCATCGTCTC CGATCAGGGC ATCACTGCGA GCGGCGAAAC CGGCCGGGGG
ACCGATTTCA TCATCAGCCT GCGGTAG
 
Protein sequence
MRVNEFRGCD THLRSIKRFE DHRVRLEQPT MPRASAPRTG PIFKAAWRAG ILTAAGVFAL 
SSEAHAANWW WPENDDAVYI PAQPAPKRYQ RKQPHLDTRQ QKLIEKPTAK PQGPLVIAVS
IEQQKMRVYD ANGFFAEAPI STGMRGHATP MGVFSVIQKN KWHRSNIYSG APMPYMQRLT
WSGIALHAGV VPGYPASHGC IRMPTSFATK MWGWTRMGAR VIVTPGDITP THFTHPLLTA
KRPAPADAPM ASDPQKPGTA PKSDKAATAE PAAAPAAGLH PELRAGILGD APRPPVQTAD
ASAADPGAAL VLSDSPAREN ASAEPAPATS PADAHAEATV DSIAKDEAAE PTAAVAKDIA
EATEARPDDL DPATTATVVI APGSPPQATE KAPSSNEKAP SVESADKLDG KPDAASQSSA
PGKDQSRPGD PAAPPAAELP AAKRGGGQIA MFVSAKDQKL YVRQNMTPLF DVPVVIAAGE
RPLGTHVFTA ELAKDDGVRW TVVSLPAPQR VNDAGNSRRS KKTLPTPVKA SVETEGPAAA
LDRLTIPAEA MARIADAITT GTSFIVSDQG ITASGETGRG TDFIISLR
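
The relationship between the two sequence blocks above can be reproduced with a short script. This is a minimal sketch, not the method used to generate this record: it assumes the sequences are pasted as plain text with spaces between 10-character groups, and it relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code (it differs only in the set of permitted start codons). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    # Assumption: the first codon is a start codon and is read as M.
    if residues:
        residues[0] = "M"
    return "".join(residues)


# First and last groups of the gene sequence shown above.
print(translate("ATGCGGGTGA ACGAGTTTCG CGGTTGCGAC"))  # MRVNEFRGCD
print(translate("ACCGATTTCA TCATCAGCCT GCGGTAG")[1:])  # DFIISLR
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared against the protein sequence and the GC content field of this record.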