Gene RPB_3976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3976 
Symbol 
ID3911783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4539177 
End bp4540610 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content67% 
IMG OID637885880 
ProductLhaA protein 
Protein accessionYP_487580 
Protein GI86751084 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.27032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.197279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA TGGCGGCGAC ATTGGCGAAA GGCTGGATGC GTCTCGGGAC GCGTTTCCTG 
CCGTTCGCCG ACGCGGCGAC GAAAGAGCTC CCGCTCGGCC GTCTGCTGCG CCTGTCGCTG
TTTCAGGTCT CTGTCGGCGC TTCGGTGGTG CTGCTCAACG GCACGCTGAA CCGGGTGATG
ATCGTCGAGC TCGGCGTCTC GACGCTGCTG GTCTCGCTGA TGGTGTCGCT GCCACTGATC
TTCGCGCCGT TCCGCGTGCT GATCGGATTC AAGTCGGACA ACCACCGCTC GGTGCTGGGC
TGGCGTCGTG TGCCTTATAT CTGGATGGGC ACGATGCTGC AGTTCGGCGG CTTCGCGATC
ATGCCATTCG CGCTGCTGGT GCTGTCCGGC GGCGGCGAGT ATCCGGCGGT CTATGGTCAG
ATCGGCGCTG CTCTGGCGTT CTTGCTTGTC GGCGCCGGGC TGCACACGAC GCAGACCGCC
GGCCTTGCGC TCGCCACCGA TCTGGCGCCG GAAGAATCGC GGCCGCGGGT CGTCGCGTTC
CTTTACGTGA TGCTGCTCGT CGGGATGACC GGCAGCGCGC TGCTGTTCAG CGAACTGCTC
CGCGACTTCA GCGAGCTTCA ACTGATCCAG GTGATTCAGG GCGTCGCCGT CGCCCAGTTG
CTGCTCAACA TCGCTGCGCT GTGGAAGCAG GAGGCGCGCA ATCCCGCGCT CACCTCGGCG
ACCCGTCCGC GGCCGCAGTT CAATCAATCT TGGGCGCGGT TCCGCGCTGC CGGCGGTTCG
AACCGGATGC TGGTCGCGGT CGCGCTCGGT ACCGCGGGAT TCTCCATGCA GGACATCCTG
CTGGAGCCTT ACGGCGCCGA AGTGCTGAAG CTCTCCGTCG GTCAGACCAC GGCGCTGACC
GCGTTTTTCG CGCTCGGCAC GCTGGCCGGC TTCGGCCTCG CGGCGCGGAC GCTCGGACGC
GGCAGCGATC CGTATCGGAT CGCCGGCTTC GGCGCGCTGA TCGGCATCTT CGCGTTTGCG
GCCGTCGCGC TGGCGGCGCC GGCGCAATCG GTTGTTCTGT TCCGGATCGG CACCGCGCTG
ATCGGGCTGG GCGGAGGCCT GTTCGCGGCC GGCACGCTGA CCGCAGCGAT GCAGATCGGT
TCCGACAGCG AACCCGGGCT CGCGCTCGGT GCCTGGGGCG CGGTGCAGGC CACCGCGGCG
GGCGGCGGCA TCCTGCTCGG CGGCGGTCTG CGCGATTTGT TCGCTTCGCT CGCCGACAGC
GGCATGCTCG GCGCCGTGCT GTCGGGGCCC GCGATCGGTT ACGGCTTCGT CTACAACATC
GAGATCGCGT TGCTGTTCGC AACGTTGGTT GCGGTAGGTC CTCTCGTGCG GGTCGCACGG
CCGAACTACG CGCAGCCTTC ATCCAAGTTC GGCCTAGCCG AATTTCCAGG TTAA
 
Protein sequence
MSQMAATLAK GWMRLGTRFL PFADAATKEL PLGRLLRLSL FQVSVGASVV LLNGTLNRVM 
IVELGVSTLL VSLMVSLPLI FAPFRVLIGF KSDNHRSVLG WRRVPYIWMG TMLQFGGFAI
MPFALLVLSG GGEYPAVYGQ IGAALAFLLV GAGLHTTQTA GLALATDLAP EESRPRVVAF
LYVMLLVGMT GSALLFSELL RDFSELQLIQ VIQGVAVAQL LLNIAALWKQ EARNPALTSA
TRPRPQFNQS WARFRAAGGS NRMLVAVALG TAGFSMQDIL LEPYGAEVLK LSVGQTTALT
AFFALGTLAG FGLAARTLGR GSDPYRIAGF GALIGIFAFA AVALAAPAQS VVLFRIGTAL
IGLGGGLFAA GTLTAAMQIG SDSEPGLALG AWGAVQATAA GGGILLGGGL RDLFASLADS
GMLGAVLSGP AIGYGFVYNI EIALLFATLV AVGPLVRVAR PNYAQPSSKF GLAEFPG