Gene RPB_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0939 
Symbol 
ID3909793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1085686 
End bp1086873 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content68% 
IMG OID637882832 
Productnitrile hydratase regulator 
Protein accessionYP_484560 
Protein GI86748064 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.108797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTTCGA TCGGGGATTG CGGAGGCGGT GGCGCCGGGC TTGCGCCGCC GCTCGGGTTG 
CTTGAGAGGG CATCTTCGCT CGCCGACGCG GCGTGGGTGG AAGGGCCGGG CGACGGCGCC
GGTGCCGCGG CGCGTCGGGC GCGCAACAAA TTGCGCATCG CCAATTTCGT CACTTTTCAG
GGCGCGCCGG GAATCTGGGG GCCTGCCTCG AGCAATGCCG CGCTGCTGGC TGCCGCCGAA
ATCAACAAAC GCGGCGGGAT CCTCGGCCGC GAAATCGAGT TGGTGATGTG CGACGCCGGC
GGTCCGATCG AGGATGTCGC GCGGCGGGTG GCGCAGGCGG TCGATTTCGA CGACGTCGAC
ATCGTGATGG GCTCGCATAT CAGTGCGGTC CGCGTCGCGC TGCGCAAGGC GATCCGCGGC
CGCGTGCCGT ACATCTACAC GCCGGTCTAT GAAGGCGGCG AACGCACGCC CGGCGTGATG
GCGATCGGCG AGACGCCGCG CTGGCAGAGC CGGCCGGCGA TCGACTGGCT CACCCAGGTC
AAGAAGGCGC AGCGCTGGTA TCTGATCGGC AGCGACTATG TCTGGCCCTG GCTGTCGCAT
CGCGCGGTCA AGACATACAT TAAGAACGCC GGCGGCCAGG TGGTCGGCGA GGAATTCGTG
CCGCTCGGCG AGGACGATCA CGAGCGCCAT CTCGCGCGTA TTCGCGCCGC GCGTCCCGAC
GTGGTGCTGA TCTCGCTGAT CGGCGCCGAC AGCGTCACCT TCAATCGCGC CTTCGCCGAA
TGCGGGCTGG CCGGTGGCAC TCTGCGGCTT GCCGGCGCGA TGGACGAGAC CGTGCTGCTC
GGCATCGGCG CCGACAACAC CGAGAACCTG TTCTGCGCCT CGGGCTATTT CGGCTGCCAC
GACTCCAGCG CCAACGATCA ATTCCGCGCT GCCTGCCTGA GGGCTTTCGG GCCGACCGCG
CCGCCTATCG GATCTGTCGG ACAATCCAAC TACGAAGGCT TGCGATTCCT GGAGGCCGTC
GCCGACAAGG CGCAGACGCT GGCCGCGCGT CCATTGCTCT CCGCGGCCAA GAACGTCGTC
TACAACGGCG CGCGCGGCGC CGTGACGATC CGCGACGGCC GTGCGCGGAT GACGATCCAT
CTCGCCGAAG CCGACGGCCT CGATTTCAAG CTGATCCGCA CGTTCTGA
 
Protein sequence
MLSIGDCGGG GAGLAPPLGL LERASSLADA AWVEGPGDGA GAAARRARNK LRIANFVTFQ 
GAPGIWGPAS SNAALLAAAE INKRGGILGR EIELVMCDAG GPIEDVARRV AQAVDFDDVD
IVMGSHISAV RVALRKAIRG RVPYIYTPVY EGGERTPGVM AIGETPRWQS RPAIDWLTQV
KKAQRWYLIG SDYVWPWLSH RAVKTYIKNA GGQVVGEEFV PLGEDDHERH LARIRAARPD
VVLISLIGAD SVTFNRAFAE CGLAGGTLRL AGAMDETVLL GIGADNTENL FCASGYFGCH
DSSANDQFRA ACLRAFGPTA PPIGSVGQSN YEGLRFLEAV ADKAQTLAAR PLLSAAKNVV
YNGARGAVTI RDGRARMTIH LAEADGLDFK LIRTF