Gene RPD_0428 details


Gene Information

Locus tag: RPD_0428
Symbol: rho
ID: 4020894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 491591
End bp: 492856
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 62%
IMG OID: 637960613
Product: transcription termination factor Rho
Protein accession: YP_567567
Protein GI: 91974908
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
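
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 491591
END_BP = 492856


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1266 421
```

Under these assumptions the computed values agree with the reported gene length (1266 bp) and protein length (421 aa).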


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGAAA TGAAACTTCA AGACCTCAAG GCCAAGACGC CGGCCGAGCT TGTCTCGTTC 
GCGGAAGAGT TGGGGGTCGA GAACGCCAGC ACGATGCGCA AGCAGGAGCT GATGTTCGCC
TGCCTGAAGC AGCTGTCGGC GAAAGAAACC GACATCATCG GCGAAGGCGT CGTCGAGGTT
CTGTCCGACG GCTTCGGCTT CTTGCGATCG CCCGATGCCA ACTACCTGCC GGGTCCGGAT
GATATCTACG TTTCACCCTC GCAGATCCGC CGCTTCGGCC TGCGCACCGG CGACACCATC
GAGGGTCATA TCCGCAGCCC GAAGGAAGGC GAACGCTACT TCGCGCTGCT GAAGGTCAAC
ACGCTGAACT TCGAGGATCC CGAGAAGTCC AAGCACAAGG TCAATTTCGA CAATCTGACG
CCGCTGTTTC CGGACGAGCG GTTCCGGCTC GAGATCGACG ATCCGACCCG CAAGGATCTG
TCGGCGCGGG TGATCGACAT CGTCGCGCCG ATCGGCAAGG GCCAGCGCGC GCTGATCGTC
GCGCCGCCGC GCACCGGCAA GACCGTGCTG ATGCAGAACA TCGCGCATTC GATCACCGCC
AATCATCCGG AATGCTATCT GATCGTGCTG CTGATCGACG AGCGGCCGGA AGAAGTCACC
GACATGCAAC GCTCGGTGAA GGGCGAAGTC GTTTCCTCGA CCTTCGACGA ACCGGCGGTG
CGTCACGTTC AGGTCGCCGA GATGGTGATC GAGAAGGCCA AGCGTTTGGT CGAACACGGC
CGCGACGTGG TGATCCTGCT CGACTCGATT ACGCGCCTCG GCCGCGCCTA CAACACCGTG
GTGCCGTCCT CCGGCAAGGT GCTGACCGGC GGCGTCGACG CCAATGCGCT GCAGCGGCCG
AAGCGGTTCT TCGGCGCCGC GCGCAACATC GAGGAGGGCG GTTCGCTCAC CATCATCGCC
ACCGCGCTGG TCGATACCGG CTCGCGCATG GACGAAGTGA TCTTCGAAGA ATTCAAGGGC
ACCGGCAATT CGGAGCTGAT CCTCGACCGC AAGGTCTCGG ACAAGCGCAC CTTCCCGGCG
ATCGACATCT CGCGCTCCGG CACCCGCAAG GAAGAGCTGA TCACCGATCC GCAGGTGCTG
AAGAAAATGT ACGTGCTGCG CCGGATCCTC AATCCGATGG GAACGATGGA CGCGATCGAC
TTCCTGCTCG ACAAGCTGCG CAACACCAAG AACAACTCGG AATTCTTCGA GTCGATGAAC
ACCTGA
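
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as GC content is computed. The following is a rough sketch, not the database's own method; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example runs only on the first two blocks of the sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used here as a small example.
first_blocks = "ATGCGGGAAA TGAAACTTCA"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.4
```

Passing the full gene sequence to the same functions gives a value that can be compared with the GC content reported in the record.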
 
Protein sequence
MREMKLQDLK AKTPAELVSF AEELGVENAS TMRKQELMFA CLKQLSAKET DIIGEGVVEV 
LSDGFGFLRS PDANYLPGPD DIYVSPSQIR RFGLRTGDTI EGHIRSPKEG ERYFALLKVN
TLNFEDPEKS KHKVNFDNLT PLFPDERFRL EIDDPTRKDL SARVIDIVAP IGKGQRALIV
APPRTGKTVL MQNIAHSITA NHPECYLIVL LIDERPEEVT DMQRSVKGEV VSSTFDEPAV
RHVQVAEMVI EKAKRLVEHG RDVVILLDSI TRLGRAYNTV VPSSGKVLTG GVDANALQRP
KRFFGAARNI EEGGSLTIIA TALVDTGSRM DEVIFEEFKG TGNSELILDR KVSDKRTFPA
IDISRSGTRK EELITDPQVL KKMYVLRRIL NPMGTMDAID FLLDKLRNTK NNSEFFESMN
T
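
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand instead of relying on a library; it assumes the sequence is given on the coding strand, and it uses the amino acid assignments of NCBI translation table 11, which match those of the standard code. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}


def translate(blocked: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGCGGGAAA TGAAACTTCA AGACCTCAAG"))  # MREMKLQDLK
```

The output matches the first block of the protein sequence, and translating the final line of the gene sequence, `ACCTGA`, yields the terminal `T` followed by the stop codon.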