Gene RPD_0082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0082 
Symbol 
ID4020537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp97298 
End bp98515 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content67% 
IMG OID637960259 
Producthypothetical protein 
Protein accessionYP_567223 
Protein GI91974564 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.552116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0373517 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGA CCCTCTTCTA TCTTGGCGAC GCTCCGGTCA GCATTGGCGC GGCGCTGTTC 
GGCGCGAGCG CGATGGCGTT GCTGTTGCTG CTGGCGATCG TGCTGGTGAT CGCGCACGGG
CTGCAGAGCG GCAGCGCCGC GGCCCTGGCG CAGGCCCGCC GCGCCTCTGA CCTGGAGCAA
CGCCTGTCGG GCCTGATCAG GTTCCAGAGC GAAGCCAATG GCCGGGTCGA CGCGATGGGC
CGGGCGCTGG CGGGGCGGCA GGCCGAAATG GCGCGCGCGG TCAGCGAGCG GCTGGATACG
GTCACCCACC GGGTCGGCCA GTCGATGACG CAATCGACCC GCCACACCAT GGAAAGCCTG
CAGGCGCTGC ACGAGCGGCT CGGCATCATC GATCGCGCCC ACGACAACCT CACCGAGCTG
ACCGACCAGG TGACGTCGCT GCGCGACGTG CTCGCCAACA AGCAGGCTCG CGGCGCGTTC
GGCCAGGCGC GGATGGAGTC GATCGTGCAG GACGGGATGC CGAAGGGCGC CTACGCCTTC
CAGTACACGC TCTCCACCGG CAAGCGGCCG GATTGCGTGG TGTTCCTGCC CGACCAGCGG
CCGCTGTGCA TCGACGCCAA GTTTCCGCTC GAGGCGGTCA CCGCGCTCCG CGAATCCCGC
AGCGACGGAG AGAAGAAGGC GGCGTCGCAG CGGCTGCGGC TCGACGTGAT GCGGCATGTC
GACGATATCG CGGCCAAGTA TCTGATCCCC GGCGAGACCC AGGACACCGC GTTGATGTTC
GTGCCATCGG AATCGGTCTA TGCCGAGATC CATGACGGCT TCGACGATGT GATCCAGAGG
GCATATCGCG CCCGCATCGT GCTGGTGTCG CCGTCGTTGC TGATGCTGGC GATCCAGGTG
ATGCAGCAGA TTCTGAAAGA CGCGCGGATG CGCGATGCCG CCGATCAAAT CCGAACCGAA
GTGCTGAGCC TCGGCGACGA TCTCGCGCGG CTGCGCGAGC GTGTCACCAA GCTGCAAACC
CATTTCGGCC AGGTCAACGA TGACGTCCGC CAGATCCTGA TCTCGGCCGA CAAGATCGAA
CGCCGCGCCG TGCGGATCGA GGAACTGGAT TTTTCCGCGG TCGAACCGTC GACCGGCACG
CAAGCGCCGC TGGCGCCGGA AGCCAGAGAC CTGTTCGCGT CCCGCGCGTT CAAGATCGAC
GAAGTCGCTT CAGACTGA
 
Protein sequence
MNETLFYLGD APVSIGAALF GASAMALLLL LAIVLVIAHG LQSGSAAALA QARRASDLEQ 
RLSGLIRFQS EANGRVDAMG RALAGRQAEM ARAVSERLDT VTHRVGQSMT QSTRHTMESL
QALHERLGII DRAHDNLTEL TDQVTSLRDV LANKQARGAF GQARMESIVQ DGMPKGAYAF
QYTLSTGKRP DCVVFLPDQR PLCIDAKFPL EAVTALRESR SDGEKKAASQ RLRLDVMRHV
DDIAAKYLIP GETQDTALMF VPSESVYAEI HDGFDDVIQR AYRARIVLVS PSLLMLAIQV
MQQILKDARM RDAADQIRTE VLSLGDDLAR LRERVTKLQT HFGQVNDDVR QILISADKIE
RRAVRIEELD FSAVEPSTGT QAPLAPEARD LFASRAFKID EVASD