Gene RPD_1974 details


Gene Information

Locus tag: RPD_1974
Symbol:
ID: 4022456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2213099
End bp: 2214403
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 54%
IMG OID: 637962167
Product: hypothetical protein
Protein accession: YP_569110
Protein GI: 91976451
COG category: [V] Defense mechanisms
COG ID: [COG4268] McrBC 5-methylcytosine restriction system component
TIGRFAM ID:
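
The length fields above agree with the listed coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (the function names are illustrative and not part of the source database):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 2213099
END_BP = 2214403


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1305 434"
```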


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00010435
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.369682
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCGCGC GTCCTGAACC TGACGGAAAG ATCGTCTATA GCGTTCTCTC CCGACAGGCG 
TTTGAACTAG ATCTGGCGGA TGTCACTTTG GACGGACGGC TTGAGATCCT ACCGCATGTC
GAAGAGAGGG GACTGCTCTT CCTGCAATTC AGGAAAAGCA AACTGATCGT AACGCCAGGT
GCTTACATTG GCCTTGTTCC ATTAACGCGT AAGATCGCGT TCGATGTCAG ACCAAAGTTT
CCCGTGAGCA ATCTTGCCCG CGTTATTGAC ACTTCCAAAC GACAGTTAAA TTCGATTCCT
GGCGCTGACC GGTCATACCT TGCGAACGAT CTGTCTGGCG GCAGCGTCTT GAATTTTCTC
GCCGCCAATC TTGTAGACGC GTTGCGTCCT ATCGCCGCAA GAGGTCTTCA CAAGGAGTAT
TCTTGCCGCT CGGAAACTAC AAGCCATCCC CGTGGACGCA TTGAGATTGC CGGCACGATG
CGTGGCTGGT CGCGCGGACA ATTCCACAAA GTGCAGGCGC AGCGATTCGA CCAAACCTCG
GACTTGCCAG TGAACCGGAT CCTAAAGGCA GCACTGGAAT CTGTCTTGAA GCTAATGTGG
CCTCATTCCA CCGAAAGTCG TCGGCTGATA GTGCGCGCCA ATGCGTCATT CCTGGAGTTT
CCCCAGCTCG TCGGTAGTTG TAAGCCGTTG GATTTGGCGG AGAGTCAGGC GATTCTGGCC
GCGAGAAGTC TTCCCGCCGA TCGCATTTAC TACTACCGCG CGATCGAGAT TGCGCTGCTC
ATACTGTCTA GCCGCGGCAT TTCTTTGCAG GAGGAGGGAG TCGATGTTCT CCTGGATAGT
TTTATCATCA ACTTCGACGA TTTGTTTGAG GAGTATCTCC GTCGCGTCCT GCAGGCGCGA
GCACCGAACC TTCTCAGCGT CAAGGACGGC AACTTCGAGG GGAAGCGCCA GCTATTCGAA
GATCGCAAAG ATCAGCCAGC GCAGCCCGAC GTGGTGCTTA CGTGGCAACC AACCAGCGTC
AATGTCGTCG GTGAGATAAA GTACAAGGAT AGGCCTTCGC GTGACGACAT CAATCAGGCT
ATCACTTATG CGCTCTGTTA CAATACCAAA TGCGCCGTTC TTATCCACCA ATGTCGATCG
GGCGAGTCGC GTGGATTGCG ACACCATGGT ACGATTCGTG GAATACGGTT AGAGAACTAT
GCCTTCGACC TCGGTGCCGC GAATCTTGAC GCGGAGGAGG AAGCTTTCGC TACGGCCATG
TTTGACTTAG TGCGCATGCA GCTATCGGAA AACGTGGCAG CTTAG
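
The GC content field can be recomputed from a sequence laid out like the one above. A minimal sketch, assuming the sequence is given as blocks of bases separated by spaces and line breaks (the helper name `gc_fraction` is illustrative); only the first line of the gene sequence is used here as sample input, so the printed value is not the whole-gene figure:

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = "TTGACCGCGC GTCCTGAACC TGACGGAAAG ATCGTCTATA GCGTTCTCTC CCGACAGGCG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` gives a value that can be compared with the GC content field in the record.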
 
Protein sequence
MTARPEPDGK IVYSVLSRQA FELDLADVTL DGRLEILPHV EERGLLFLQF RKSKLIVTPG 
AYIGLVPLTR KIAFDVRPKF PVSNLARVID TSKRQLNSIP GADRSYLAND LSGGSVLNFL
AANLVDALRP IAARGLHKEY SCRSETTSHP RGRIEIAGTM RGWSRGQFHK VQAQRFDQTS
DLPVNRILKA ALESVLKLMW PHSTESRRLI VRANASFLEF PQLVGSCKPL DLAESQAILA
ARSLPADRIY YYRAIEIALL ILSSRGISLQ EEGVDVLLDS FIINFDDLFE EYLRRVLQAR
APNLLSVKDG NFEGKRQLFE DRKDQPAQPD VVLTWQPTSV NVVGEIKYKD RPSRDDINQA
ITYALCYNTK CAVLIHQCRS GESRGLRHHG TIRGIRLENY AFDLGAANLD AEEEAFATAM
FDLVRMQLSE NVAA
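
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments (which table 11 shares) and assuming the table 11 initiator set listed in the code; an initiator codon in the first position, such as the TTG that opens this gene, is read as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate` are illustrative:

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiator codons for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("TTGACCGCGC GTCCTGAACC TGACGGAAAG"))  # prints "MTARPEPDGK"
```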