Gene RPD_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2000 
Symbol 
ID4022482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2236920 
End bp2238320 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content66% 
IMG OID637962193 
Producthypothetical protein 
Protein accessionYP_569136 
Protein GI91976477 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.203804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA CACCGACCGA ACGGCTGAGG GAGTACCTCG CCCAGCTCCC GCCTCAGTCG 
CAGGCGCTGC TGATGCGGGA GTTCGAGCGT GCGCTGGAGC GTGGCGACGA GGTCGCCGTG
GCCAGCTTCG TGCTCGAAGA GCTGCGCAAG ATCGTCCGCG GTTCCGATGA AGAATCCGCG
CCGCGGACCG ACGATCCGGC GCGGTTGATG TTCCGCTCGC TCGAGCCGTT TCTGATCGAC
AACAGCCAGC AGCCTCGGCC GGGCCAGATC CGCCGCGCGT CGCTGAGCTC GATCTGGCAA
TGGCTGGTGA GCGAGGGAAT TCCCACGCCA GTCCGGGAAT TCGAAGCCGA CCTGATCAGG
TTGCGCAAGG GCTCCGCCGT CGAGATCGAC GCGCTGGTCC GCAAGCTGCA GGCCGTGGCG
GCCGAGGCGA TCGACAAGGT GATCAACCCG GAGCCGGGGA TCGACCGGCA GCGCGCGATG
GCGCGGGTAG GGCCGCCATC GGCGGTCGAG GATCTGCCGG CTATCGGGGC GGTGCTCAAG
AACCGCGAGG CGCTCGAAAC CTTCGACGCC AAGCTGTCGT CGAATCTCAG GGCGTTCGGC
GACTCGCAGG TCACGTCGAT GATTGCGTCG CTCAACGTTC CGGCGCTGCA AACCCCGACC
ATGCTGCCGT TCGCGCTGAC GATGATCCTC GCCCATTTGA CCCAGCCGTG GCAGATCGTC
CGGCTGGCGC TCAAGGTCGC CGGCTCCGAC GACGAGATCA GGGTCGCCGC CACGCCCTAT
GGCGTCGCCG TCACCATGGC GATCCACGAC GTCGCCCAGC TCACCGCCGA CCTGCGCGAC
GAGATCAAGC GCGGCCATTA CAGCAATGTC GCCGAGAAAC TGAAACTGGT CCATGACGGC
GTGCGCGGGC TGCGGACCGA ACTCGATATC CGCAGCGACT CGACCTGGGG CAAGCGGCTC
GCGGCGATCC GCGTCGACAT TTCCAACGCG CTGAAATCCG AGATCGAAAG CGTTCCGGGC
CGTGTGCGCC GGTTGCTGCG GCAACGGCCC GACAAGGAGA TCTCGGCCAA CAGCCGGATC
GACCAGATCG AGGTCGATGA AGCCGCGGCG CTGATCGACT TCGTTGCGAT CTGTCGCAAC
TACGCGAGCG AACTGGCGAT CAACGAGATG ACGTTGCGGA CCTATTCCGA GCTGCAGCAA
TATGTCGAGA AGTCCACCGA GGCGCTGGTG CAGTCGCTGC GCGGCTGCGA TCCGCGGGTG
AAGCCGTTCC GGCACATGCA GGCGCTTGCC GCGATCCGGT TCTGCGAAGT GCTGTTCGGC
CACGACTACG GCCAGCTGAT GCGCCGGGCG GTGGAAAGCG CGATGGTCGT GGTCGACCGC
AAGCCGGCCC GGGCGGGGTA A
 
Protein sequence
MSQTPTERLR EYLAQLPPQS QALLMREFER ALERGDEVAV ASFVLEELRK IVRGSDEESA 
PRTDDPARLM FRSLEPFLID NSQQPRPGQI RRASLSSIWQ WLVSEGIPTP VREFEADLIR
LRKGSAVEID ALVRKLQAVA AEAIDKVINP EPGIDRQRAM ARVGPPSAVE DLPAIGAVLK
NREALETFDA KLSSNLRAFG DSQVTSMIAS LNVPALQTPT MLPFALTMIL AHLTQPWQIV
RLALKVAGSD DEIRVAATPY GVAVTMAIHD VAQLTADLRD EIKRGHYSNV AEKLKLVHDG
VRGLRTELDI RSDSTWGKRL AAIRVDISNA LKSEIESVPG RVRRLLRQRP DKEISANSRI
DQIEVDEAAA LIDFVAICRN YASELAINEM TLRTYSELQQ YVEKSTEALV QSLRGCDPRV
KPFRHMQALA AIRFCEVLFG HDYGQLMRRA VESAMVVVDR KPARAG