Gene RPC_4223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4223 
Symbol 
ID3972946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4697941 
End bp4699194 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content58% 
IMG OID637927326 
Producthypothetical protein 
Protein accessionYP_534066 
Protein GI90425696 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGATA TCAGCGATTG CAGCAGTTGG ACGGAGGCGG TGGCACCGAG ACGGGATGAT 
CCCGTTCTGT TTCTGCATCC GCCGAAGAGC GCCGGGATGA CCGTCCTTTC GTTTTTTCGC
TTGAATCAGT CCGGCGATAA TTTCGTCAAT TTGAGCCGGG ATTCCGGCGG CGTTGCGGCC
AATACGCCTC GTTATCTTCA GACCAGGATC GGAGGCGGCC ACCTGGTCTA TGGCGTCCAC
CATCAACTCC GAAGGCCACT GAACTACGTG ACGATCCTTC GCGATCCGCT TCAAAGGCAG
ATTTCGCATT TCCATTATGT TCGAACCGGC AAGAACGGAG TGATGAGCGA GGGCTGCAGC
GTCTCCTCGG AGGAGAGTCT CGTCTATCGA GGGGCCATCA CGCTGGAGGA ATGGGTCGAA
AATTCCCTCC AGGAAACCAA TTTGCTCGTC AAAATGCTGA GCGGTAAAGC GCCCAACGAG
CGCTCGCTCG CGGAGGCCAA GGCCAACATC GAATCCGGAA GAATCTTCGC TGGACTTGCC
GAAGACATGG AGTCCTACTT GCTGCTTCTG TGCGGGCGAA CCGGTTTGTC GCGTCCATTT
CATTTCGACA CAAACCGTAC CAGGACGATT GAAAAGAGCG ACTCGCCGAG TCCAGCGGCC
ATCGCGAGCT TCAAACATCT CAACCGGCTC GACTACGAGT TGTTTGAGTT TGTGAAATTG
CGGACCCGGT GCCTCGTCGC AGAGCTCCCG GACTTCTTTG AGAAGGCGCT GACCGATGTC
CGAGTGATCC AGCGCAGAAT AGATCACCTC GCGAATCCTC ACGAACACAA TTACATCGAC
CGCGGTTTTG ACGCCGGCTT CCTCTCAAGG GTTTGCGATT GCATCGGAGG CGGCGTTGAG
TCGATCGACC GATGCATCGA GGCACTACGG CCTCTGCTGG GCAGCCAGCC GCCGTTTCAT
CACGGCTTCG TTGACGACGT CAGCAACGGT GTGGTGACCG GATGGGCCGT CAATCTCGAA
GCTCCCGGCG AGAGATTGTT GATCGAGATT CGTTCGGCCG CCGAGGTGAT CGCCACCGGC
TGGACCGACC TGCCCAGGCC GGATGTCAGC CAGGCCGGAT TTGGCAACGG CGGATCTGGC
TTTTCGATCA AGTTGCCGGA AGGCTTTTCC GACGACTTCG TGGTCGGCAT CCGCGGCGCC
TTCGATGGTC TGCAACATGC CGGCCCTTGG AAGCTTGGTT GGAATTGTTG CTGA
 
Protein sequence
MIDISDCSSW TEAVAPRRDD PVLFLHPPKS AGMTVLSFFR LNQSGDNFVN LSRDSGGVAA 
NTPRYLQTRI GGGHLVYGVH HQLRRPLNYV TILRDPLQRQ ISHFHYVRTG KNGVMSEGCS
VSSEESLVYR GAITLEEWVE NSLQETNLLV KMLSGKAPNE RSLAEAKANI ESGRIFAGLA
EDMESYLLLL CGRTGLSRPF HFDTNRTRTI EKSDSPSPAA IASFKHLNRL DYELFEFVKL
RTRCLVAELP DFFEKALTDV RVIQRRIDHL ANPHEHNYID RGFDAGFLSR VCDCIGGGVE
SIDRCIEALR PLLGSQPPFH HGFVDDVSNG VVTGWAVNLE APGERLLIEI RSAAEVIATG
WTDLPRPDVS QAGFGNGGSG FSIKLPEGFS DDFVVGIRGA FDGLQHAGPW KLGWNCC