Gene RPD_2976 details


Gene Information

Locus tag: RPD_2976
Symbol:
ID: 4023479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3315413
End bp: 3316690
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 64%
IMG OID: 637963175
Product: aminodeoxychorismate lyase
Protein accession: YP_570103
Protein GI: 91977444
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
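
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3315413
end_bp = 3316690

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields.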


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.202928
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAAA GGCCGCCGAT CTCGCCGAGA AGCCCGCGTG CGGCGCTGGA ACCGGAGCAA 
CTCCCGCCGC CGCCGAAGCG GTCCGATCGG GCGCGCAGCC CGTTGGTCAT CATCGGCAAC
GCCATCATCA CCATCCTGCT CGTCCTGATG ATCGGTGCCG GCGGCATCTA CGTCTACGGC
AAGCAGAAGA TCGAGGCGGC CGGTCCGCTG CAGGAAGACA AGGTTGTTAA TATTCCGCAG
CGTGCGGGGC TCGGCGATAT CGCCGAGATC CTGCAGCGTG AAGGCGTGAT CGAGAACAAT
CGCTGGGCTT TCATCGGCAG CGTGTTGGCC TTGAAGGCGC GTTCGGAACT GAAGCCCGGC
GAATATTCGT TCCATAAGAA GGCCAGCCTG CGCGACGTCA TCGGCACCAT CGTTGAGGGC
AAGGTGGTGC AGCACACCGT GACGATTCCG GAAGGCCTGA CCTCGGAACA AATCGTTGCG
CGGTTGTCCG AGAACAACAT CTTCAGCGGA AGCCTGCGCG AAATCCCGCG CGAGGGAACG
CTGTTGCCGG AGACCTACAA ATTCCCGCGC GGGACGATGC GCGATCAGGT GATCAATCGG
ATGCAGCAGG CTCAGAAGCG CGTGCTCGCA GAAGTCTGGG AGCGGCGCAA CCCGGAAATT
CCGATCAAGT CGCCGGAGCA ATTGGTGACG CTCGCGTCGA TCGTGGAGAA GGAGACCGGC
AAGGCGGATG AGCGCAGCCG GGTCTCGTCG GTGTTCATCA ATCGACTGCA GAAGAAGATG
AAGCTGCAGT CCGATCCGAC GATCATTTAC GGCCTGGTCG GCGGCAAGGG CACGCTTGGA
CGGCCGATCA AGCGCAGCGA AATCCAGCAG CCGTCCCCGT ACAACACCTA TGTCATCGAC
GGCCTGCCGC CGGGGCCGAT TGCCAATCCC GGTCGTGCGT CGCTCGAGGC CGTGGCGAAT
CCGGCCCGCA CGCGCGATCT CTATTTCGTC GCCGACGGCA CCGGCGGTCA CGCCTTCAGC
GACGGCTACG ATCAACACTT GAAGAACGTG GCGAAGCTGC GTGCGCAGGA ACGCCAGATG
CAGAACGACA CCGTCGAGCC GGCGGAGGAC GCTCCGCCGA CGGCAACGAT TACTCCCGAT
GCCGACGGCT CGGCGGCCGC GCCGGCGGCG GTGCCCAAGC CCGCGAAAAA TGCCGGGACG
CCGAAGAAGC GAACGCGGAA CGGGACGCAA AACAGCGCCG CCCCAACTGG CGCGCCGGCG
GCGGCAGACC AGGACTAG
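
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields in the record. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first two ten-base blocks of the gene sequence above.
sample = clean_sequence("ATGAGTGAAA GGCCGCCGAT")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence through the same two functions would give values to compare with the Gene Length and GC content fields.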
 
Protein sequence
MSERPPISPR SPRAALEPEQ LPPPPKRSDR ARSPLVIIGN AIITILLVLM IGAGGIYVYG 
KQKIEAAGPL QEDKVVNIPQ RAGLGDIAEI LQREGVIENN RWAFIGSVLA LKARSELKPG
EYSFHKKASL RDVIGTIVEG KVVQHTVTIP EGLTSEQIVA RLSENNIFSG SLREIPREGT
LLPETYKFPR GTMRDQVINR MQQAQKRVLA EVWERRNPEI PIKSPEQLVT LASIVEKETG
KADERSRVSS VFINRLQKKM KLQSDPTIIY GLVGGKGTLG RPIKRSEIQQ PSPYNTYVID
GLPPGPIANP GRASLEAVAN PARTRDLYFV ADGTGGHAFS DGYDQHLKNV AKLRAQERQM
QNDTVEPAED APPTATITPD ADGSAAAPAA VPKPAKNAGT PKKRTRNGTQ NSAAPTGAPA
AADQD