Gene RPC_4795 details


Gene Information

Locus tag: RPC_4795
Symbol:
ID: 3973499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 5355366
End bp: 5356796
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 66%
IMG OID: 637927907
Product: threonine synthase
Protein accession: YP_534636
Protein GI: 162136024
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
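
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch in Python, not part of the original record; the variable names are illustrative, and it assumes that Start bp and End bp are inclusive coordinates and that the CDS ends with a single untranslated stop codon.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 5355366
end_bp = 5356796

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1431 476
```

Under these assumptions the computed values match the Gene Length (1431 bp) and Protein Length (476 aa) fields.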


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.420552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGAGGACG TGTTGACGCA GTATATTTCG ACGCGGGGCG AGGCCCCCAA GCTCGGCTTT 
TGCGATGTCA TGCTGACCGG GCTCGCCCGC GACGGTGGGC TCTATGTGCC TGAGATCTGG
CCGCAATTGG CGCCCGACGC GATCGCGGGG TTTTTCGGCC GGCCGTATTG GGAGGTCGCC
GTCGAGGTGA TCCGCCCGTT CATCGGCGGC GAGATCTCCG ACGAAGATCT CGGCCGGATG
GCCAACGAGG CCTATGCCAC GTTCCGGCAT CCCGCGGTGG TGCCGCTGCG CCAGACCGCG
CCCAGCCAAT TCGTGCTGGA ATTGTTTCAC GGCTCGACGC TGGCGTTCAA GGATGTCGCG
ATGCAGCTAT TATCGCGGCT GATGGACCAC GTGCTGGCGA AGCGCCAACA GCGCATCACC
ATCGTGGTCG CGACCTCCGG CGACACCGGC GGCGCCGCGG TCGACGCCTT CGCCGGGCGC
GACAATGTCG ATCTGATCGT GCTGTTTCCG CACGGCCGGA TCTCCGACGT GCAACGCCGG
ATGATGACCA CGTCGGCGGC GAGTAACGTT CACGCGGTGG CGCTGCAAGG CAATTTCGAC
GATTGCCAGG CGATCGTGAA GGGGCTGTTC AACCACCACA AATTCCGCGA TCGCGTGGCG
CTGTCCGGCG TCAATTCGAT CAACTGGGCG CGGATCATCG CCCAGGTGGT GTACTACTTC
ACCAGCGCGG TGGCGCTCGG CGCGCCGGGG CGCGGCGTCG ATTTCACCGT GCCGACCGGC
AATTTCGGCG ACATCTTCGC CGGCTATGTC GCAAAGCGGA TGGGGCTGCC GATCCGCAAG
CTGAAGATCG CCGCCAACGT CAACGACATT TTGGCGCGCA CCCTGAAGAC CGGGATCCAC
GAGGTCCGCG AGGTCCACGC CACCGCGTCG CCGTCGATGG ACATCCAGGT GTCGTCGAAT
TTCGAGCGGC TGCTGTTCGA GGCCTCGGGC CGCGACGCCG CCTTGGTGCG CGGGTTGATG
GCCTCGCTGC AGCAATCCGG CCGCTACGTG CTGCCGGATC GCGTGCTGGC GGCGATCCGC
GAACAATTCG ACGCCGGCCG CGCCGACGAA GAGGAAACCG CAGCCGCGAT CCGCACCGCC
TGGCGCGAGG CCGGCGACCT GGTCGACCCG CACACCGCGG TGGCGTTGGC GGTGGCCGAC
CGCGACCAAA CCGATTCGAG CGTTCCCAAC ATCGTGCTGT CCACCGCCCA TGCGGCGAAA
TTCCCCGACG CGGTGGAAGC GGCCTGCGGC GTGCGTCCGG ACTTGCCGCT GTGGCTGGAG
GGTCTGATGA CGAGGCCCGA GCAGATCACC ACCCTGAAGC CCGATCAGGC GACGGTGGAA
AACTACGTGC TTGCGGTCAG CCGCGCCGCG AAACAAGGAG TTGCCGGATG A
 
Protein sequence
MEDVLTQYIS TRGEAPKLGF CDVMLTGLAR DGGLYVPEIW PQLAPDAIAG FFGRPYWEVA 
VEVIRPFIGG EISDEDLGRM ANEAYATFRH PAVVPLRQTA PSQFVLELFH GSTLAFKDVA
MQLLSRLMDH VLAKRQQRIT IVVATSGDTG GAAVDAFAGR DNVDLIVLFP HGRISDVQRR
MMTTSAASNV HAVALQGNFD DCQAIVKGLF NHHKFRDRVA LSGVNSINWA RIIAQVVYYF
TSAVALGAPG RGVDFTVPTG NFGDIFAGYV AKRMGLPIRK LKIAANVNDI LARTLKTGIH
EVREVHATAS PSMDIQVSSN FERLLFEASG RDAALVRGLM ASLQQSGRYV LPDRVLAAIR
EQFDAGRADE EETAAAIRTA WREAGDLVDP HTAVALAVAD RDQTDSSVPN IVLSTAHAAK
FPDAVEAACG VRPDLPLWLE GLMTRPEQIT TLKPDQATVE NYVLAVSRAA KQGVAG