Gene RPB_4597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4597 
Symbol 
ID3912414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5191304 
End bp5192674 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content65% 
IMG OID637886501 
Productglutamate--cysteine ligase 
Protein accessionYP_488191 
Protein GI86751695 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3572] Gamma-glutamylcysteine synthetase 
TIGRFAM ID[TIGR01436] glutamate--cysteine ligase, plant type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.4332 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTG ACCAGATCGA TATGACGCCG CTGAACTCGC GCGACGAACT GGTCGCGTGG 
ATCGAAGCGG GCGTCAAACC GCCGTCGGAA TTCCGCATCG GCACCGAACA CGAGAAGACG
CCGTTCACGC TCGAGGGCCA TCACCCGGTA CCTTACGACG GCGCGCGCGG CATCGGCGCG
CTGCTCGAGG GCATGAAGAT CCTGCTCGGC TGGGAACCGA TCATGGAAGG CCCGCACATC
ATCGGGCTGC ACGACGTCAC CGGCGGCGGC GCGATTTCGC TCGAGCCCGG CGGACAGTTC
GAATTGTCCG GCGCGCCGGT CGACAATGTG CACCAGACCC ATTCCGAGCT GATGGCGCAT
CTGGCGCAGG TGCGCGAGAT CGCAGCCCCG CTCGGCATCG GCTTCCTCGG CCTCGGCATG
ACGCCGTCGT GGTCGCGCGA CGACATTCCG GTGATGCCGA AGGGCCGCTA CAAGATCATG
ACCAACTACA TGCCGAAGGT CGGCCGCTAC GGCCTCGACA TGATGTATCG GACCTGCACG
GTGCAGACCA ATTTGGACTT CTCGTCCGAA GCCGACATGG TCAAGAAGCT GCGGGTTTCG
GTGGCGCTGC AGCCGGTCGC GACCGCTCTG TTCGCCAACT CGCCGTTCAC CGAAGGCAAG
CCGAACGGCT TCTTGTCGTT CCGTTCCGAA ATCTGGCGCG ACACCGACAA CGCCCGCTCC
GGCATGATCC CGTGGGCGTT CGAGGACGGC ATGGGGTTCG AGCGCTGGGT CGACTACGCG
CTCGACGTGC CGATGTATTT CGTCAAGCGC GGCGATGATT ACATCGACGT CTCCGGCTCG
TCGTTCCGCG ATTTCTTCGA CGGCCGAAAC GACAAGATGC CGGGCGAGCG ACCGACGCTG
TCGGACTGGG CCAACCATCT GTCGACGATC TTCCCCGAAG TGCGGCTGAA GCGTTACCTC
GAAATGCGTG GCGCCGACGG CGTGCCGTGG GGCCGGCTGC CGGCGTTGCC GGCGTTCTGG
GTCGGCCTCT TGTACGACGA CCAGAGCCTC GACGCCGCCT GGGAGATCGT CAAAGGCTGG
GACGCCTGGG AGCGGCAGGC GCTGCGCGAC GACGTCCCCC GGCTCGGCTT CAAGGCCAAG
ATCCGCAACC GATTTCTGTT CGAGATCGCC AAGGAATGCC TGGTGCTGGC CCATGCGGGC
CTGAGGCGCC GCGGCCGGAT CGATTCGTTC GGCAACGACG AATCGCGGTA TCTCGCGCCG
CTCGAGGACA TCCTCGCCTC CGGCCGCACC CCGGCCGAAG AGATGCTGGA GAAATTCAAC
GGCGCCTGGC AGGGCTCGGT GGAGCCGGCC TACGACGAAT ACGCGTTCTG A
 
Protein sequence
MARDQIDMTP LNSRDELVAW IEAGVKPPSE FRIGTEHEKT PFTLEGHHPV PYDGARGIGA 
LLEGMKILLG WEPIMEGPHI IGLHDVTGGG AISLEPGGQF ELSGAPVDNV HQTHSELMAH
LAQVREIAAP LGIGFLGLGM TPSWSRDDIP VMPKGRYKIM TNYMPKVGRY GLDMMYRTCT
VQTNLDFSSE ADMVKKLRVS VALQPVATAL FANSPFTEGK PNGFLSFRSE IWRDTDNARS
GMIPWAFEDG MGFERWVDYA LDVPMYFVKR GDDYIDVSGS SFRDFFDGRN DKMPGERPTL
SDWANHLSTI FPEVRLKRYL EMRGADGVPW GRLPALPAFW VGLLYDDQSL DAAWEIVKGW
DAWERQALRD DVPRLGFKAK IRNRFLFEIA KECLVLAHAG LRRRGRIDSF GNDESRYLAP
LEDILASGRT PAEEMLEKFN GAWQGSVEPA YDEYAF