Gene RPD_4104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4104 
Symbol 
ID4024626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4568470 
End bp4569753 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content68% 
IMG OID637964312 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_571224 
Protein GI91978565 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTC TGCTGCTCGG TTCAGGCGGC CGTGAACACG CGCTGGCCTG GAAGATCGCG 
GCCTCACCTC TGGTCACCAA ATTGTGGTGC GCGCCGGGCA ATGCCGGGAT CGCGCGCGCC
GCCACCTGCG TGCCGCTCGA CCTCGCCGAC CACGCCGCGG TGATCGATTT CTGCAAGGCC
AATGCGATCG AATTCGTGGT GGTCGGACCG GAGGCGCCGC TGGTCGCCGG CATCGTCGAC
GATCTCACAG CCGCCGGCAT CAAGGCATTC GGGCCGAGCA GCAAGGCGGC GCAGCTCGAA
GGCTCCAAGG GCTTCACCAA GGATCTGTGC AAGGCGCACG GCATCCCCAC CGCCGCCTAC
GAGCGGTTCA GCGATCCGGA CGAGGCCAAG GCCTATATCC GGGTGCAGGG CGCGCCGATC
GTGGTCAAGG CCGACGGCCT CGCCGCCGGC AAGGGCGTCG TGGTGGCGAT GACGCTGGCC
GAGGCGGAAG CCGCGGTCGA CATGATTTTC GGCGGTGCGC TCGGCGACGC CGGCGTCGAG
GTGGTGGTCG AGGATTTCCT GGTCGGCGAG GAGGCCTCGT TCTTCGTGCT GTGCGACGGC
GAGCACGCGC TGGCGCTCGC CACCGCCCAG GATCACAAGC GCGCCTATGA CGGCGACAAG
GGGCCGAACA CCGGCGGCAT GGGCGCCTAT TCGCCGGCGC CGGTGATGAC CGAGGCGGTC
TGCAAGCAGG CGATGGAACG GATCATCACC CCGACGCTCA AGGGCATGAA GGCGATGGGG
ATGCCGTTCA AGGGCGTGCT GTTCGCCGGG CTGATGATCA CCGAGGACGG ACCGCAACTG
ATCGAATACA ACGTCCGCTT CGGCGATCCG GAATGCCAGG TGCTGATGCT GCGGATGATG
TCCGACATCG TGCCCGCGCT GCTCGCCTGC GCCGACGGCC AGCTCGCGCA TTTCAGTCTG
CGCTGGGTCG ACGAGCCGGC GCTGACCGTG GTGATGGCGG CGAAGGGCTA TCCGGGCGCC
TATGAGAAGG GCACCAGGAT CGCCGGGCTC GAGCGCGCCG AGCAGATTCC GGGCGTGGAG
ATCTTCCACG CCGGCACCAT GGCATCGGGC GACTGGATCC TCGCCAATGG CGGCCGCGTA
CTCGCGGTGA CCGCCTCGGC GAACACGGTG GCGGAAGCGC AACGCCGCGC CTACGAGGCG
ATCGGCGTGA TCAACTGGCC GGAAGGCTTC TGCCGCCGCG ACATCGGCTG GCAGGCTGTG
GCGCGGGAAC GCGGCCGCAA GTAG
 
Protein sequence
MNILLLGSGG REHALAWKIA ASPLVTKLWC APGNAGIARA ATCVPLDLAD HAAVIDFCKA 
NAIEFVVVGP EAPLVAGIVD DLTAAGIKAF GPSSKAAQLE GSKGFTKDLC KAHGIPTAAY
ERFSDPDEAK AYIRVQGAPI VVKADGLAAG KGVVVAMTLA EAEAAVDMIF GGALGDAGVE
VVVEDFLVGE EASFFVLCDG EHALALATAQ DHKRAYDGDK GPNTGGMGAY SPAPVMTEAV
CKQAMERIIT PTLKGMKAMG MPFKGVLFAG LMITEDGPQL IEYNVRFGDP ECQVLMLRMM
SDIVPALLAC ADGQLAHFSL RWVDEPALTV VMAAKGYPGA YEKGTRIAGL ERAEQIPGVE
IFHAGTMASG DWILANGGRV LAVTASANTV AEAQRRAYEA IGVINWPEGF CRRDIGWQAV
ARERGRK