Gene Rsph17029_2012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2012 
SymbolpyrG 
ID4896870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2131017 
End bp2132660 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content66% 
IMG OID640112606 
ProductCTP synthetase 
Protein accessionYP_001043888 
Protein GI126462774 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00909929 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGCT ACGTCTTCAT CACCGGCGGG GTGGTGTCTT CCCTGGGCAA GGGTCTGGCC 
TCGGCGGCGC TCGGCGCCCT GCTGCAGGCG CGGGGCTTCT CGGTCAGACT GAGAAAGCTC
GACCCCTATC TCAACGTCGA TCCCGGCACG ATGTCGCCCT TCGAACATGG CGAAGTCTTC
GTGACCGACG ACGGTGCCGA GACCGACCTC GACCTCGGCC ATTACGAGCG ATTCACCGGC
GTCTCGGCCC GCAAGACGGA CTCGGTCTCG TCCGGCCGGA TCTATTCGAA CGTGCTCGAG
AAGGAGCGTC GCGGCGACTA CCTCGGCAAG ACGATCCAGG TGATCCCGCA CGTCACCAAC
GAGATCAAGG ACTTCCTCCG CGTCGGTGAG GATGAGGTCG ATTTCATGCT CTGCGAGATC
GGCGGCACGG TGGGCGACAT CGAGGGCTTG CCCTTCTTCG AGGCGATCCG CCAGTTCGCG
CAGGACAAGC CGCGCGGCCA GTGCATCTTT GTCCATCTGA CGCTTCTGCC CTATGTCTCG
GCCTCGGGGG AACTGAAGAC GAAGCCCACC CAGCATTCGG TCAAGGAGCT GCGCTCGATC
GGCATTGCGC CCGACGTGCT GCTGCTGCGC TCCGAACGCG CGATCCCCGA GAAGGAGCGC
GAGAAGATCG CCCTCTTCTG CAATGTCCGG AAAGAGGCGG TGATCGCCGC CTACGACCTC
AAGACCATCT ACGAGGCGCC GCTCGCCTAT CACCGTGAGG GTCTCGATCA GGCGGTGCTC
GACGCCTTCG GGATCTCGCC CGCGCCGAAG CCCAACCTCG ACCGCTGGGT CGATGTGATG
GACCGGCTCG AGAATGCCGA GGGCGAGGTG CGCGTGGCCA TCGTCGGGAA ATACACCCAG
CTCGAGGATG CCTACAAATC CATTGCCGAG GCGCTGACCC ACGGCGGGAT GGCGAACCGC
ACCCGCGTGC GGGCGGAATG GATCAATGCC GAGCTCTTCG AGCGGGAGGA TCCGAGCCCC
TTCCTCGAGG GCTTCCACGC CATCCTCGTG CCCGGCGGCT TCGGCGAGCG CGGCACCGAG
GGCAAGATCC GGGCGGCGCA ATATGCGCGC GAGAAGGGCA TCCCCTATCT CGGCATCTGT
CTGGGGATGC AGATGGCGGT GATCGAGGCG GCGCGGAACC TCGCCCAGGT GAAGGATGCG
GGCTCCGAGG AGTTCGATCA CGAGGTGGGC AAGAAGCGGT TCACGCCGGT GGTCTATCAC
CTCAAGGAAT GGATCCAGGG CAACCATATC GTCGAGCGCA AGCACGACGA CGACAAGGGC
GGCACCATGC GGCTCGGCGC CTACACCGCC GCGCTGACGC CGGGCTCGCG CGTGTCCGAG
ATCTACCATG CGACCGAGAT CGAGGAACGC CACCGCCACC GCTACGAGGT GGACGTGCGC
TACCGCGAGG CGCTGGAGGG CTGCGGGCTG ACCTTTTCGG GGATGAGCCC GGATGGGCGG
CTGCCCGAGA TCGTCGAGAT CAAGGACCAT CCGTGGTTCA TCGGCGTGCA GTTCCACCCC
GAGCTCAAGT CGAAGCCCTT CGCGCCGCAT CCGCTGTTCG CCGATTTCGT GCGCGCCGCG
GTCGAGGTGT CGCGGCTGGT CTGA
 
Protein sequence
MARYVFITGG VVSSLGKGLA SAALGALLQA RGFSVRLRKL DPYLNVDPGT MSPFEHGEVF 
VTDDGAETDL DLGHYERFTG VSARKTDSVS SGRIYSNVLE KERRGDYLGK TIQVIPHVTN
EIKDFLRVGE DEVDFMLCEI GGTVGDIEGL PFFEAIRQFA QDKPRGQCIF VHLTLLPYVS
ASGELKTKPT QHSVKELRSI GIAPDVLLLR SERAIPEKER EKIALFCNVR KEAVIAAYDL
KTIYEAPLAY HREGLDQAVL DAFGISPAPK PNLDRWVDVM DRLENAEGEV RVAIVGKYTQ
LEDAYKSIAE ALTHGGMANR TRVRAEWINA ELFEREDPSP FLEGFHAILV PGGFGERGTE
GKIRAAQYAR EKGIPYLGIC LGMQMAVIEA ARNLAQVKDA GSEEFDHEVG KKRFTPVVYH
LKEWIQGNHI VERKHDDDKG GTMRLGAYTA ALTPGSRVSE IYHATEIEER HRHRYEVDVR
YREALEGCGL TFSGMSPDGR LPEIVEIKDH PWFIGVQFHP ELKSKPFAPH PLFADFVRAA
VEVSRLV