Gene Rsph17029_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0501 
Symbol 
ID4897468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp523928 
End bp525007 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content66% 
IMG OID640111085 
Productputative GTP cyclohydrolase 
Protein accessionYP_001042389 
Protein GI126461275 
COG category[S] Function unknown 
COG ID[COG1469] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.291801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCC TGACGCCCGT GGCCGAGCGC CTGCCGAGCC GTGAGGAAGC CGAAGAGGCA 
CTGGCCGTGC TCCGCCGCTG GGCGACGCAT ACGCCGGCCT CCGATGTGGC CGCGCTCGCG
CCCGAGGCCC CGGCGCTGGT CTATCCCGAC CTCAGCCGCG CCTATCCCCG CACCTTCACG
GTGGACGAGG CCTACAAGGC CTCGCTGCCC GACCTGCAGA ACGGGCCCGC CAGCCTGATC
GTCGGCGCGA AGGCCGTGAT CCAGCATGTC GGCATCTCGA ACTTCCGCCT GCCGATCCGC
TATCACACGC GCGACAACGG CGATCTGCAG CTCGAAACCT CCGTCACCGG CACGGTGAGC
CTCGAGGCCG AGAAGAAGGG CATCAACATG AGCCGCATCA TGCGGTCCTT CTATGCCCAT
GCCGAGCAGG CCTTCAGCTT CGAGGTGATC GAGCGCGCGC TCGAGGATTA CAAGCGCGAC
CTCGAGAGTT TCGACGCCCG CATCCAGATG CGCTTCTCCT TCCCGGTGAA GGTGCCGTCG
CTGCGGTCGG GCCTCACAGG CTGGCAATAT TACGACATCG CGCTCGAGCT GGTTGACCGC
GGCGGGGTGC GCAAGGAGAT CATGCATCTC GACTTCGTCT ATTCCTCGAC CTGCCCCTGC
TCGCTGGAGC TGTCCGAACA TGCCCGGCGC GAGCGCGGGC AGCTGGCCAC GCCGCATTCG
CAGCGGTCGG TCGCGCGGAT CTCGGTCGAG GTGCGGCAGG GCAAGTGCCT CTGGTTCGAG
GATCTTCTGG ATCTCGTCCG CAGCGCGGTG CCGACCGAGA CGCAGGTCAT GGTCAAGCGC
GAGGACGAGC AGGCCTTCGC CGAGCTGAAT GCCGCAAACC CGATCTTCGT CGAGGATGCC
GCGCGCAGCT TCTGTCAGGC GCTGCAGTCC GATCCGCGGA TCGGCGACTT CCGCGTGGTG
GCGAGCCATC AGGAATCGCT GCATTCCCAC GATGCGGTCT CGGTTCTGAC CGAGGGGCCG
ACATTCGCGG CCGAAAGTCT CGATCCGAGG CTCTTTTCCA GCCTCTACCA CGTCGGCTGA
 
Protein sequence
MNILTPVAER LPSREEAEEA LAVLRRWATH TPASDVAALA PEAPALVYPD LSRAYPRTFT 
VDEAYKASLP DLQNGPASLI VGAKAVIQHV GISNFRLPIR YHTRDNGDLQ LETSVTGTVS
LEAEKKGINM SRIMRSFYAH AEQAFSFEVI ERALEDYKRD LESFDARIQM RFSFPVKVPS
LRSGLTGWQY YDIALELVDR GGVRKEIMHL DFVYSSTCPC SLELSEHARR ERGQLATPHS
QRSVARISVE VRQGKCLWFE DLLDLVRSAV PTETQVMVKR EDEQAFAELN AANPIFVEDA
ARSFCQALQS DPRIGDFRVV ASHQESLHSH DAVSVLTEGP TFAAESLDPR LFSSLYHVG