Gene Rsph17029_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1044 
Symbol 
ID4895528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1078119 
End bp1079228 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content70% 
IMG OID640111631 
Producthypothetical protein 
Protein accessionYP_001042927 
Protein GI126461813 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0461825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCG ACAAGATCGA ACGTGAAGTG GAAGAGAACC GCGCGCGAGT CGAAAGCACT 
CTCGACGCGC TGAAGGAGCG GATGTCCGTC AATCAGGTCG TCGACGATCT CGCGAATTTC
GTGGGCGTCG AGGATCTGCG GGGCGTGATG CATTCCGCCG GCCGGCAGGT GCGCGACAAT
CCGGTTGCTC TGGGCCTCAT CGGAGTGGGT CTGGCGTGGC TCGCTTTCGG CGGCTCCTCG
AGCCGCTCGC GCCATGTCAG CGCCTACGAC CGCGAGGAAT ATTACCGGAG CGACTACGGC
CCCGCCGGCC GGTCCTACGA GCCCTATGGC GGCGGCGCCT CCTATCGTTC GGACCGGGGC
GAGGGCGTGG TCTCGCGCGT CAAACATGCG GTGAGCGACG CCGCCGACAG CGTGAGCCGC
GCGGCCCATT CCGCGACGGA CAAGGTTGCC GAAACTTTCG GCGACGCGCG TGACCGCGCG
GGCAGCCTGC GCGACGATGT CTACGACCGC GCCGGCCGGA TGCGCGAGGA TGCCTATGAT
CGTGCCGGCC ACTGGCGCGA CGATCTCGGC GAGCGGTCGT CGCACCTGCG CGACCGTGCC
GGGCATCTGC GCGACCGCGC GTCCCACGGG GCGCATCAGA TGCGCGACAG CATGAGCCAC
GGCATGGAGC AGCAGCCCCT GCTCGTGGGT GCCGCGGCCG TGGCGCTCGG CGCCGTGATC
GGGGCCGCGC TTCCCCGGAC GCGCACCGAG GACGAGTGGA TGGGCCGCAG CAGCGACGAG
CTCTGGGACG AGGCGAAGGC TTCGTCGTGG GAGCTGCGCG AGCGGGCGAT GAAGGCCGCG
CGCGAGACCT ACGACGCCAC CATCGCCGCG GCGCGCGACG AGGGTCTGGT GCCCGAAAAG
GGCGAGACGC TGGCCTCGAA GGTGGGACGC GTCGCGGATG CGGCGGCCAG CGAGGCGAAG
GCTCAGGTGG AGCCCGTGCT GCACGGCAAG GACGAGGACA AATCCTCGAC CGGCATGTCC
TCGGCTGGCG CGGGTTCGAC AGGCTCGACC GACTCCACGA CCAAGGGTCC CGGCACGTCC
GGCCCCAAGG TTGCAGGCTC GGGCTTCTGA
 
Protein sequence
MNADKIEREV EENRARVEST LDALKERMSV NQVVDDLANF VGVEDLRGVM HSAGRQVRDN 
PVALGLIGVG LAWLAFGGSS SRSRHVSAYD REEYYRSDYG PAGRSYEPYG GGASYRSDRG
EGVVSRVKHA VSDAADSVSR AAHSATDKVA ETFGDARDRA GSLRDDVYDR AGRMREDAYD
RAGHWRDDLG ERSSHLRDRA GHLRDRASHG AHQMRDSMSH GMEQQPLLVG AAAVALGAVI
GAALPRTRTE DEWMGRSSDE LWDEAKASSW ELRERAMKAA RETYDATIAA ARDEGLVPEK
GETLASKVGR VADAAASEAK AQVEPVLHGK DEDKSSTGMS SAGAGSTGST DSTTKGPGTS
GPKVAGSGF