Gene Rsph17025_1074 details


Gene Information

Locus tag: Rsph17025_1074
Symbol:
ID: 5083366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1102242
End bp: 1103486
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 72%
IMG OID: 640482632
Product: hypothetical protein
Protein accession: YP_001167280
Protein GI: 146277121
COG category: [S] Function unknown
COG ID: [COG5323] Uncharacterized conserved protein
TIGRFAM ID:
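
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and a coding sequence that ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1102242, 1103486)
print(length)                  # 1245
print(protein_length(length))  # 414
```

Both results match the Gene Length and Protein Length fields of the record.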


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.541704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.984253
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGAGT TCTGGGCGCT GCCGCACCAG CTCGCACCGG AGGGCGGGTG GAAGAGCTGG 
GTGGTGATGG GCGGCCGGGG CGCGGGCAAG ACCCGGGCGG GGGCCGAGTG GGTCCGTTCG
GAGGTCGAGG GCCCGAGGCC CGGAGATCCG GGCCGGTCGC GCCATGTGGC GCTGGTGGGC
GAGACGGTCG ATCAGACCCG CGAGGTGATG GTCTTCGGCG AGAGCGGCCT GCTGGCCTGC
TCGCCCCCCG ATCGGCGGCC CGAATGGGAG GCGGGGCGCA AGCGCCTCGT GTGGCCGAAC
GGGGCGGTGG CGCAGGTGTT TTCGGCCCAC GATCCCGAGA GCCTTCGCGG GCCGCAGTTC
GACGCCGCCT GGGCCGACGA ACTGGCCAAA TGGGCCCGCG CCGAAGAGGC GTGGGACATG
CTGCAATTCT CGCTGCGGCT GGGCGATCAG CCGCGGCAGG TGGTGACGAC GACGCCGCGC
AACGTGCCGG TGCTGCGCCA GATCCTCGAC AACCCCTCGA CGGTGGTCAC GCATGCGCCG
ACCGAGGCGA ACCGTGCCTA TCTGGCCAAG TCCTTCCTCG ACGAGGTCCA TGCCCGTTAC
GACGGCACGC GCCTCGGGCG GCAGGAGCTG GAGGGGCTGT TGCTGGAGGA TGTCGAGGGC
GCGCTCTGGA CCACGGTGCG GATCGAGGCG CTGCGGGCCG AGGAGGCCGG TCCCCTCGAC
CGGATCGTGG TGGCGGTCGA TCCGCCCGTG ACCGGGCACG AAGCGTCGGA TGAATGCGGC
ATCGTGGTGG TGGGCGCGCG GACCGACGGC CCGCCTCAGG ATTGGCAGGC GGTCGTGCTC
GAGGATGCCT CGGTCGGGGC TGCGAGCCCG GATCGCTGGG CACGGGCGGC GCTTGATGCG
CTGCATCGGC ATGGGGCGGA TCGGCTGGTG GCCGAGGTGA ACCAGGGGGG CGATCTGGTG
GAAACGGTGA TCCGGCAGAT CGATCCGCTC GTGCCGTTCC GGGCTGTCCA TGCCTCGCGC
GGGAAGGCGG CGCGGGCGGA ACCGGTTGCC GCGCTCTACG AGCAGGGGCG GGTCCGGCAT
CTGCGGGGTC TGGGCGATCT CGAGGATCAG ATGTGCCGGA TGACGGTGCG CGGCTACGAC
GGCCGCGGCT CGCCCGACCG GCTGGATGCG CTGGTCTGGG CGCTGACCGA CCTGATGATC
GAGCCGGCGC GGGCCTGGGT GAACCCGCGG ATGCGCCTGC TGTAA
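
The GC content field of the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in space-separated blocks as shown above. The helper name `gc_content` is hypothetical, and the fragment used here is only the first 20 bases of the gene, so its value is not expected to equal the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is removed first, so sequences laid out
    in blocks of ten bases can be pasted directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Illustrative fragment: the first 20 bases of the gene sequence.
fragment = "ATGTTCGAGT TCTGGGCGCT"
print(f"{gc_content(fragment):.0%}")
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content reported in the Gene Information section.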
 
Protein sequence
MFEFWALPHQ LAPEGGWKSW VVMGGRGAGK TRAGAEWVRS EVEGPRPGDP GRSRHVALVG 
ETVDQTREVM VFGESGLLAC SPPDRRPEWE AGRKRLVWPN GAVAQVFSAH DPESLRGPQF
DAAWADELAK WARAEEAWDM LQFSLRLGDQ PRQVVTTTPR NVPVLRQILD NPSTVVTHAP
TEANRAYLAK SFLDEVHARY DGTRLGRQEL EGLLLEDVEG ALWTTVRIEA LRAEEAGPLD
RIVVAVDPPV TGHEASDECG IVVVGARTDG PPQDWQAVVL EDASVGAASP DRWARAALDA
LHRHGADRLV AEVNQGGDLV ETVIRQIDPL VPFRAVHASR GKAARAEPVA ALYEQGRVRH
LRGLGDLEDQ MCRMTVRGYD GRGSPDRLDA LVWALTDLMI EPARAWVNPR MRLL