Gene Rsph17029_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1035 
Symbol 
ID4895827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1068621 
End bp1070066 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content72% 
IMG OID640111622 
Producthypothetical protein 
Protein accessionYP_001042918 
Protein GI126461804 
COG category[S] Function unknown 
COG ID[COG0397] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.414237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCC GCTTCGACAA CAGCTATGCC CGCGACCTCG AGGGCTTCTA TGTGGACTGG 
CCCGCCGCGC CGGTCCCCGC GCCGCGGCTT CTGCGGCTGA ACCGCCCGCT GGCCGAGGAG
CTGGGGCTCG ATCCCGACCT TCTCGAGCGC GAGGGGGCGG AGATCTTTTC GGGGAGGCGC
CTGCCCGAGG GGGCGCACCC GCTGGCGCAG GCCTATGCGG GCCACCAGTT CGGCGGCTTC
TCGCCCCAGC TTGGTGACGG GCGGGCGCTG CTCATCGGCG AAATCACCGA CCGCGCGGGC
CGGAGGCGCG ACCTTCAGCT CAAGGGCTCG GGGCGCACGC CCTTTTCGCG CGGCGCGGAC
GGCAAGGCGG CCCTCGGGCC GGTGCTGCGC GAATATCTGG TGGGCGAGGC GATGCACGGG
CTCGGCATCC CCACCACCCG CGCGCTGGCG GCCGTCGCCA CCGGCGAGCC GCTCCTGCGG
CAGGAGGGCG AGCGTCCGGG CGCGATCCTG ACGCGCGTCG CGGCGAGCCA CATCCGCGTG
GGCACCTTCC AGTTCTTCGC CGCGCGCAGC GACATCGAGC GGGTGCGGCG GCTGGCCGAC
TACGCCATCG CGCGGCATTA CCCCGAACTC GCCTCCGCGC CCGAGCCCTA TCTCGCCTTC
TATGAAGCGG TGGCCGAGGC GCAGGCGCAG CTCGTGGCGC GCTGGATGCT GGTGGGCTTC
ATCCATGGCG TGATGAACAC CGACAACATG ACGATCTCGG GCGAGACCAT CGACTACGGC
CCCTGCGCCT TCATGGAGGG CTACGATCCC GGCACGGTCT TCTCCTCCAT CGACCTGCAG
GGGCGCTATG CCTATGGCAA CCAGCCCTTC ATCCTCGCCT GGAACCTCGC GCGGCTGGGC
GAGGCGCTTC TGCCGCTTCT CGATGCCGAT GCGGAGCGGG CGGCGGACAA GGCCAATTCC
GTGCTGGAAA CGGTGGGCGC GCGCTATCAG GGCCACTGGC TCGCGGGCAT GCGCGCCAAG
CTCGGGCTGT CCGGGGCCGA AGAGGGCGAT GCGCGGCTTG CCGAGGATCT GCTGGAGGCC
ATGCGTAGCC AGCGCGCCGA CTGGACGCTG ACCTTCCGCC GGCTCGCGGA TGCCGTGACC
GACGAAGGCG CGCTCCGCCC CCTGTTCCGC GATGGGTCCG CGCTCGAGGC ATGGCTGCCG
CGCTGGCGGG ACCGGCTGGC GCCCGACGCG GCCCAGCGGA TGCGGGCGAC AAACCCGATC
TACATCGCGC GGAACCACCG GGTCGAGGAG GCGCTGGCCG CGGCCCATGC CGGCGATCTC
GCACCCTTCG ACCGGCTGCT CGAGGCGCTG GCTGAGCCCT TCACCGAACG GGCCGACCGC
GAGCTGTTCG CCCTGCCGGC CCCGGAAGGG TTCGACGACA GCTACCGCAC CTTCTGCGGG
ACGTGA
 
Protein sequence
MTFRFDNSYA RDLEGFYVDW PAAPVPAPRL LRLNRPLAEE LGLDPDLLER EGAEIFSGRR 
LPEGAHPLAQ AYAGHQFGGF SPQLGDGRAL LIGEITDRAG RRRDLQLKGS GRTPFSRGAD
GKAALGPVLR EYLVGEAMHG LGIPTTRALA AVATGEPLLR QEGERPGAIL TRVAASHIRV
GTFQFFAARS DIERVRRLAD YAIARHYPEL ASAPEPYLAF YEAVAEAQAQ LVARWMLVGF
IHGVMNTDNM TISGETIDYG PCAFMEGYDP GTVFSSIDLQ GRYAYGNQPF ILAWNLARLG
EALLPLLDAD AERAADKANS VLETVGARYQ GHWLAGMRAK LGLSGAEEGD ARLAEDLLEA
MRSQRADWTL TFRRLADAVT DEGALRPLFR DGSALEAWLP RWRDRLAPDA AQRMRATNPI
YIARNHRVEE ALAAAHAGDL APFDRLLEAL AEPFTERADR ELFALPAPEG FDDSYRTFCG
T