Gene Rsph17025_4041 details


Gene Information

Locus tag: Rsph17025_4041
Symbol:
ID: 5086214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 76694
End bp: 78469
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 68%
IMG OID: 640485604
Product: hypothetical protein
Protein accession: YP_001170198
Protein GI: 146280041
COG category:
COG ID:
TIGRFAM ID:
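
The coordinates and lengths listed above are mutually consistent, which a short check makes explicit. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(76694, 78469)
print(length, protein_length(length))  # prints: 1776 591
```

Under these assumptions, 78469 - 76694 + 1 gives 1776 bp, and 1776 / 3 - 1 gives 591 aa, matching the listed values.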


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.116876
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.77295
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATGT CGGATATTCT TTTATTGCGG TTTCGCCATG TTCGAGTGTC GCTCTGCGTC 
CGGTTTTCCT GTTGGATTTA TAAACGGCTC TTTTTCAATG ACTTTTCCCG AATGTCGGAT
AGACCCTGCC GTGTCGTCCT GTCCCACAAG CGATCCTGTG CGGTTCCGGC GGCGAAGGCT
GTCCGGACGG TCTGGGTCCG AGCCGGCGGG GCGGTCGTGA CACGGGTCGC CCGTCCCGGC
CTCGCCCTGC TGGCGGTCGG CCTTGCCTTC GCCCTTTCCC TGCTGCTACG CGCCGATTTC
TTCCTGGAAC ACGGGGGGGC CCAGGGGCTC GAGGCGACCT ATCACACGCT CTGGACCATC
GAGGCGCTGG AGACATCCCC GCCGTCGGCG CATCACTTCC TGCCGACGGT GACGCTCGAT
CCGGCGCCCG GCAATCCGGT CCGCTGGGGA TCGACCGTCC CCACCTCGGG CGGAAGCTAC
ATCTACACCT CCTTCCCGCC GCTCGGCTTC CTGGCCCCGA TGTATGGGCT GGAGTCGCTC
TCGCTGAAGG TGACCTTTCT CAATCTGGCA CTGTTCAACG CCCTCGTCGG GCTGGGGGCG
GCCCTTGGTA TCGGCGGGTT CATGCGGGCC GTCGCCCTGC GGGAGATCCG CAATCCGGCG
GAACGCGAGA GGGCGGGCTG GCTGATCTTC GCCGCAATGG CGATCTCCTA TCTGTGCCTG
CGCGAATCCC TCGTCAGCCA CGGCGCCGTC TATTGGCCGC ATTCGCTGTC GCAATTGTGC
CTTGTATTCG GATCATGGGC CGCATTCCGC CTGCTTTGCG GGCAGCGCGA CCCTGTCAGC
CTCGGCGGGC TGCTGGTGGC CTGCGCCCTT TATCCGCTTC TCGAATGGAC GGGATATGTC
TTCAACGTGG GCGTCGCGCT GGCGTTTGCG ATCGATGGCC TGGTCCTGCG CCGGAAGGCG
CTGCGGCCCG CCCTGGGCCT GCCCTTCGCG CTGGCCGGCG TGACCCTTCT CGCCGGAGGG
GGAACCCTCC TGCACTACGT CCTCGCGATC GGCGCGCCGG AGATGATGCG GGCGCTCGCC
CATCGCGCCG TTGATCGCAG CCTCCGACCC GACGCGGTCG CCCTGCCGCT CGGCTATCTC
GTCTCCTTCG CGGGCCTTCT GCTGACCGGG CTTGCCGCCG CCCTGACGAT CCGGCGGAAC
GCGCTGCTCC GGGGGCGGCC CGAACTGCTC CTGCTGTTCC TGCTGGTGAC CTTCCCCATG
GTCGAGAACC TCGTCATGAT GCAGCACGCC ACGCAATTCT CCTTCGACCG GCTGAAGCTG
GCGCTTCCGC TGCTGCTTGC GATGACGGTG GCGGCGGCCG CGCATGGCCG GACGGGCGTC
CGGGCGCTGG TGGCGGGGGC GGGCTTCGTG GTCGTGACCA ACGTCGCGAC CTTCGAGCTG
GACGCCGGCC GCTACGACGC CTGGGGCACC GCCGTCGCGC GCAACGGCGA GGTGATCGGG
CGCTTCCGCC GCGATCCGCT GGCCGGCTGC AGCCTGATGG GCGCCTCGGG CGCGGTGCGC
GGCTACCTGA ACATGGCCTT CCACAGGGAC ATCTTCGAAT TCGTCAGCGC CGACGGGCTG
GCCGGCGAGG CGGAGCGGCG GGACAGCTGC GCCCTGGCCT ACGTCACGAT CGATCCGGTC
TTCCCGGACC TGCCCCGCAT CGCCTCGATC GAGATCCTCG ACCGGGCGGG GCATCCGCTC
CGCCGCTACG AGCCCGAGCC GGAGGTCCGG CCATGA
 
Protein sequence
MSMSDILLLR FRHVRVSLCV RFSCWIYKRL FFNDFSRMSD RPCRVVLSHK RSCAVPAAKA 
VRTVWVRAGG AVVTRVARPG LALLAVGLAF ALSLLLRADF FLEHGGAQGL EATYHTLWTI
EALETSPPSA HHFLPTVTLD PAPGNPVRWG STVPTSGGSY IYTSFPPLGF LAPMYGLESL
SLKVTFLNLA LFNALVGLGA ALGIGGFMRA VALREIRNPA ERERAGWLIF AAMAISYLCL
RESLVSHGAV YWPHSLSQLC LVFGSWAAFR LLCGQRDPVS LGGLLVACAL YPLLEWTGYV
FNVGVALAFA IDGLVLRRKA LRPALGLPFA LAGVTLLAGG GTLLHYVLAI GAPEMMRALA
HRAVDRSLRP DAVALPLGYL VSFAGLLLTG LAAALTIRRN ALLRGRPELL LLFLLVTFPM
VENLVMMQHA TQFSFDRLKL ALPLLLAMTV AAAAHGRTGV RALVAGAGFV VVTNVATFEL
DAGRYDAWGT AVARNGEVIG RFRRDPLAGC SLMGASGAVR GYLNMAFHRD IFEFVSADGL
AGEAERRDSC ALAYVTIDPV FPDLPRIASI EILDRAGHPL RRYEPEPEVR P
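
The protein sequence can be derived from the gene sequence under translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the two tables differ in the permitted start codons). The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base spacing shown above; the names `clean`, `translate`, and `gc_fraction` are illustrative helpers, not functions from an existing library.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two 10-base blocks of the gene sequence; the trailing "CT" is an
# incomplete codon and is ignored.
print(translate("ATGTCGATGT CGGATATTCT"))  # prints: MSMSDI
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared against the listed GC content.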