Gene Rsph17025_3820 details


Gene Information

Locus tag: Rsph17025_3820
Symbol:
ID: 5085361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 714324
End bp: 715364
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 61%
IMG OID: 640485374
Product: hypothetical protein
Protein accession: YP_001169982
Protein GI: 146279824
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
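
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 714324
END_BP = 715364


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1041 346
```

Under those assumptions, 715364 - 714324 + 1 = 1041 bp, and 1041 / 3 = 347 codons, one of which is the stop codon, leaving 346 aa.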


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.930447
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTACT ATGCCGGCTT GGACGTTTCG CTGAGGGAGA TTTCGATCTG CGTCATTGAC 
GCAGATGGCA AGAGCATTGC GCGAGGTTGT TGCCCAGCCG ACCCTGAGGG CGTCGCCGGG
TGGTTTCGCG CAAGGTCGCT GTCGCCAAAG CAGATCGTTC ATGAAAGCGG ACAGCTGTCG
ATTTGGCTTC AGCGCGGTTT GGAACGGCTG GGGCTTCCGT CCGTGTGCAT CGACGCCCGC
AAGGCGCACA AGAGCCTGTC AGCTCGCCCC AACAAATCCG ATGCCGCTGA CGCGGAAGGC
CTTGCGCAAC TGGCTCGCAC AGGGTGGTTC ACCCCTGTCC ACGTGCGCAG CGAGACATCC
GATGGGCTGC GTTGTCTGGT AGGTGCCCGT GCGCAGCTGA TCAGTCTCCG CAAGGACCTT
GAGGGCCACA TCCGAGGAAT TCTCAAGACC TTCGGCATCC GCCTGACGGG TATCGGGAAA
GGCGACCAGC GGCAGGTCTT TCGCGACCAA CTCGCCTCTG CGGGAGAGCG TGATCCAGTC
TTGCGCGCAA TCGCGGACGC GTTCATCTGC GCCCATGCCA CCATTTGCCA GGCGGCAGCG
GACCTTGATC GTGCCGTGAA AGAGAACGCT GAGAAGCATC CTGTTGCCCG CCGATTGATG
ACCATTCCCG GCGTCGGGCC CATCGTGTCG CTGAGCTTTG TCGCGCTGGT CGACGATCCT
GCGAGGTTTC GTCGGGCTGT TGATGTCGGC GCTTTTCTGG GCCTTACCCC TCGGCGATAT
CAGTCCGGAG AAATGGATTG GTCCGGACGT ATCTCGAGAT GCGGCGACAG GGCCATGCGA
AGCATGTTGT TCGAAGCCGC AACTGCTTTG ATCAGTCGAA CACGGCGGTT CTCTGCGCTG
AAAAGCTGGG CGGTCCGACT TGCTGGACGG CGAGGGTTTG CCAAGGCCGC CGTGGCCACC
TCGCGGAAAC TCGCAGTGCT GATGCTGACG CTCTGGAAAA ACGAGACCGA ATTCAAATGG
AAAGAGCGGG CCGTAGCCTG A
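
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before computing anything from it. The following is a minimal sketch of how a GC percentage such as the one reported in the record could be recomputed. The helper names are hypothetical, and only the first display line is used here, so the printed value describes that line alone, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence above; paste all lines for the full gene.
first_line = "ATGGACTACT ATGCCGGCTT GGACGTTTCG CTGAGGGAGA TTTCGATCTG CGTCATTGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```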
 
Protein sequence
MDYYAGLDVS LREISICVID ADGKSIARGC CPADPEGVAG WFRARSLSPK QIVHESGQLS 
IWLQRGLERL GLPSVCIDAR KAHKSLSARP NKSDAADAEG LAQLARTGWF TPVHVRSETS
DGLRCLVGAR AQLISLRKDL EGHIRGILKT FGIRLTGIGK GDQRQVFRDQ LASAGERDPV
LRAIADAFIC AHATICQAAA DLDRAVKENA EKHPVARRLM TIPGVGPIVS LSFVALVDDP
ARFRRAVDVG AFLGLTPRRY QSGEMDWSGR ISRCGDRAMR SMLFEAATAL ISRTRRFSAL
KSWAVRLAGR RGFAKAAVAT SRKLAVLMLT LWKNETEFKW KERAVA
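
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; this gene starts with ATG, so the sketch does no special start-codon handling. The function and variable names are illustrative.

```python
# Codon table in the conventional TCAG ordering (same codon assignments
# as the standard genetic code, which table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop display spacing
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGACTACT ATGCCGGCTT GGACGTTTCG"))  # MDYYAGLDVS
```

The first 30 bases translate to MDYYAGLDVS, matching the first block of the protein sequence, and the final display line of the gene (ending in the stop codon TGA) translates to KERAVA, matching its last six residues.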