Gene Rsph17025_3516 details

Gene Information

Locus tag: Rsph17025_3516
Symbol:
ID: 5085932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 401867
End bp: 403408
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 68%
IMG OID: 640485075
Product: IS66 Orf2 family protein
Protein accession: YP_001169691
Protein GI: 146279533
COG category: [L] Replication, recombination and repair
COG ID: [COG3436] Transposase and inactivated derivatives
TIGRFAM ID:
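
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(401867, 403408)
print(length)                     # 1542
print(protein_length_aa(length))  # 513
```

Both values match the Gene Length and Protein Length fields of this record.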


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.578775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.957908
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGATC TCGAGCGGCG GCTGGCCGAG GCGGAAGCCG CGCGCACCGC GGCCGAGGCC 
CGCCTTGCCG AGGTCGAGGA GGCCCGCGCC CGGCTGGAGC GGCTGCTCGG TCAGATGCGG
CGCGACACCT TCGGCTCGAA GTCCGAGAAG CTCGACCCCG ACCAGCGCAA CCTGCCGTTC
GAAGATGTCG AGGCGGCGGC CGGCATGCTG GCCGCGGCGA GCGAAGCGGC CGAGAAGGCG
CTCGGCACCC GGAGGGCGCC GTCCCGGCCT TCGGAGCGCA ACAAGGGGCA CCTGCCGGAG
CACCTGCCGC GGATCGAGCG GGTGATCGAG CCCGACAGCA TCCTCTGTCC CTGTGGCTGC
GGCGAGATGC AAAGAGTGGG CGAGAGCCGG ACCGAGCGGC TCGATGTCAT CCCGGCGCGG
TTCCGCGTGC TGGTGACGAT CCGCCCGAAA TACGTCTGCC GGACCTGCGC GGGGGCACAA
CATGCCCAGG CTCCGGCGCC GGAATGGCTG GTGCCGCGTG GCCTGCCGAC CGAGGCGCTG
GTCGCGCACA GCATGGTGGG CAAGTTCGGC GACTACCTTC CGTTCTACCG CCAGGCCGAC
ATCTACCGGC GGCAGGGGGT CGAACTCGAC CGCACGATGC TGGCGGAATG GTCGGGGCGT
GCGGCGCAGC TGCTGGCCCC GGTGATCGAC GCGATGATGG CCGAGCTTCG ACGAAGCGAC
CGGCTGCAGA TGGACGAGAC CACCGTGCCA GTGCTGGCGC CCGGGACCGG CGCTGTGCGC
AAGGACTGGC TCTGGGTGGT GCTGCGCGAC CAGCGCGGAT GGGGCGGCGG CGATCCTCCG
ATCGTGGTCT TCCACCACTC GCAAAGCCGC GGCGGCAAGG TCGCGCAGGA GATCCTGAAG
GGGTTCGCCG GCGGTACGCT GCTGGTTGAC GGCCATGGCG GCTACGATCC GCTGGCCGAC
CCCAAGAAGA CCGCGAAGCC CTGGACCCTG GCCTTCTGCT GGACACATTG GAGGCGGCGC
TTCGTCAAGT TCAGCCAGGA CACGCCTTCG CCGATCTGCG ACGAGATGAT CGCGCGGATC
GCGCAGCTCT ACGCCATCGA GAAGGAGATC CGCGGCCGTG ATCCCGCCGC CCGCGTCACG
GTCCGCCAGA AGTTCTCCAA GCCCATCGTC GAGGCGCTGC GGCCCTGGCT CGAGGGCTGC
CTTCAGGATC TCTCCTCGTC CAACGAGCTC AGCACGCACA TCCGGTACGG GCTGAAGAGA
TGGGACGGCA TGACCCGCTT TCTGGAGGAT GGCCGGCTCG AGATGGACAC CAATGGGGTC
GAGAACGCGA TCCGTCCGAT TCCACTTACA AGAAAGAATG CACTATTTGC CGGCTCCACG
GATGGGGCGA AGACATGGGC CCGCATCGCC TCGCTGATCG GCACCTGCCG TCTGAACGGC
GTGAACCCCG AAGCCTACAT CGCAGCGACG CTGCGCAAGA TCCTCGACCA ACACATGCAG
AGCGACATCG CCGCGCTGAT GCCCTGGAAC TTCCGCGAAT AG
 
Protein sequence
MADLERRLAE AEAARTAAEA RLAEVEEARA RLERLLGQMR RDTFGSKSEK LDPDQRNLPF 
EDVEAAAGML AAASEAAEKA LGTRRAPSRP SERNKGHLPE HLPRIERVIE PDSILCPCGC
GEMQRVGESR TERLDVIPAR FRVLVTIRPK YVCRTCAGAQ HAQAPAPEWL VPRGLPTEAL
VAHSMVGKFG DYLPFYRQAD IYRRQGVELD RTMLAEWSGR AAQLLAPVID AMMAELRRSD
RLQMDETTVP VLAPGTGAVR KDWLWVVLRD QRGWGGGDPP IVVFHHSQSR GGKVAQEILK
GFAGGTLLVD GHGGYDPLAD PKKTAKPWTL AFCWTHWRRR FVKFSQDTPS PICDEMIARI
AQLYAIEKEI RGRDPAARVT VRQKFSKPIV EALRPWLEGC LQDLSSSNEL STHIRYGLKR
WDGMTRFLED GRLEMDTNGV ENAIRPIPLT RKNALFAGST DGAKTWARIA SLIGTCRLNG
VNPEAYIAAT LRKILDQHMQ SDIAALMPWN FRE
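
The gene begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the accepted alternative start codons and is read as methionine when it initiates translation. The following minimal sketch illustrates this on the first 60 nucleotides of the gene sequence above. It is a hand-rolled translator, not a library call: the helper names and the `TABLE11_STARTS` set are written here for illustration, and the codon assignments are those of the standard code, which table 11 shares for elongation.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by table 11 for elongation).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for table 11; each is read as M when initiating.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces and newlines used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def translate_table11(seq: str) -> str:
    """Translate a CDS fragment, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in TABLE11_STARTS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGGCGGATC TCGAGCGGCG GCTGGCCGAG GCGGAAGCCG CGCGCACCGC GGCCGAGGCC"
print(translate_table11(first_line))  # MADLERRLAEAEAARTAAEA
```

The output matches the first 20 residues of the protein sequence listed above.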