Gene Rsph17029_1859 details


Gene Information

Locus tag: Rsph17029_1859
Symbol:
ID: 4896845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1963520
End bp: 1966621
Gene Length: 3102 bp
Protein Length: 1033 aa
Translation table: 11
GC content: 67%
IMG OID: 640112451
Product: hypothetical protein
Protein accession: YP_001043735
Protein GI: 126462621
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
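
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1963520, 1966621)
print(length, protein_length(length))  # prints: 3102 1033
```

Both results agree with the Gene Length and Protein Length values listed in the record.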


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACCC ACAAGGAAAT CGGCTTCGAG GAAGTGATCT GCGCCCATCT CGCCTCGCGG 
GGCTGGCTCT ATTCCGACGG CGACGCGCAG CAGTATGACC GCACCCGGGC GCTTCTGCCC
GCCGATGTCG CCGCCTGGGC CGAGACCACG CAGCCGGAGG CCTGGGCGGA GGCTGTGGCC
AAGCATGGCG AGGCGGCGAT CCTCGACCGG CTGCGGAAAC TCCTGAACGA GCGCGGCACG
CTCGAGGTGC TGCGGCGGGG GATCGACATC GTGGGCCTCA AGGTGCCGCT GGCGCTGGCC
CAGTTCCGCC CGGCCATGGG CATGAACCCC GAGATCACCG CGCGCTATCA CGCGAACCGG
CTGCGGGTGG TGCGGCAGGT GCGCTATTCG GGGACGAACG AGAAATGCCT CGACCTCGTG
CTGTTCCTGA ACGGCATTCC GGTGGCGACG GCGGAGCTCA AGACCGACTT CACCCAAGGC
GTGCAGGACG CGGTCGACCA ATACCGCTTC GACCGCAAGC CGCATGAGAA GGGACAGGCG
CCGGAGCCGC TCCTCTCCTT CCCCGGCGGC GCGCTGGTGC ATTTCGCGGT CAGCGACAGC
GAGGTGATGA TGACGACCCG GCTCGAGGGG CCGGGCACGC GCTTCCTGCC GTTCAACAAG
GGCGACCATG GTGCCAAGGG CAATCCGGTG AACCCGATGG GCCACCGCAC CGCCTATCTG
TGGGAGGAGA TCTGGGCGCG CGACAGCTGG CTCGAGATCC TCGGCCGCTA CATGGTCGGC
ACGCGAGACG CGAAGAAGGC TCTGACGGGG ATGATCTTCC CGCGCTTTCA CCAGCTCGAC
GCCACGCGCA AGCTGCGGGC GGCGGTGCTG GAGGAAGGCC CGGGCGGCAA GTACCTGATC
CAGCACTCGG CGGGCTCGGG CAAGACCAAC TCGATCGCCT GGACGGCGCA TCAGCTGGCC
GACCTGCATG ACGCGGCGGA CGCGAAGGTG TTCTCCTCGG TGGTGGTGGT GTCGGATCGC
AATGTGATCG ACACGCAGCT GCAGGAGGCA ATCTTCGCCT TCGAGCGCAC GACCGGCGTG
GTGGCGACCA TCACGAACGA CAGCGGTGCG AAGAGCGGTC AGCTGGCCGA GGCGCTGGCG
GGCGGCAAGA AGATCGTGGT CTGCACGATC CAGACCTTTC CTCACGCGCT GAAGGCGGTG
CAGGAGCTGG CGGCGGCGAC GGGCAAGCGC TTCGCGGTGA TCGCGGACGA GGCGCATTCC
AGTCAGACCG GCGAGGCAGC GGCGAAACTG AAGCAGGTGC TCTCGGCCGA GGAACTGGCG
GCGGTCGCGG ATGGCGGCGA GATCGGCACC GAGGACATCC TCGCCGCGCA GATGGCCGCG
AAGGCGTCGA GCGCGGGCAT CAGCTTCGTG GCCTTCACCG CGACGCCGAA GGCCAAGACG
CTGGAGCTGT TCGGCCGCCG CCCCGATCCC TCGGCGCCCG CGGGGCCAGG CAACCTGCCC
GCGCCCTTCC ACGTCTATTC GATGCGGCAG GCGATCGAGG AGGGCTTCAT CCTCGACGTG
CTGAAGAACT ACACGCCCTA TTCGCTGGCC TTCAAGCTGG CGAGCGACGG GCGGGAGCTG
GACGACGCCG AGGTCGAGCG CAGCTCGGCG CTGAAGTCGC TGATGAGTTG GGTCAAGCTG
CACCCCTGGA ACATCAGCCA GAAGGTGGCG ATCGTGGTCG AGCATTACCG CGCGATGGTG
ATGCCGCTGC TCGAGGGCCG CGCCAAGGCG ATGGTGGTGG TCGAGAGCCG GAAGGAGGCG
GTGCGCTGGC AGCTGGCGAT GCAGAGCTAC ATCGCCGAGA AGGGGTATCC GCTCGGCACG
CTGGTCGCCT TCTCGGGCGA GGTGACCGAC CCGGAGAGCG GCCCCGACCC GTTCTCCGAA
ACCAGCAAGG CGCTGAACCC GGGGCTGAAG GGCAACATCC GCGAGACCTT CAAGGGAGCC
GGATTCCATG TGCTGATCGT GGCGAACAAA TTCCAGACCG GGTTTGACCA GCCGCTCCTG
TGCGGGATGT ACATCGACCG GCGGCTGGCA GGGATCCAGG CGGTGCAGAC GCTGTCGCGG
CTGAACCGGG CCTACCAGCA GGGCAGCGTG GTGAAGGACA CGACCTATGT GCTCGACTTC
GCCAACAGCG CCGACGACAT CCTGGCGGCC TTCAAGACCT ACTACGAAAC TGCCGAGCTG
CAGGACGTCA CCGACCCGAA CCAGGTGCTG GACCTGCGCG CCAAGCTCGA CGGGCTCGGC
TACTACGACG ATTTCGAGGT CGACCGCGTG GTGAAGGCCG AGCTGAACCC GAAGGCCACC
CACGCCGACC TGTCCGCCGC GATCGAGCCG GTGGCCGACC GCCTGCTGAA AGCCTTCAAG
GCCGCGCGCG ACGTGGAAGC GAAGGCGCGC GCCGAGGGCA ACAACAAGGC GGCGGTGGCG
GCGAAGCAGG AGCAGGACGC GCTGATCGTC TTCCGCTCGA ACATGGGCGC CTTCCTGCGG
CTCTATGCCT TCCTGTCGCA GATCTTCGAC TATGGCGCGA CGGCGATCGA GGCGCGCAGC
CTGTTCTATC GGCGTCTGAT CCCGCTGCTG GACTTCGGCC GCGAGCGTGA GGGTGTCGAC
CTGAGCAAGC TGGTCCTGAC CCACCACAAG CTGTCCACCG TCGGAAAGAA GGACCTGCTC
CTCGGCGGGG CGGCCGAGAA GCTGAAGCCT ATGACCGAAA CCGGCTCGGC CAATGTACAG
GACAAGGAGA AGGCGCTGCT GTCCCAGATC GTGGCGAAGC TAAACGACCT CTTCACCGGC
GACTTGACCG ACAACGACCA GCTCTCCTAC GCGATGACGG TCAAGGGCAA GCTCCTGGAG
AACACGACCC TCGCCCAGCA GGCCGCAAAC AATCACAAGG AGCAGTTCGG CAACTCGCCC
AACCTGATGA ACGCCCTGAT GGACGCGATC ATCGACGCGC TGGACGCGCA TTCAGCGATG
AGCAAGCAGG CCCTCGACTC ACCGGTAATC CAGAAAGGCC TGAAGGACGC CCTCATGGGC
CCCGGTCGCC TCTATGAGGC CCTGCGGGAG CGAGCGGCAT GA
 
Protein sequence
MATHKEIGFE EVICAHLASR GWLYSDGDAQ QYDRTRALLP ADVAAWAETT QPEAWAEAVA 
KHGEAAILDR LRKLLNERGT LEVLRRGIDI VGLKVPLALA QFRPAMGMNP EITARYHANR
LRVVRQVRYS GTNEKCLDLV LFLNGIPVAT AELKTDFTQG VQDAVDQYRF DRKPHEKGQA
PEPLLSFPGG ALVHFAVSDS EVMMTTRLEG PGTRFLPFNK GDHGAKGNPV NPMGHRTAYL
WEEIWARDSW LEILGRYMVG TRDAKKALTG MIFPRFHQLD ATRKLRAAVL EEGPGGKYLI
QHSAGSGKTN SIAWTAHQLA DLHDAADAKV FSSVVVVSDR NVIDTQLQEA IFAFERTTGV
VATITNDSGA KSGQLAEALA GGKKIVVCTI QTFPHALKAV QELAAATGKR FAVIADEAHS
SQTGEAAAKL KQVLSAEELA AVADGGEIGT EDILAAQMAA KASSAGISFV AFTATPKAKT
LELFGRRPDP SAPAGPGNLP APFHVYSMRQ AIEEGFILDV LKNYTPYSLA FKLASDGREL
DDAEVERSSA LKSLMSWVKL HPWNISQKVA IVVEHYRAMV MPLLEGRAKA MVVVESRKEA
VRWQLAMQSY IAEKGYPLGT LVAFSGEVTD PESGPDPFSE TSKALNPGLK GNIRETFKGA
GFHVLIVANK FQTGFDQPLL CGMYIDRRLA GIQAVQTLSR LNRAYQQGSV VKDTTYVLDF
ANSADDILAA FKTYYETAEL QDVTDPNQVL DLRAKLDGLG YYDDFEVDRV VKAELNPKAT
HADLSAAIEP VADRLLKAFK AARDVEAKAR AEGNNKAAVA AKQEQDALIV FRSNMGAFLR
LYAFLSQIFD YGATAIEARS LFYRRLIPLL DFGREREGVD LSKLVLTHHK LSTVGKKDLL
LGGAAEKLKP MTETGSANVQ DKEKALLSQI VAKLNDLFTG DLTDNDQLSY AMTVKGKLLE
NTTLAQQAAN NHKEQFGNSP NLMNALMDAI IDALDAHSAM SKQALDSPVI QKGLKDALMG
PGRLYEALRE RAA
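
The gene sequence and protein sequence above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The following is a minimal Python sketch, not the database's own tooling, that cleans a block-formatted sequence, computes its GC fraction, and translates it. It assumes the sequence is given in the coding direction and contains only A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; this gene starts with ATG, so the sketch does not treat the first codon specially. The names `clean`, `gc_fraction` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# The amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGGCCACCC ACAAGGAAAT CGGCTTCGAG"))  # prints: MATHKEIGFE
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the GC content field.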